ML Model Evaluation: Insights Into Our Machine Learning Process
If you’re reading this, you’ve landed straight in our blog post series about machine learning projects. We know that the implementation of these projects is still a big mystery for many of our customers. Therefore, we explain the phases of AI projects in a series of articles.
AI projects are usually carried out in a cyclic process. Our previous article dealt with the first two important phases of the cycle, data collection and data preparation. Today, we’re going to dive into the topic of model evaluation, which is a crucial part of phase 3 in our life cycle. Since this is a particularly complex topic, we will dedicate an entire article to it.

Let’s take a look at our example task again, so that we can figure out the process of selecting and evaluating an appropriate model. Our example project is about an umbrella federation for dance that uses a Digital Asset Management (DAM) system as a central hub for images. The DAM constantly receives images from all federation members. The dance federation uses these images as marketing collateral, but it does not want to use all of them; it only wants to use the ones presenting the work of its members in a visually appealing way. The goal for the dance federation is to find aesthetic images. Consequently, our task here is to classify the images as aesthetic or unaesthetic.

Determine Classification Type
What many of our customers do not know: There are various types of classification tasks — from simple object classification to localization and complex classification on pixel level. The individual use case determines which classification type is right for a task:
- Classification of the whole image: Classify the image into one or multiple label classes. The labels may describe objects or other aspects such as aesthetics, color mood, or technical quality.
- Classification with localization: Classify the image into one or multiple label classes and give a rough position for each class. This is useful, for example, for placing shop links on product images.
- Object detection: Classification and localization of one or more objects in an image. The objects and their exact positions (as bounding boxes) are predicted.
- Semantic segmentation: Each pixel of an image is labeled with a corresponding class, for example a medical image in which each pixel is labeled as either healthy or diseased tissue.
- Instance segmentation: Classification at pixel level that recognizes each instance of a class. That means if we have an image with several shoes, each shoe is identified as a separate instance of the class “shoe”.

Choose the Right Model Complexity
In our example task we simply want to classify an entire image as aesthetic or unaesthetic. Convolutional Neural Networks (CNNs) are great for image classification tasks, and we can choose from different models such as InceptionNet, VGG, or ResNet. We found ResNet to be a good base architecture, since it can easily be extended to suit different levels of task complexity.
The challenge is to find out which complexity is right for a given task and training data. A model that is too shallow will underfit the problem: it is simply not complex enough to solve the task. A model that is too deep needs a lot of hardware resources and training time, and it tends to overfit. An overfitting model memorizes the training data instead of generalizing the underlying concepts, which results in poor classifications for unseen images.
In practice, we start with a simple model and then gradually increase its complexity until the model starts to show signs of overfitting. We then look at the results (loss and accuracy on the training and validation data during training) and choose the level of complexity that slightly overfits the training data. We stick with this model and take care of the overfitting in the training stage by applying regularization techniques. The more experience you have, the faster you will be able to find the right model complexity.
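As a rough illustration of this “start shallow, then go deeper” approach, here is a minimal sketch that builds a binary aesthetic/unaesthetic classifier on top of a ResNet backbone of selectable depth. It assumes PyTorch and torchvision are available; the function name and the particular depths are illustrative, not part of the original project.

# A minimal sketch, assuming PyTorch/torchvision (not specified in the original post).
import torch.nn as nn
from torchvision import models

def build_classifier(depth="resnet18", num_classes=2):
    backbones = {"resnet18": models.resnet18,
                 "resnet34": models.resnet34,
                 "resnet50": models.resnet50}
    model = backbones[depth]()                                # start shallow, deepen only if it underfits
    model.fc = nn.Linear(model.fc.in_features, num_classes)   # aesthetic vs. unaesthetic head
    return model

model = build_classifier("resnet18")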
Splitting the Data
Our example dataset consists of 7,400 images, half of which are aesthetic images and the other half unaesthetic images. We split the dataset randomly into three parts: 75% training, 10% validation and 15% test data.

The training data is used to train the model. The validation data is used to assess the performance of the model in each epoch of the training phase. The test data is not used during training but afterwards to check how well the model can generalize and perform on unseen data.
We always want to use as much data as possible for training, because more data means a better model. But we also need to know how well the model generalizes. Basically, the more samples we have, the more we can use as training data. If we have millions of samples, the proportion of validation and test data can be smaller and still be meaningful, for example with a split of 95%, 2.5%, and 2.5%.
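A minimal sketch of such a three-way split, assuming the images are referenced by file paths with binary labels and that scikit-learn is available (neither is stated in the original post); train_test_split is called twice because it only produces two parts at a time.

from sklearn.model_selection import train_test_split

def split_dataset(paths, labels, seed=42):
    # First carve off the 15% test set (stratified so the classes stay balanced).
    train_val_x, test_x, train_val_y, test_y = train_test_split(
        paths, labels, test_size=0.15, stratify=labels, random_state=seed)
    # Then take 10% of the original data as validation: 0.10 / 0.85 of the remainder.
    train_x, val_x, train_y, val_y = train_test_split(
        train_val_x, train_val_y, test_size=0.10 / 0.85,
        stratify=train_val_y, random_state=seed)
    return (train_x, train_y), (val_x, val_y), (test_x, test_y)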
Metrics to Assess Model Performance
What will be the metrics to evaluate the training success of our model? The most essential and popular metrics are accuracy and precision/recall.
Accuracy is given by the number of correctly classified examples divided by the total number of classified examples. Let’s say we have 1k test images. If the model predicts 863 images correctly, then our model accuracy would be 86.3%.
In situations in which different classes have different importance, or in which we have an unbalanced dataset, we also use precision and recall as metrics. For example, a legitimate message should not be classified as spam (high precision), but we would tolerate some spam in the inbox (lower recall). In those cases we use metrics that tell us how reliable the model is when it assigns a sample to a class (precision) and how well it finds all available samples of a given class (recall). In practice there is usually a trade-off between the two: we have to decide whether high precision or high recall matters more, since it is rarely possible to maximize both.
However, since nerds want to keep things “simple”, we want to express the model performance in a single number. To do so, we can combine precision and recall into one metric by taking their harmonic mean: F1 = 2 · precision · recall / (precision + recall). The result is called the F1 score.
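As a small sketch of these metrics (using scikit-learn, which the post does not mention, and toy label arrays chosen only for illustration):

from sklearn.metrics import accuracy_score, precision_score, recall_score, f1_score

y_true = [1, 0, 1, 1, 0, 1]   # 1 = aesthetic, 0 = unaesthetic (toy labels)
y_pred = [1, 0, 0, 1, 0, 1]   # hypothetical model predictions

print(accuracy_score(y_true, y_pred))    # correct / total
print(precision_score(y_true, y_pred))   # TP / (TP + FP)
print(recall_score(y_true, y_pred))      # TP / (TP + FN)
print(f1_score(y_true, y_pred))          # harmonic mean of precision and recall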
Summary
What we’ve talked about today is the work we have to do in between data collection/data preparation and the actual model training. We have defined the classification type for our task, chosen the right initial model architecture, divided the dataset into subsets for model training, and defined the metrics we will apply to our chosen model. In the next part of our blog post series we will continue with the actual model training. Stay tuned!
And don’t forget: If you need an experienced helping hand for your own machine learning project, drop us a line.



ML Model Evaluation: Insights Into Our Machine Learning Process was originally published in Becoming Human: Artificial Intelligence Magazine on Medium.
Is Your Data Strategy AI-Ready?

As companies now have vast amounts of data at their fingertips and AI continues to offer new business opportunities, data strategy becomes a fundamental part of maximizing ROI from AI initiatives. How exactly is data captured? How is it processed? What is the end goal of collecting and processing it? These are just a fraction of the important questions AI developers need to answer to succeed with their AI technology implementation.
Choosing the Use Case
Given that AI offers a myriad of opportunities for driving business growth, companies may be unsure where to start. Unsurprisingly, organizations often dive into AI implementation haphazardly because of the hype surrounding the technology. Most importantly, though, an AI use case should tie in with a specific business objective.
However, the application area largely defines the implementation effort needed for successful AI adoption. For example, deploying AI to enhance product development usually calls for structural changes, revamping of business workflows, and extensive data preparation. At the same time, augmenting customer service with chatbots requires far less hassle in terms of data preparation.

Building a Data-Driven Culture
Once you have defined the use case, developing a data-centric culture should become a priority. Far too often, organizations get heavily invested in all the technicalities of AI implementation, leaving their workforce disincentivized and underprepared for taking advantage of the new tools. For AI adoption to succeed, it’s crucial to ensure that the workforce is ready both psychologically and technically.
This is usually done by introducing a series of training sessions dedicated to data literacy. It’s important to stress how exactly the new AI tools will enhance current workflows and help achieve better performance. Workforce training usually takes a considerable amount of time, especially when carried out in large enterprises. This is why it’s often crucial to start upskilling initiatives as early as possible in the AI adoption cycle. By creating functional prototypes with all the essential features, you can start retraining well before the actual AI deployment.
Addressing Data Quality
When it comes to assessing data readiness, data quality is the place to start. There are many attributes of data quality to consider, including accuracy, completeness, consistency, validity, integrity, and lack of bias.
Far too often, companies put an excessive amount of resources into data cleaning. While this is certainly understandable, what really matters is determining which exact datasets need to be scrubbed clean. Even though companies face a continuous influx of data, most of the time only a fraction of this data is useful for a particular business case. This is why it’s important to figure out the appropriate strategy and determine a specific AI application before diving into data cleaning.
Ideally, companies need to establish practical data storage and transfer frameworks, meaning that only relevant data should be systematically collected and processed. This way, an AI strategy will become more economically efficient.
It’s also important to route all the incoming data into a single data management hub, be it cloud-based or locally stored. The idea is to make data easily accessible to all important actors, including business analysts, stakeholders, and clients. Moreover, by establishing solid data architecture, it becomes easier to adopt other AI use cases as your business grows.
Cloud or On-Premises?
Migrating to the cloud is one of the most critical steps on the path to AI adoption. The cloud allows organizations to scale, gain much-needed flexibility, and significantly decrease costs. While this step is rarely needed for short-term AI success, it’s one of the main enablers of going all in on AI in the long term.
Most importantly, AI needs to process huge amounts of data in real time to operate at scale. When data is scattered across different organizational systems such as corporate email, CRM, or invoicing, it becomes hard for AI to process. The cloud solves these problems by making it possible to store and process data in a single place.
In most cases, the trick is to store data on multiple cloud platforms rather than one. Cloud platforms can differ drastically from each other in terms of their functionality. By going multi-cloud from the start, companies can save themselves the trouble of relying on any single cloud vendor.
Ethical Data Usage
Responsible data aggregation and processing should be at the core of your data governance structure. As users become increasingly concerned about their privacy, organizations need to become more transparent about data usage and reevaluate their ethical data use policies when it comes to AI operations.
Logically, the more data AI has access to, the better decisions it can make. This is why it’s often tempting for an organization to stockpile every bit of data available. However, such an approach poses significant discrimination risks that can affect long-term AI success.
For example, an AI system might use a customer’s inferred gender to decline a loan or a job application, increase a price, or make a totally irrelevant product offer. Moreover, when there are too many data points, it becomes harder for data scientists to explain the rationale behind AI decision-making. Not only is this unethical, it is also legally risky, as common regulatory frameworks like the GDPR require companies to be able to clearly explain the reasoning behind AI-made decisions.
Conclusion
A robust and scalable AI strategy always rests on a solid data foundation. Here are a few steps every company should take to make it happen:
- Define what datasets you need for your particular AI use case.
- Based on the business use case, decide what datasets need to be cleaned and establish a solid data governance structure.
- Turn to the cloud to support more scalability.
- Address data ethics issues and constantly monitor how exactly AI tools process their underlying data.



Is Your Data Strategy AI-Ready? was originally published in Becoming Human: Artificial Intelligence Magazine on Medium.
Geometric Deep learning with Graph Neural Network

First of all, what are Euclidean space and non-Euclidean space?

Euclidean Space

Euclidean space covers flat geometry in one, two, and up to N dimensions.
Whenever a mathematician creates a formula in geometry, it has to be proven to work in all of these dimensions.

Now let’s get into non-Euclidean space.
Here the flatness of Euclidean space is gone: non-Euclidean geometry is about curved spaces, such as hyperbolic or sphere-like spaces. Put simply, Euclidean geometry deals with flat surfaces, while non-Euclidean geometry deals with curved ones.
In many cases this is much more useful than Euclidean space.
Neural networks perform well on Euclidean data, for example text, audio, and images.

But what about non-Euclidean data, for example graphs/networks, manifolds, and similarly complex structures?

For this kind of non-Euclidean data, geometric deep learning comes in. Bronstein et al. first introduced the term Geometric Deep Learning (GDL) in their 2017 paper “Geometric deep learning: going beyond Euclidean data”.
The field applies deep learning methods such as CNNs to graphs, 3D models, and other non-Euclidean domains.
Now let’s look at one of its subdomains, graphs, and jump right in.
Graphs
First, let’s briefly brush up on the basics.
A graph is simply G = (V, E), where V is the set of vertices (nodes) and E is the set of edges.
Important types of graphs

- Undirected graph — the edges have no direction.
- Directed graph — the edges have a direction.
- Weighted graph — each edge has a numerical value called its weight.
Weights/labels can be numeric or strings.
Nodes/vertices can also have features; for example, node A can have features describing its properties. (Weights and features are not the same thing.)
A computer doesn’t understand a graph directly, so what do we do? We can represent the graph as a matrix.

We know that any graph can be represented as a matrix (N × M), where N is the number of nodes and M is the number of edges of the graph.

Adjacency matrix

Properties
- It is a square matrix (N × N).
- The values are either 0 or 1.
- Without self-loops, the diagonal entries are 0.
- Each entry records whether one node is adjacent to another: 1 if the two nodes are connected, 0 if they are not.
- The adjacency matrix can also be built for a weighted graph; in that case we fill in the edge weights instead of 0 and 1.

Put simply, it collects the neighborhood of each node.
Degree Matrix

- It is a diagonal matrix that simply counts how many edges are connected to each node (a self-loop counts as 2).
Laplacian matrix
Laplacian = D - A
where:
D - degree matrix
A - adjacency matrix

- It tells us how smooth the graph is; more precisely, it is used to measure the smoothness of a signal over the vertices (how much a vertex differs from its neighbors).
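As a small illustration of these three matrices, here is a NumPy sketch for a toy undirected graph with four nodes and edges 0–1, 0–2, 1–2, 2–3 (the graph itself is made up for the example):

import numpy as np

A = np.array([[0, 1, 1, 0],   # adjacency matrix: A[i, j] = 1 if nodes i and j share an edge
              [1, 0, 1, 0],
              [1, 1, 0, 1],
              [0, 0, 1, 0]])

D = np.diag(A.sum(axis=1))    # degree matrix: how many edges touch each node
L = D - A                     # Laplacian matrix
print(L)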
The architecture of Graph Neural Network

We know that neural networks and CNNs expect inputs of a fixed size, but graphs are not fixed in size, so we need new approaches.
Let’s start with CNNs: a CNN works in three steps.

- Locality — the kernel/filter slides over a fixed grid of pixels, from the beginning to the end of the image.
- Aggregation — the values of that grid are multiplied by the kernel (a mask) and summed up.
- Composition — functions of functions (the hidden-layer computation f(f(x))…).
We can use the same methodology for GNNs. Here, locality refers to the neighborhood (how a node is connected to other nodes, i.e. localizing the nodes), aggregation describes how the neighbors contribute to their corresponding node via weights, and composition means stacking layers, passing the result on to more layers.
Node Embedding
Aim: similarity(u, v) = (z_u)^T (z_v), where z_u and z_v are the embeddings of nodes u and v.
Node embedding converts each node into a d-dimensional vector, where the dimension of the embedding space is smaller than that of the original network.
The encoding function ENC() maps the nodes into this d-dimensional space without changing the distance (similarity) between nodes u and v.
Consider that u has the feature vector x and v has the feature vector y. Then (z_u)^T (z_v) = (x)^T (y).
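A tiny sketch of this idea, where the encoder is simply a lookup into a random, purely illustrative embedding matrix Z and similarity is the dot product of two embeddings:

import numpy as np

num_nodes, d = 5, 8                 # 5 nodes embedded into d = 8 dimensions (toy values)
Z = np.random.randn(num_nodes, d)   # one d-dimensional embedding vector per node

def encode(node_id):
    return Z[node_id]               # ENC(): node id -> embedding vector

def similarity(u, v):
    return encode(u) @ encode(v)    # (z_u)^T (z_v)

print(similarity(0, 1))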
Locality (neighbors)

We have to create a computational graph for the target node. A’s neighborhood consists of B, C, and D, so first B, C, and D are connected towards A; then we connect the neighbors of those neighbors (B is connected to A and C), and so on.

Here this acts as an encoder function: going from node A to Z^x is actually an encoding, where Z^x is the feature vector of A.
All of the nodes have their own feature vectors as well.
Aggregate (neighbors)

To get B we sum up A and C; likewise, for A we sum up B, C, and D. The aggregation is permutation invariant, which means (A + B) gives the same result as (B + A).
Above, we took A as the target node, but we then have to take every node as the target node in turn. Next, B becomes the target node, which gives us a different computational graph, and the same goes for the other nodes.
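As a small sketch of this aggregation step (the feature vectors and neighbor lists are toy values chosen only for illustration):

import numpy as np

features = {"A": np.array([1.0, 0.0]),
            "B": np.array([0.0, 1.0]),
            "C": np.array([1.0, 1.0]),
            "D": np.array([0.5, 0.5])}
neighbors = {"A": ["B", "C", "D"], "B": ["A", "C"], "C": ["A", "B"], "D": ["A"]}

def aggregate(node):
    # Summation is permutation invariant: the order of the neighbors does not matter.
    return np.sum([features[n] for n in neighbors[node]], axis=0)

print(aggregate("A"))   # B + C + D
print(aggregate("B"))   # A + C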
Forward propagation
We already know how forward propagation works in a standard neural network.

But how do Graph Convolutional Networks work? This is where spectral GCNs come in. Spectral GCNs make use of the eigendecomposition of the graph Laplacian matrix to implement this method of information propagation (see the GCN paper by Kipf and Welling).

where A is the adjacency matrix and A* is the normalized version of A. To account for self-loops, we add the identity matrix to A.
Spectral graph convolution works as a message-passing network, embedding each node’s neighborhood information along with the node itself.
For training a GCN we need three elements:
- Adjacency matrix — to learn feature representations based on node connectivity.
- Node attributes — the input features.
- Edge attributes — the edge connectivity data.
Consider X as the input feature matrix, A as the adjacency matrix, and D as the degree matrix.
The dot product of the adjacency matrix and the node feature matrix represents, for each node, the sum of its neighbors’ features:
AX = np.dot(A, X)   # row i of AX is the sum of the feature vectors of node i's neighbors
Normalizing A can be done by taking the dot product of the inverse of the degree matrix with AX, but in their paper Kipf and Welling suggest using the symmetric normalization instead.

# Symmetric normalization: D^(-1/2) . A_hat . D^(-1/2) . X, where A_hat = A + I (self-loops)
from scipy.linalg import fractional_matrix_power

D_half_norm = fractional_matrix_power(D, -0.5)           # D^(-1/2)
DADX = D_half_norm.dot(A_hat).dot(D_half_norm).dot(X)    # normalized propagation
Note that so far there is no backpropagation involved: we simply send the adjacency matrix and the input features through the function, and only forward propagation happens. Each node is expanded into its computational graph, and the forward propagation formula changes only a little; we also leave out the bias b for simplicity’s sake. The real difficulty with graph neural networks is data preparation: we need the edge connectivity data, the input features, and the adjacency matrix. The adjacency matrix is easy to create with one line of code, but the node feature vectors and the edge data need to be prepared carefully.
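Putting the pieces together, here is a hedged sketch of a single GCN layer in plain NumPy; the toy graph, features, and weight matrix W are random and purely illustrative, and a real model would learn W via backpropagation.

import numpy as np
from scipy.linalg import fractional_matrix_power

A = np.array([[0, 1, 1, 0],          # toy adjacency matrix (4 nodes)
              [1, 0, 1, 0],
              [1, 1, 0, 1],
              [0, 0, 1, 0]])
X = np.random.randn(4, 3)            # node features: 4 nodes, 3 features each
W = np.random.randn(3, 2)            # layer weights: 3 -> 2 dimensions

A_hat = A + np.eye(A.shape[0])       # add self-loops
D_hat = np.diag(A_hat.sum(axis=1))   # degree matrix of A_hat
D_inv_sqrt = fractional_matrix_power(D_hat, -0.5)

# One GCN layer: H = ReLU(D^(-1/2) . A_hat . D^(-1/2) . X . W)
H = np.maximum(0, D_inv_sqrt @ A_hat @ D_inv_sqrt @ X @ W)
print(H.shape)                       # (4, 2): a new 2-dimensional embedding per node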
For example, let us take the Cora dataset:
The Cora dataset consists of 2708 scientific publications classified into one of seven classes. Each publication in the dataset is described by a 0/1-valued word vector indicating the absence/presence of the corresponding word from the dictionary. The dictionary consists of 1433 unique words.
- Nodes = Publications (Papers, Books …)
- Edges = Citations
- Node Features = word vectors
- 7 Labels = Publication type e.g. Neural_Networks, Rule_Learning, Reinforcement_Learning, Probabilistic_Methods…
Number of graphs: 1
Number of features: 1433
Number of classes: 7
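These statistics match the Cora split that ships with PyTorch Geometric; here is a hedged sketch of loading it (assuming torch and torch_geometric are installed, which the post does not state explicitly):

from torch_geometric.datasets import Planetoid

dataset = Planetoid(root="data/Cora", name="Cora")
data = dataset[0]                        # the single citation graph

print(len(dataset))                      # 1 graph
print(dataset.num_node_features)         # 1433 bag-of-words features per node
print(dataset.num_classes)               # 7 publication classes
print(data.num_nodes, data.num_edges)    # 2708 publications and their citation edges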
Code for building this model.



Geometric Deep learning with Graph Neural Network was originally published in Becoming Human: Artificial Intelligence Magazine on Medium.
