This is the second part of the Optimization series. In this blog post, we'll continue discussing the rod-balancing problem and try to solve it using the gradient descent method. In Part 1 we covered what optimization is and solved the same problem using exhaustive search (a gradient-free method). If you haven't read it, here is the link.
Introduction to Optimization and Gradient Descent Algorithm [Part-1].
Gradient-based algorithms are usually much faster than gradient-free methods. The whole objective is to improve over time, i.e., to somehow make each step result in a better solution than the previous one. Gradient descent is one of the best-known gradient-based algorithms. The decision variables used here are continuous, since continuous variables give a well-defined gradient (slope) at any given point on the curve.
So, to solve our problem with gradient descent, we'll reframe our objective function as a minimization problem. To do this we'll make an assumption and define our cost function (also known as a loss function or error function). We will assume that the best solution is one that can balance the rod for at least 10 seconds (let's call this desired value 'y'). At its core, the cost function returns the difference between the actual output and the desired output. For our problem, the cost function becomes:

C(x) = (f(x) − y)²
Note: we squared the difference to avoid negative values; you could also take the absolute value, and either will work.
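As a quick sketch of what this cost looks like in code (a minimal illustration; the names are mine, not from the original post):

```python
# Desired balancing time in seconds — the assumption 'y' from above.
DESIRED_TIME = 10.0

def cost(balanced_time):
    """Squared difference between the measured time f(x) and the
    desired time y; squaring keeps the cost non-negative."""
    return (balanced_time - DESIRED_TIME) ** 2

print(cost(7.5))  # a trial that balanced 7.5 s costs (7.5 - 10)^2 = 6.25
```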

Now for every test result f(x), i.e., the number of seconds the rod stayed on the finger, we can calculate our cost C(x). Our objective function therefore changes to minimizing C(x) instead of maximizing f(x). We can state the modified objective function as:

minimize C(x) = (f(x) − y)²
Since the objective function has changed, our curve is also inverted: on the y-axis, instead of time, we plot cost and try to minimize it.

Any gradient-descent-based algorithm follows a 3-step procedure:
1. Search direction
2. Step size
3. Convergence check
Once we know the error, we have to find the direction in which we should move our finger along the rod for a better solution. The direction is found by taking the derivative of the cost function with respect to the decision variable(s). This simply means calculating the slope (dC/dx) of the curve at the current value of the decision variable; this slope is known as the gradient. The greater the slope, the further we are from the minimum (i.e., the lowest point on the curve).
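Since we cannot differentiate a physical experiment analytically, the slope can be estimated numerically instead. Here is a minimal finite-difference sketch; the function name and the step size h are my own illustrative choices:

```python
def numerical_gradient(cost_fn, x, h=1e-4):
    """Central-difference estimate of the slope dC/dx at position x.
    cost_fn maps a finger position x to its cost C(x)."""
    return (cost_fn(x + h) - cost_fn(x - h)) / (2 * h)
```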

For gradient descent we apply a simple rule:
"If the slope is negative, we increase the decision variable(s), and if the slope is positive, we decrease the decision variable(s) by some amount."
Once we know the direction in which to take our variables for the next step, we update them. The above rule can be expressed mathematically as:

x_new = x_old − dC/dx
But using this update rule directly may overshoot, skipping the minimum and jumping to the other side of the curve. So, instead of reaching the centre of the rod, the variable may jump to the other end and introduce a greater error. To avoid this, we set the step size by multiplying the gradient by a very small value (usually 0.001 or 0.0001), which prevents overshooting since we never take a huge step. This value is known as the learning rate (α), and it unlocks the key principle where gradient descent shines:
“Big steps when away, small steps when closer.”
What the above statement says is that when the slope is large (i.e., the steepness is high), the variables are updated by larger amounts, and when the slope becomes smaller (i.e., the steepness is low, as we approach the bottom), the variables are updated by very small amounts. This is the behaviour we naturally follow in the real world when solving this kind of problem. Our update rule now becomes:

x_new = x_old − α · dC/dx
We repeat this variable update for a certain number of epochs (an epoch is one complete pass over the training examples) until it converges.
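Putting the three steps together, a gradient descent loop for one decision variable might look like the sketch below. The quadratic toy cost stands in for the real rod experiment, which of course cannot be called from code; its minimum at x = 0.5 plays the role of the rod's centre:

```python
def cost(x):
    """Toy stand-in for the rod experiment: minimum at x = 0.5."""
    return (x - 0.5) ** 2

def gradient(x):
    return 2 * (x - 0.5)  # analytic dC/dx of the toy cost

x = 0.0        # initial finger position on the rod
alpha = 0.1    # learning rate
for epoch in range(1000):
    grad = gradient(x)        # 1. search direction
    x -= alpha * grad         # 2. step: big when far away, small when close
    if abs(grad) < 1e-6:      # 3. convergence check
        break

print(f"converged to x = {x:.4f} after {epoch + 1} epochs")
```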
Below is a video of me solving the rod-balancing problem using the gradient descent method. You'll notice how quickly, compared to the exhaustive search we discussed in Part 1, we find the point 'x' on the rod where it balances perfectly.
Solving the rod-balancing problem with the gradient descent method
The graph below shows how the value of x converges to the minimum.

There are different variants of gradient descent; the one we looked at in this post is stochastic gradient descent (SGD). SGD performs the update on the variables after each example/test, just as we did in our problem. There are also other variants, such as:
Batch or mini-batch gradient descent: here, the update is performed after each iteration over 'm' training examples. In batch gradient descent, m is the total number of training examples, whereas in mini-batch gradient descent the training examples are divided into batches of a fixed batch size (usually 32 or 64), and that batch size becomes the m examples used for each update. An epoch is considered complete only when all batches have performed their update. The most common cost function used here is Mean Squared Error (MSE), which is almost the same as the one used in our problem; the difference is that we add up the individual errors of the 'm' examples and divide by 2m. It is given as:

MSE = (1/2m) · Σᵢ (f(xᵢ) − yᵢ)²,  summing over i = 1 … m
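To make the distinction concrete, here is a hedged sketch of a single mini-batch update using the MSE cost above; the linear model and NumPy-based setup are illustrative assumptions, not from the original post:

```python
import numpy as np

def minibatch_step(w, X_batch, y_batch, alpha=0.01):
    """One mini-batch update for a linear model f(x) = X @ w, using the
    MSE cost (1 / 2m) * sum((f(x_i) - y_i)^2); its gradient with
    respect to w is (1 / m) * X^T (X w - y)."""
    m = len(y_batch)
    residual = X_batch @ w - y_batch
    grad = X_batch.T @ residual / m
    return w - alpha * grad
```

An epoch then amounts to calling this step once per batch until every training example has been seen; with m equal to the full dataset size, the same function performs a batch gradient descent step.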
There are also several more advanced gradient descent optimization algorithms; a few important ones are listed below. They can provide better performance, but the underlying concept remains the same.
Refer to this article; it explains the performance differences and how they vary from each other.
An overview of gradient descent optimization algorithms
When selecting an optimization algorithm, the common advice is to pick the most popular one. But that is not always the right call; look at the image below and decide what suits your problem best.

Congratulations! If you have made it this far, you should have a comprehensive understanding of the gradient descent algorithm by now. Let me know if you found this two-part series helpful by leaving a response or giving a clap. I will cover more machine learning foundation topics in the future; to get notified, follow me here.



Introduction to Optimization and Gradient Descent Algorithm [Part-2]. was originally published in Becoming Human: Artificial Intelligence Magazine on Medium, where people are continuing the conversation by highlighting and responding to this story.

Spoiler alert: it's not magic, it's machine learning
An Absurd Challenge
Today I will show you how to obtain churn predictions before your coffee is ready. Put some coffee on the machine or in the French press, so that when you get all those churn predictions you can enjoy going through them with the hot coffee you brewed in the meantime.
A Friendly Introduction 🙂
Let me introduce myself. I am M Ahmed Tayib, working as a data scientist at Gauss Statistical Solutions. I am your friendly neighborhood data scientist who loves coffee and loves an irrelevant challenge like making coffee vs. conducting churn prediction.
Firstly, A definition
Churn is a term/label given to customers who discontinue the services/subscription a company provides. For instance, if a user has not renewed their Spotify subscription for 4 months, then Spotify may consider that user churned.

Similarly, this can be said of any business in this modern era. Every business has churned customers; every business has a few segments of customers, or to be precise ex-customers, who discontinued its services.
Why Is Churn Prediction Important?
A wise guy once said:
“Retaining a customer is always less expensive than acquiring a new one.”
I guess the quote speaks for itself. Of course, you need to acquire new customers to grow, but that does not mean you should lose existing ones and do nothing to retain them.
Solution: once you know which customers are likely to churn, and why, you can take appropriate action to retain them. The problem, however, is that in real life it is much, much harder to know which customers are about to churn, let alone why.
How to Predict Churn?
Now that I have established what churn is and why churn prediction is important, lemme wrap it up with how to actually do it, and do it faster than it has ever been done before.

All you need is your sales and customer/user data. Please follow the steps below:


Voila! That's it. Literally all you need to do is upload the data, and everything else is taken care of.
Go grab your coffee; it should be ready by now. Then we can see what these predictions are and what good they will do.
A Much Needed Explanation
Well, you must be asking: what about feature engineering, model training, model testing, and all those tasks? Lemme clear that up for you. Once you upload your data to the Enhencer platform, here is what it does:
What you see in the first picture is a summary of the whole process: it gives the historical churn rate over time. Then it provides the likelihood of churn for all your customers, so that you know which customers are going to churn in the near future. Lastly, and most important of all, in the second picture it provides segments of customers, showing why the customers in these segments are going to churn.
Pretty neat, huh? All you need to do is look at which customers are highly likely to churn, see the reason behind it, and take immediate, appropriate action. This should be more than enough to help anyone retain their customers and significantly reduce their churn rate.
An Unnecessary Conclusion
You can definitely change the models and tune them later if something is not to your liking, and what's more, they have tons of algorithms as options for you, but that's for the advanced enthusiast users.
It can't get easier than that, and from my experience in the data science field, all of this would have taken days, if not weeks, done the traditional way using R, Python, SQL, etc.
Here is a video showing how to upload your data to Enhencer and obtain churn predictions easily.
Well, go ahead and try it yourself, and thank me later:
Enhencer – The Predictive Storyteller



Churn Prediction in 5 minutes was originally published in Becoming Human: Artificial Intelligence Magazine on Medium, where people are continuing the conversation by highlighting and responding to this story.

An estimated 5 to 25 million tons of plastic are thrown into our oceans every year. Although we know that this trash, devastating for biodiversity and the environment, mainly comes from rivers and beaches, it remains really tough to catch it all, especially in some poor and neglected countries.
Based on this observation, we decided to explore a solution that uses deep learning to detect rubbish with a camera.
Is it possible to build a trash detector that would be the first step towards cleaning up our rivers?
Such an issue is the kind we want to address at Picsell.ia.
The code for this experiment is available here.
The TACO trash dataset, available on the Dataset Hub, seemed really well suited to start our project. This dataset contains 1,500 pictures of everyday trash, which we tried to annotate accurately.

As you can see, the dataset isn't really well balanced and doesn't have a lot of annotated objects, but that's a first try; we will let you contribute to this dataset in the next few weeks!

We fine-tuned a Faster R-CNN model pre-trained on COCO (a model already available on the Model Hub), and within 2 hours our "trash detector" was ready to be tested on the playground with our own data.
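For readers curious what such a fine-tuning looks like in raw code, here is a minimal torchvision sketch. This is an illustrative approximation, not Picsell.ia's actual training pipeline, and the class count is a placeholder:

```python
import torchvision
from torchvision.models.detection.faster_rcnn import FastRCNNPredictor

# Load a Faster R-CNN pre-trained on COCO.
model = torchvision.models.detection.fasterrcnn_resnet50_fpn(pretrained=True)

# Swap the box-predictor head for one sized to our trash categories.
NUM_CLASSES = 11  # placeholder: 10 trash classes + 1 background
in_features = model.roi_heads.box_predictor.cls_score.in_features
model.roi_heads.box_predictor = FastRCNNPredictor(in_features, NUM_CLASSES)

# During training, model(images, targets) returns a dict of losses,
# which you sum and backpropagate as usual.
```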
Let's have a look at the playground:

Here, the bottles are well detected in this picture, but to be honest, our model is not as good on every image.
The reasons are, of course, the lack of training data and the fact that we haven't optimized our network yet. This leaves a lot of room for improvement.
Although this project has proved itself, it can hardly be used for the moment. Our final goal is to embed the model on an edge device (e.g., an NVIDIA Jetson) and run it in real time. But for that, the model has to be far more accurate than it is now, and also suited to near-real-time inference.
But how do we improve the model?
This article is the first part of a series; here we have just built a "prototype" of our algorithm. The next parts will be dedicated to the different ideas we would like to implement, which are the following:
Finally, if we succeed in setting up this trash detector, the next goal will be to find a way to pull the detected trash out of the water. That is outside our field, but we are sure that solutions will be found, such as the floating devices that already exist.
This is a long-term project that will need a lot of iterations, but that's what we want to facilitate at Picsell.ia!
Do not hesitate to come along and help us build the biggest open computer vision hub, and share this with your relatives. Also, please come and ask for help if you need it; it's always a pleasure to guide you 😉



Trying to contribute to the fight against waste pollution with Computer Vision — Part I was originally published in Becoming Human: Artificial Intelligence Magazine on Medium, where people are continuing the conversation by highlighting and responding to this story.

Voice AI sits at the intersection of speech analytics and quality management, using cutting-edge speech technology and natural language processing to transcribe and analyze support calls at massive scale. It enables organizations to analyze 100% of customer conversations, with the ultimate goal of improving agent performance and the overall customer experience.
With Voice AI, key moments in conversation can be unearthed to provide a more accurate picture of how contact centers as a whole, and the individual agents staffing them, are performing across key metrics. Analytics on interactions such as sentiment, emotion, dead air, hold times, supervisor escalations, redaction, and more are often game-changing for businesses that previously had low QA coverage, and Voice AI is the key to identifying them.
Once transcribed and analyzed, Voice AI automatically scores some parts of conversations and enables organizations to create tailored coaching programs for agents.

Voice AI emerged as a result of the inefficiencies of highly manual, traditional quality management (QM) programs. Organizations struggled to fully understand performance, monitor mission-critical KPIs and compliance, and better enable their agents with relevant training.
Voice AI, built around Analytics-enabled Quality Management, radically transforms an organization’s quality programs in a number of ways:

“Success for our team means bringing out the best in each agent. We’re able to do that by throwing out the one-size-fits-all coaching approach and tailoring conversations on an individual basis. Voice AI helps ensure you’re an optimized leader by identifying and addressing the right gaps.”
– Kyle Kizer, Compliance Manager at Root Insurance
Voice AI provides a wide variety of benefits that improve processes across a contact center. Next, we'll dig into some real-world use cases of how Voice AI and quality automation are used today.

Regulatory compliance is paramount across all industries, most notably financial, insurance, and healthcare. It ensures the protection of customer data, backed by strict legislation to enforce it. As a result, monitoring mandatory compliance dialogues and categorizing voice calls relevant to specific compliance regulations is mission-critical.
Examples
Measurable KPIs

The beginning of a conversation is important from both a customer experience and a compliance standpoint. The end of a conversation is also important for the customer experience, and it is also an opportunity to confirm how the call went and to establish next steps.
Examples
Measurable KPIs

Supervisor escalations are a strong indicator of a negative customer experience, a metric for agent call-handling, or an organizational inefficiency. Escalations in any contact center are costly due to the amount of time and resources required to resolve them.
Examples
Measurable KPIs
Customer sentiment analysis is an indicator of how people feel about a brand, its products, and its service. Simple sentiment analysis is determined based on words alone (what’s being said), while advanced sentiment analysis (tonality-based) considers tone and volume as well (what, how, and why it’s said).
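As a toy illustration of the word-based ("simple") approach, here is an assumption-laden sketch, not any vendor's actual implementation; real systems use far larger lexicons and, in the advanced case, acoustic features like tone and volume:

```python
# Toy lexicons — placeholders for illustration only.
POSITIVE = {"great", "thanks", "happy", "resolved", "perfect"}
NEGATIVE = {"frustrated", "cancel", "angry", "unacceptable", "waiting"}

def simple_sentiment(utterance: str) -> int:
    """Word-based score: positive word count minus negative word count."""
    words = [w.strip(".,!?") for w in utterance.lower().split()]
    return sum(w in POSITIVE for w in words) - sum(w in NEGATIVE for w in words)

print(simple_sentiment("I am frustrated, I have been waiting an hour"))  # -2
```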
Examples
Measurable KPIs

Voice AI is transforming the contact center as we know it, uncovering deep insights across every single voice call that takes place, and providing the data needed to drive more targeted training programs for agents.
“What’s exciting about Voice AI is that we can change the way we’re coaching and re-write our quality cards. We can move away from check-boxes and focus on real skill development. Using Voice AI helps us change behavior faster.”
– Dale Sturgill, VP Call Center Operations, EmployBridge



What is Voice AI? Benefits and Use Cases for Transforming Quality Management was originally published in Becoming Human: Artificial Intelligence Magazine on Medium, where people are continuing the conversation by highlighting and responding to this story.