365 Data Science

How to land an Amazon ML Engineer Interview and mistakes to avoid

So, are you confused by the title? Of course I got an interview for amazon ML Engineer position and I have failed because of my mistakes. I want to share my experience with you, so that you don’t make mistakes I have made.

I have given interview to amazon for ML Engineer position in January 2021. I am sharing my experience from how I got the interview and how not to bomb it.

So, in November 2020, amazon hosted a hackathon on Hacker-Earth website to hire ML Engineers in India. Naturally as a data science learner I have participated in the Hackathon.

In round 1 of the hackathon we were asked to built a classifier for an ecommerce website to know whether to target a person or not based on the historical data available for the ecommerce website. We were also told to make a presentation for the same.

Data Cleaning

I have loaded the train and test data and checked for null values.

I have filled the null values in columns having object data type by mode values and numeric data types by their median values.

Later on I have to eliminate the numerical features which were highly correlated with each other, so that the model will have lower the variance of the weights. I have retained features having correlation coefficients between -0.5 and 0.5.

Removing outliers:

I found that customer_affinity_score column has outliers. Removing outliers by keeping data points between IQR range was resulting in removing some of the unique categorical data points and was giving problems while doing One Hot Encoding. Hence I have retained the data points with outliers having customer_affinity_score less than 125. I have done One Hot Encoding for the object dtype columns to convert them to numeric data type.

At this point data was cleaned and I have to decide which features to keep for further analysis. I have created an OLS model to know the p values of the columns to find out which columns are important. Since, all p values were less than 0.05 that is in the range 0f 95% confidence interval, I have retained all the columns.

Model creation

Now it was the time to create the model. I have tried Random Forest, XG Boost and SVM models for this particular problem. Since, random state at the time of splitting the data gives different train and test data, I experimented with it. I have looped the data from 0 to 99 random state and calculated the score for each model with different split by random state. I chose the model with highest score for the evaluation of each SVM classifier, Random Forest Classifier and XG Boost Classifier. I chose SVM Classifier as it was giving me a generalized model compared to Random Forest Classifier and XG Boost Classifier.

I found that random state 67 giving best score on test data set. Also the model is generalized model as it is giving more score on validation dataset than the train dataset.

Since, the evaluation metric for this hackathon was precision and I was getting 97% precision on validation data, I have moved forward with this model.

Finally I have made prediction on test data and saved it as final submission.

Prediction

So, I have prepared the notebook and presentation for the hackathon and submitted it for evaluation. I have got 93.7% precision on their actual test data.

Round 1 was cleared, round 2 was a coding exercise, I have cleared that as well. Now, I was so happy after hearing that I have cleared the initial rounds, when the recruiter asked, if is it fine to schedule interview next day? I immediately told yes and that was my biggest mistake.

It was the interview of world’s most customer centric company which promises to deliver the customer anything they wanted on their online platform and I have not thought of how am I going to prepare for the interview in one day.

The result was, I failed to crack the amazon interview. I learnt from my this mistake that impatience is the enemy of success. I wanted to share my experience with you all so that you don’t repeat the mistake I have done.

After all patience is bitter but the fruit is sweet!

GitHub link for the repository:- https://github.com/pratikskarnik/Amazon_ML_Engineer_Hiring_Challenge_2020

Don’t forget to give us your ? !

How to land an Amazon ML Engineer Interview and mistakes to avoid was originally published in Becoming Human: Artificial Intelligence Magazine on Medium, where people are continuing the conversation by highlighting and responding to this story.

Via https://becominghuman.ai/how-to-land-an-amazon-ml-engineer-interview-and-mistakes-to-avoid-1f5b87d22702?source=rss—-5e5bef33608a—4

source https://365datascience.weebly.com/the-best-data-science-blog-2020/how-to-land-an-amazon-ml-engineer-interview-and-mistakes-to-avoid

Machine learning adversarial attacks are a ticking time bomb

Software developers and cyber security experts have long fought the good fight against vulnerabilities in code to defend against hackers. A new, subtle approach to maliciously targeting machine learning models has been a recent hot topic in research, but its statistical nature makes it difficult to find and patch these so-called adversarial attacks. Such threats in the real-world are becoming imminent as the adoption of machine learning spreads, and a systematic defense must be implemented.

Originally from KDnuggets https://ift.tt/2MezULS

source https://365datascience.weebly.com/the-best-data-science-blog-2020/machine-learning-adversarial-attacks-are-a-ticking-time-bomb

What is Graph Theory and Why Should You Care?

Go from graph theory to path optimization.

Originally from KDnuggets https://ift.tt/3pvaOa0

source https://365datascience.weebly.com/the-best-data-science-blog-2020/what-is-graph-theory-and-why-should-you-care

Top 5 Reasons Why Machine Learning Projects Fail

The rise in machine learning project implementation is coming, as is the the number of failures, due to several implementation and maintenance challenges. The first step of closing this gap lies in understanding the reasons for the failure.

Originally from KDnuggets https://ift.tt/2KZSRRW

source https://365datascience.weebly.com/the-best-data-science-blog-2020/top-5-reasons-why-machine-learning-projects-fail

Top 5 AI implementation challenges and how your company could overcome them

Although corporate spending on artificial intelligence topped $50 billion last year, just 11% of companies that enhanced their workflows with AI have already seen a significant return on their investments. In this article, we’ll investigate business, technological, and ethical issues haunting AI projects — and provide several tips to seamlessly integrate Artificial Intelligence into your company’s Digital Transformation strategy.

A rundown of AI implementation challenges

Hitting Technology Roadblocks

Although AI has been around since the mid-50s, voice assistants, face swap apps, and robot dogs only became mainstream a couple of years ago. As of now, neither businesses nor their technology partners have a tried-and-true formula for creating and implementing artificial intelligence solutions. Some of the common AI pitfalls include:

Poor architecture choices

Making accurate predictions is not the only thing you should expect from an AI system. In multi-tenant applications (think AIaaS solutions serving thousands of users), performance, scalability, and effortless management are equally important. So you cannot expect your vendor to just write a Flask service, wrap it in a Docker container, and deploy your ML model. The approach might work for a certain number of users; once the system hits its limits, you’ll get an elephantine application that is also expensive to operate.

SNIPPETS

Inaccurate or insufficient training data

AI-based systems are only as good as the data they’ve been fed on. In some cases, companies struggle to provide quality data (and a substantial volume thereof!) to train AI algorithms. The situation is not uncommon in healthcare, where patient data like X-ray images and CT scans is hard to obtain due to privacy reasons. To increase the amount of training data and build a better model, it is sometimes necessary to manually label data using annotation tools like Supervise.ly. According to Gartner, data-related AI problems is the #1 reason why 85% of artificial intelligence projects will deliver erroneous results through 2022.

Lack of AI explainability

Explainable artificial intelligence (XAI) is a concept that revolves around providing enough data to clarify how AI systems come to their decisions. Powered by white-box algorithms, XAI-compliant solutions deliver results that can be interpreted by both developers and subject matter experts. Ensuring AI explainability is critical across a variety of industries where smart systems are used. For example, a person operating injection molding machines at a plastic factory should be able to comprehend why the novel predictive maintenance system recommends running the machine in a certain way — and reverse bad decisions. Compared to black-box models like neural networks and complicated ensembles, however, white-box AI models may lack accuracy and predictive capacity, which somewhat undermines the whole notion of artificial intelligence.

Replicating lab results in real-life situations

An AI-based breast cancer scanning system created by Google Health and Imperial College London reportedly delivers fewer false-positive results than two certified radiologists. In 2017, Oxford and Google DeepMind scientists developed a deep neural network that reads people’s lips with 93% accuracy (compared to just 52% scored by humans). And now there’s evidence that machine learning models can accurately detect COVID-19 in asymptomatic patients based on a cellphone-recorded cough! When fueled by powerful hardware and a wealth of training data, AI algorithms can perform a wide range of tasks on a par with humans specialists — and even outmatch them.

The problem with AI is, most companies fail to replicate the results achieved by Google, Microsoft, and MIT — or the accuracy displayed by their own AI prototypes — outside the laboratory walls.

The solution to this daunting AI problem partially lies in tech giants’ willingness to share complete research findings and source code with fellow scientists and AI developers. On a company level, it is crucial to analyze how smart algorithms will perform when faced with unfamiliar or poorly structured data and devise mechanisms to support the functioning of AI-powered applications under heavy load.

Scaling Artificial Intelligence

According to Gartner, only 53% of AI projects make it from prototypes to production, which means most companies lack the technical talent, skills, and tools to implement smart systems at scale. Continuous knowledge transfer might be a viable solution to this problem. While most companies currently rely on 3rd-party vendors to build smart systems and put them to work, forward-thinking CIOs and IT leaders must ensure their pilot projects help transfer knowledge from external DevOps, MLOps, and DataOps specialists. This way, enterprises could upscale their in-house capabilities before moving AI prototypes into production.

Overestimating AI’s power

Back in October, MIT Sloan Management Review and Boston Consulting Group unveiled a report that sheds some light on why some companies benefit from AI (while others don’t). DHL, a postal and logistics company that delivers 1.5 billion parcels a year, is among the AI winners. The company uses a computer vision system to determine whether shipping pallets can be stacked together and optimize space in cargo planes. Gina Chung, VP of innovation at DHL, says the AI solution performed poorly in its early days. Once the system started learning from human experts who had years of experience detecting non-stackable pallets, the results improved dramatically.

If complete automation and reduction in your company’s headcount lie at the heart of your AI implementation strategy, you are likely to fail.

For one thing, algorithms need human knowledge to eventually make accurate predictions. And for another, your employees will feel more enthusiastic about teaching algorithms if you make it clear smart machines won’t replace the human workforce in the foreseeable future.

Dealing with AI ethical issues

Greater adoption of smart applications comes along with several AI ethical issues, including:

Bias in algorithmic decision making, which stems from flawed training data prepared by human engineers and bears the mark of social and historical inequities
Moral implications, which mainly revolve around companies’ intent to replace human workers with highly productive, always-on robots

Some AI solutions do inherit racial and gender prejudice from their creators. A facial recognition system deployed by US law enforcement agencies, for instance, is more likely to identify a non-white person as a criminal. However, your company can solve most of these problems by creating balanced training datasets that include images of people representing different ethnic, gender, and age groups. In fact, artificial intelligence can help us eliminate racial, gender, age, and sexual orientation bias in the long run. For example, AI-powered HR management software can scan more resumes than human specialists and identify potential candidates based solely on their education and working experience. And while some industries indeed register persistent changes in their workforce size due to artificial intelligence implementation, it turns out AI will actually create 3% more jobs than it’s going to kill!

How to overcome AI implementation challenges: take-home message

Address an AI vendor with the relevant portfolio and expertise
Work with a skilled business analyst to determine which of your processes and IT systems could benefit from AI
Consider how ethical issues might prevent you from using AI to the fullest
Create a proof of concept to test the solution feasibility and work around technology-related AI pitfalls
Devise a detailed AI project implementation map covering solution development, integration, and scaling, as well as employee onboarding
Together with your vendor, start building your system while ensuring continuous knowledge sharing
Do not raise your hopes high: it takes time, patience, and lots of data to build AI solutions capable of enhancing or taking over critical tasks
Appoint subject matter experts to fine-tune AI algorithms
Educate your employees about the importance of data-driven decision making and optimization opportunities offered by artificial intelligence

Last but not least, continue experimenting with AI — even if your pilot project does not deliver on its promise! 73% of companies that overhaul their processes based on the lessons learned from failures eventually see a sizable ROI on their artificial intelligence investments.

If you need help building, scaling, or tuning an AI solution, feel free to contact the ITRex team, and we’ll connect you with the right expert!

Don’t forget to give us your ? !

Top 5 AI implementation challenges and how your company could overcome them was originally published in Becoming Human: Artificial Intelligence Magazine on Medium, where people are continuing the conversation by highlighting and responding to this story.

Via https://becominghuman.ai/top-5-ai-implementation-challenges-and-how-your-company-could-overcome-them-c7c2efe9ff52?source=rss—-5e5bef33608a—4

source https://365datascience.weebly.com/the-best-data-science-blog-2020/top-5-ai-implementation-challenges-and-how-your-company-could-overcome-them

Difference between AI Machine Learning NLP and Deep Learning.

Most people often get confused with these terms since they are all interrelated. However, let me give my best shot and explain it with the simplest definition.

Picture this, artificial intelligence is the father of machine learning, and natural language processing, whereas deep learning is a subfield of machine learning.

Artificial Intelligence (AI)

Well first of all the term was meant to describe the goal that machines will be able to have humans-like intelligence in the future ( yeah they don’t have so far I know). A lot of money was invested in reaching this goal but we could not achieve our goal. Later we made a new type of AI ( say weak AI or the applied AI that we are having today) which focuses on making machines or systems that LOOK or SEEM to be intelligent ( but are not intelligent). Some of you may be confused. Well so basically there are two types of AI: weak AI and strong AI ( also can say applied AI and general AI ). So far the so-called AI everywhere you see in machines or listen about is weak AI. Strong AI systems will have their consciousness, sentience, etc ( say having brains just like that of humans).

SNIPPETS

Some people also consider that weak AI is not the true AI and companies for sake of better promotions of their products and better market brought the word weak AI ( that is not intelligence according to so some guys I mean and it used as AI by companies just because the word AI sounds very fancy ). So AI is just about creating intelligent machines ( let it get achieved anyhow) I mean make machines or systems that seem to be intelligent like us ( or are like us).

Machine Learning (ML)

Machine learning would not be a subset of AI completely had we achieved strong AI ( because we have only weak AI in real-world ML is a subset of AI … actually ML is subset of weak AI ). Let me clear this. What exactly makes machine learning different from normal learning. Machine learning is a better method of training machines than the old traditional methods ( i know even ML is quite old now but I’m comparing it to methods even before its origin) . Let me give an example. You have to make software for bitcoin trading. You know the exact algorithm that can give the desired output, so you make that algorithm and it inputs all the required values and gives an output. This is normal learning.

Don’t forget to give us your ? !

Difference between AI, Machine Learning, NLP and Deep Learning. was originally published in Becoming Human: Artificial Intelligence Magazine on Medium, where people are continuing the conversation by highlighting and responding to this story.

Via https://becominghuman.ai/difference-between-ai-machine-learning-nlp-and-deep-learning-9f63066087f1?source=rss—-5e5bef33608a—4

source https://365datascience.weebly.com/the-best-data-science-blog-2020/difference-between-ai-machine-learning-nlp-and-deep-learning

Skills and traits that will help you outperform any AI

How to augment machine intelligence and what the future will look like for humans in terms of jobs.

Continue reading on Becoming Human: Artificial Intelligence Magazine »

Via https://becominghuman.ai/skills-and-traits-that-will-help-you-outperform-any-ai-6f9bdb091bd1?source=rss—-5e5bef33608a—4

source https://365datascience.weebly.com/the-best-data-science-blog-2020/skills-and-traits-that-will-help-you-outperform-any-ai

Machine learning is going real-time

Extracting immediate predictions from machine learning algorithms on the spot based on brand-new data can offer a next level of interaction and potential value to its consumers. The infrastructure and tech stack required to implement such real-time systems is also next level, and many organizations — especially in the US — seem to be resisting. But, what even is real-time ML, and how can it deliver a better experience?

Originally from KDnuggets https://ift.tt/3iWdqey

source https://365datascience.weebly.com/the-best-data-science-blog-2020/machine-learning-is-going-real-time

Working With The Lambda Layer in Keras

In this tutorial we’ll cover how to use the Lambda layer in Keras to build, save, and load models which perform custom operations on your data.

Originally from KDnuggets https://ift.tt/2Yjvnds

source https://365datascience.weebly.com/the-best-data-science-blog-2020/working-with-the-lambda-layer-in-keras

How to Get a Job as a Data Scientist

Here’s a step-by-step guide to starting your career in data science.

Originally from KDnuggets https://ift.tt/3ojR3AF

source https://365datascience.weebly.com/the-best-data-science-blog-2020/how-to-get-a-job-as-a-data-scientist

365 Data Science

How to land an Amazon ML Engineer Interview and mistakes to avoid

Data Cleaning

Trending AI Articles:

Removing outliers:

Model creation

Prediction

Don’t forget to give us your ? !

Machine learning adversarial attacks are a ticking time bomb

What is Graph Theory and Why Should You Care?

Top 5 Reasons Why Machine Learning Projects Fail

Top 5 AI implementation challenges and how your company could overcome them

A rundown of AI implementation challenges

Hitting Technology Roadblocks

Trending AI Articles:

Replicating lab results in real-life situations

Scaling Artificial Intelligence

Overestimating AI’s power

Dealing with AI ethical issues

How to overcome AI implementation challenges: take-home message

Don’t forget to give us your ? !

Difference between AI Machine Learning NLP and Deep Learning.

Most people often get confused with these terms since they are all interrelated. However, let me give my best shot and explain it with the simplest definition.

Trending AI Articles:

Don’t forget to give us your ? !

Skills and traits that will help you outperform any AI

Machine learning is going real-time

Working With The Lambda Layer in Keras

How to Get a Job as a Data Scientist