
The story of machine proofs — Part II

Table of contents

  • Introduction
  • Landscape part-1 (Theorem prover as a system)
  • Accomplishments
  • Theorem Proving Systems — Tradeoffs & Architectures
  • Landscape part-2 (Distribution of efforts across provers)
  • Conclusion

Introduction

A quick recap. In “The story of machine proofs — Part-1”, we covered how humans approached formal mathematical proofs, then introduced early attempts at machine proofs, and ended with the picture below (Figure-1) depicting our journey to Centaur Theorem Provers:

Figure-1: Centaur Theorem Provers Journey

We’ll dig into both the machine-only and human-in-the-loop provers but here’s what you can expect to take away from part-2 of this series:

  1. Landscape of theorem provers;
  2. Comparison: Where they differ from one another;
  3. Status check: How close machines have gotten to humans in the automated reasoning race

One point to note is that humans have been in the loop for a while now, so it is more the degree of their involvement that has changed over the years. As noted back in 1969 in Semi-Automated Mathematics [Guard et al.], the idea isn’t new.


“Semi-automated mathematics is an approach to theorem-proving which seeks to combine automatic logic routines with ordinary proof procedures in such a manner that the resulting procedure is both efficient and subject to human intervention in the form of control and guidance. Because it makes the mathematician an essential factor in the quest to establish theorems, this approach is a departure from the usual theorem-proving attempts in which the computer unaided seeks to establish proofs.”

But it has required deliberate attention and effort to generate human-readable proofs, or at least meaningful intermediate representations at critical stages of the pipeline, in order to allow humans to guide the machine and co-create proofs.


Let’s start with some wins from computer-assisted proofs in the last few decades:

A note on the SAT solver mentioned above: it is a very important concept in the context of theorem provers. SAT stands for SATisfiability. The idea is that if we can replace the variables in a boolean expression or formula by TRUE or FALSE such that the formula evaluates to TRUE, then we consider the formula satisfiable. This turns out to be an NP-complete problem: there is currently no known algorithm (not yet proven impossible, though; see P vs NP) that can solve all possible input instances in polynomial time. That said, a SAT solver uses efficient heuristics and approximations and is heavily used to solve problems in the semiconductor space, AI and automated theorem proving. Just within EDA (Electronic Design Automation), where I spent over 15 years, I have seen it actively used for combinational equivalence checking, logic optimization, circuit delay computation, bounded model checking and test pattern generation.
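To make the satisfiability idea concrete, here is a minimal brute-force checker, a sketch only: it tries every assignment, which is exponential in the number of variables, whereas real SAT solvers (DPLL/CDCL engines) prune this search with clever heuristics.

```python
from itertools import product

def is_satisfiable(formula, variables):
    """Try every TRUE/FALSE assignment of the variables.
    Exponential in len(variables); real SAT solvers avoid
    this blowup with learned clauses and heuristics."""
    for values in product([True, False], repeat=len(variables)):
        assignment = dict(zip(variables, values))
        if formula(assignment):
            return True
    return False

# (a OR b) AND (NOT a OR b) is satisfiable (e.g. a=False, b=True) ...
print(is_satisfiable(lambda v: (v["a"] or v["b"]) and (not v["a"] or v["b"]),
                     ["a", "b"]))  # True
# ... while a AND NOT a is not
print(is_satisfiable(lambda v: v["a"] and not v["a"], ["a"]))  # False
```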

Back to theorem provers …

Landscape part-1

“Names don’t constitute knowledge!”

To get at the landscape, I boiled the ocean a bit and compiled several efforts around automated reasoning, from theorem provers and verifiers to related tools and languages, risking some redundancy and scope creep. But at this point I’d rather overdo it than curate a best-of list, so that we get to see the field in its entirety. Here’s a flat list (Figure-2), organized alphabetically, with most if not all efforts of the last few decades.

Really? A perfect 10 x 16 table with 160 cells? I either got lucky or just kept finding new ones until I hit diminishing returns. (The latter!)

Figure-2: Efforts in automated theorem provers

Feynman would have taken one look at that table and asked, “What do you want me to do with 160 names? Names don’t constitute knowledge!” Here’s a short “blast” from the past that is well worth the watch:

Even if I say those names are indices into strategies and tactics for theorem proving, such as graph-based resolution, induction, model generation, proof planning, semi-automatic interactive interfaces, programmability, intuitionistic predicate logic, multi-valued logic, constraint solvers and so on, I wouldn’t be adding much value; they would still be just names, perhaps a tad more descriptive.

So how then do we slice and dice the above list? Maybe by categories of logical systems and reasoning engines (resolution, interactive, induction, satisfiability checking, model checking, constraint solvers…)? Or something like what the “theorem prover museum” (yes, museum!) does? Their goal is to archive all systems with source code and metadata. They have compiled some 30-odd theorem provers dating back to 1955 (Logic Theorist). The highlighted cells in Figure-3 below are the ones from the museum.

Figure-3: Highlighted cells from the theorem prover museum

But another, more interesting compilation is Freek Wiedijk’s “The Seventeen Provers of the World”, in which he asked the authors of the top active (as of 2005) theorem provers to each provide a machine proof of the same problem: the irrationality of sqrt(2).

While I had explored different approaches to the Pythagorean theorem across time in part-1, going all the way back to the Babylonian tablet (Plimpton 322), what I really like about Freek’s approach is the structure and template of targeted questions that he provided to the authors, making it easier to compare different approaches to the same problem. We will spend a considerable portion of part-2 discussing his paper. If you’re curious which theorem provers he picked (or got responses for), see Figure-4 below. Only 4 of his 17 appear in the museum above — IMPS, LEGO, Omega and OTTER.

Figure-4: [2005] The Seventeen Provers of the World

Theorem Prover — A system

Before we get into why or how theorem provers differ from one another, let us frame the prover as a system with an input, an output and a process. Figure-5 shows a blackbox that is fed a theorem; the output is a proof.

Figure-5: TP System

Let us expand further on the system notion with a concrete example:

Theorem: “The sum of two even integers is even”.

So the above theorem is our input. Wait! The words theorem, lemma, proposition and corollary are sometimes used interchangeably, as the differences are mildly subjective. The word theorem is typically reserved for a strong or major result that has been proven true. A lemma is a relatively minor result, popularly acknowledged as a helping or auxiliary theorem. A proposition is even less important, but interesting still. And a corollary is a consequent result that can be readily deduced from a theorem. For our purposes, let’s go with just a plain statement as input to the engine before we elevate its status to that of a theorem (Figure-6).

Figure-6

Now to prove the statement “The sum of two even integers is even”, we need to first understand everything about that statement, i.e. surface all prior knowledge about the terms and concepts involved (Figure-7).

Figure-7

Prior knowledge in this example includes definitions and properties of integers, some of which are assumed to be true, i.e. axioms. Here’s what we know about integers (Figure-8):

Figure-8: Theorem Description
Figure-9

That statement was deceptively simple! The single input has grown to three inputs, now including axioms and definitions as shown in Figure-9, and will grow to four (Figure-10) when we pull in previously proven theorems.

Figure-10
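With the statement, axioms and definitions gathered, we have exactly the inputs a modern prover consumes. As a taste of what that looks like, here is a sketch in Lean; treat it as illustrative: the `ring` tactic and general style follow Mathlib conventions, and I have not run it through a checker.

```lean
-- "n is even" encoded as ∃ k, n = 2 * k
theorem even_add_even (a b : Int)
    (ha : ∃ k, a = 2 * k) (hb : ∃ m, b = 2 * m) :
    ∃ n, a + b = 2 * n := by
  obtain ⟨k, hk⟩ := ha   -- unpack a = 2 * k
  obtain ⟨m, hm⟩ := hb   -- unpack b = 2 * m
  exact ⟨k + m, by rw [hk, hm]; ring⟩   -- 2k + 2m = 2(k + m)
```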

Now we need to know what to do with that knowledge, i.e. strategies and clear rules on how to use what we know about the statement (Figure-11).

Figure-11

Let us shift attention to the right-hand side, i.e. the output. A proof assistant can not only construct a proof but also check the validity of a given proof (is the statement true or false?) and additionally generate counterexamples. So the system can have multiple outputs as well. Systems focused on checking are called proof checkers or verifiers.

Figure-12

The system in Figure-12 above holds true for all theorem provers. But what exactly happens inside an automated theorem prover (ATP)? Figure-13 shows a flowchart of the ATP process from TPTP (Thousands of Problems for Theorem Provers), which begins with checking syntax and semantics, then checks the consistency and satisfiability of the axioms, establishes a logical consequence, and finally proves the theorem and verifies the proof.

ATP Flow chart
Figure-13: ATP Flowchart: http://tptp.org/Seminars/TPTPWorldTutorial/ATPProcess.html

So where do all provers differ? The answer to that lies in the fundamental building blocks of AI — Representation and Reasoning.

  1. Representation: What formal language do we use for mapping the world to the machine? Each formalism has its intrinsic axioms as the starting point for the prover. Custom axioms, however, can help the prover advance more rapidly with the resolution process in the next step.
  2. Reasoning: What algorithms do we use to manipulate the formal expressions? Additionally, how the rules are generated (by human experts, machine learning or a hybrid) matters. “The prover has several techniques at its disposal to process the theorems and the system description. Some examples, among many others, are: Rewriting — substitute terms to simplify computations; Skolemization — elimination of existential quantifiers (∃), leaving only universal ones (∀); Tableaux — break down a theorem into its constituent parts and prove or refute them separately”.
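To make “rewriting” concrete, here is a toy bottom-up rewriter over tuple-encoded binary expressions. The rule set is made up for illustration; production engines use ordered, confluence-checked rules (e.g. Knuth-Bendix completion).

```python
def rewrite(expr):
    """Simplify an expression tree bottom-up with three toy rules:
    x + 0 -> x,  x * 1 -> x,  x * 0 -> 0.
    Expressions are tuples like ("add", "x", 0) or plain atoms."""
    if not isinstance(expr, tuple):
        return expr  # atom: variable name or constant
    op, a, b = expr[0], rewrite(expr[1]), rewrite(expr[2])
    if op == "add" and b == 0:
        return a
    if op == "mul" and b == 1:
        return a
    if op == "mul" and b == 0:
        return 0
    return (op, a, b)

# (y * 1) + 0 simplifies all the way down to y
print(rewrite(("add", ("mul", "y", 1), 0)))  # 'y'
```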

The blackbox in our once-simple system essentially breaks down as shown in Figure-14 where reasoning is split into the logical formalism and the reasoning engine.

Figure-14

We’ll use the above template to walk through Freek’s seventeen theorem provers.

[Sidenote: While I have reframed the authors’ responses to Freek’s question so as to fit the above template, I would like to acknowledge that the underlying content is entirely from the paper and any error in translation/representation is solely my responsibility. I would also suggest that you refer to the paper for a richer and more detailed explanation of each solution.]

Do you have to go through all 17 provers? Not necessarily. But if you’re curious, it can be interesting. My suggestion would be to skim the notes in the centre table and just glance at the input and output to get a sense of what the machines take in and spit out. For some of the provers further down, the proofs are too long (IMPS, Metamath, Nuprl) to print here, so they have been compressed to fit the template and will be hard to read anyway. I still wanted them here to give readers a sense of the formats and readability. If after a few it seems overwhelming, feel free to jump ahead to the Accomplishments section.

How to read the picture?

The input on the left-hand side includes the statement to be proven along with relevant definitions, both specified in the language (the green REPRESENTATION box). The output proof typically uses the same language. The LOGIC (yellow box) and ENGINE (blue box) together give us the rules and approaches to operate on the input. For instance, in the PVS prover the inputs are written in Common Lisp using a higher-order logic system. Operations such as induction, rewriting, symbolic model checking or simplification using decision procedures are applied to the input to generate proof chains. In PVS, the user can interact by creating specification files, type-checking them and providing proofs.

1. HOL: Higher Order Logic Theorem Prover

Figure-15: HOL

Figure-16 shows the family tree of provers going from LCF (Logic of Computable Functions) to HOL (Higher Order Logic).

Figure-16: HOL Family Tree

2. Mizar

Figure-17: Mizar


3. PVS: Prototype Verification System

PVS is an interactive theorem prover. The philosophy behind the PVS logic has been to exploit mechanization in order to augment the expressiveness of the logic.

Figure-18 PVS


4. Coq

Figure-19: Coq


5. OTTER

OTTER: Organized Techniques for Theorem-Proving and Effective Research.

Figure-20: OTTER


6. Isabelle

Figure-21 Isabelle


7. Alfa/Agda

Figure-22: Agda


8. ACL2

ACL2 (“A Computational Logic for Applicative Common Lisp”) is a software system consisting of a programming language, an extensible theory in a first-order logic, and an automated theorem prover.

Figure-23: ACL2

9. PhoX

Figure-24: PhoX


10. IMPS

Figure-25: IMPS


11. Metamath

Figure-26: Metamath


12. Theorema

Figure-27: Theorema


13. LEGO

Figure-28: LEGO


14. Nuprl

Figure-29: Nuprl


15. Omega

Figure-30: Omega

16. B method

Figure 31: B Method


17. Minlog

Figure-32: Minlog


Accomplishments: Formalizations by provers

In the tables below (Figures 33–35), we can see an impressive list of formalizations from some of the above provers.

Figure 33 — Source: http://www.cs.ru.nl/~freek/comparison/comparison.pdf
Figure 34 — Source: http://www.cs.ru.nl/~freek/comparison/comparison.pdf
Figure 35 — Source: http://www.cs.ru.nl/~freek/comparison/comparison.pdf

Theorem Proving Systems: Tradeoffs & Architectures

Tradeoffs

As shown in the sections above, even among just these 17 provers, some use higher-order logic, first-order logic or ZFC set theory, while others use Tarski-Grothendieck set theory, a non-conservative extension of ZFC that adds Tarski’s axiom to help prove more theorems. Some use Hilbert-style axiomatic systems while others use Jaskowski’s system of natural deduction, where proofs rest on assumptions that are freely introduced but discharged under certain conditions. Some use classical logic, while others use intuitionistic or constructive logic, a restriction of classical logic in which the law of excluded middle and double-negation elimination have been removed. And one uses LUTINS (Logic of Undefined Terms for Inference in a Natural Style), a variant of Church’s simple type theory.

Type theory was originally introduced in contrast to set theory to avoid paradoxes and is known to be more expressive. But expressiveness tends to impact automation negatively. As noted in this article “A Survey on Formal Verification Techniques for Safety-Critical Systems-on-Chip”: (Figure-36)

…increase in expressiveness has the inverse effect on automation. The more expressive a logic is, the less automated it can be. This means that tools for higher-order logic usually work iteratively, where the user is responsible for providing simpler proofs to help guide the verification process.

Figure-36— Source: https://www.mdpi.com/2079-9292/7/6/81

With each choice there is a tradeoff. But to tease these tradeoffs out reliably and quickly, we need help: the turnaround time for humans to spot an error, correct it and re-check can be far too long. Around 1900, Bertrand Russell, a mathematician, logician and philosopher, found a bug (Russell’s paradox) in set theory, which eventually led to the discovery/invention of type theory. A hundred years later, around 2000, Vladimir Voevodsky (a Fields medalist) found an error in his own paper (written seven years earlier!), which led to the discovery/invention of univalent foundations. Voevodsky notes the following in an article titled The Origins and Motivations of Univalent Foundations:

In 1999–2000, again at the IAS, I was giving a series of lectures, and Pierre Deligne (Professor in the School of Mathematics) was taking notes and checking every step of my arguments. Only then did I discover that the proof of a key lemma in my paper contained a mistake and that the lemma, as stated, could not be salvaged. Fortunately, I was able to prove a weaker and more complicated lemma, which turned out to be sufficient for all applications. A corrected sequence of arguments was published in 2006. This story got me scared. Starting from 1993, multiple groups of mathematicians studied my paper at seminars and used it in their work and none of them noticed the mistake. And it clearly was not an accident. A technical argument by a trusted author, which is hard to check and looks similar to arguments known to be correct, is hardly ever checked in detail.

Voevodsky went on to propose homotopy type theory, providing univalent foundations for mathematics based on the relations between homotopy theory and type theory. This helped formalize a considerable amount of mathematics in proof assistants such as Coq and Agda, moving the needle in ATP.

Choices such as set theory versus type theory are so core to automated reasoning that this field needs a much deeper understanding of the foundations of mathematics, and hence a lot more mathematicians working alongside computer scientists, logicians and philosophers to accelerate progress. In the very nicely written article Will Computers Redefine the Roots of Math?, the author describes a recent cross-disciplinary collaboration, of which we need more:

Along with Thierry Coquand, a computer scientist at the University of Gothenburg in Sweden, Voevodsky and Awodey organized a special research year to take place at IAS during the 2012–2013 academic year. More than thirty computer scientists, logicians and mathematicians came from around the world to participate. Voevodsky said the ideas they discussed were so strange that at the outset, “there wasn’t a single person there who was entirely comfortable with it.” The ideas may have been slightly alien, but they were also exciting. Shulman deferred the start of a new job in order to take part in the project. “I think a lot of us felt we were on the cusp of something big, something really important,” he said, “and it was worth making some sacrifices to be involved with the genesis of it.”

Architectures

One trend to note is that the complexity of theorem provers has slowly been growing from simple systems to systems of systems, i.e. pulling in the best reasoners from multiple sources. But this also means that maintaining such systems (for performance, robustness, accuracy, reliability) is getting harder. Let’s take a sneak peek at the architectures of some of them: ACL2, Omega, LEAN, ALLIGATOR, iGeoTutor/GeoLogic, Matryoshka and KeYmaera.

ACL2

We saw ACL2 earlier. It has an IDE supporting several modes of interaction, provides a powerful termination-analysis engine and includes automatic bug-finding methods. Most notably, ACL2 (and its precursor Nqthm, the Boyer-Moore Theorem Prover) won the 2005 ACM Software System Award, given to software systems that have had a lasting influence. Figure-37 shows a high-level flow.

Figure-37: ACL2 Architecture Source: https://www.cs.utexas.edu/~moore/acl2/current/manual/index.html?topic=ACL2____ACL2_02System_02Architecture

Omega

We saw Omega earlier as well. Interesting to note here (Figure-38) is how external reasoners (such as OTTER and SPASS for first-order logic, or TPS and LEO for higher-order logic) are invoked as needed.

Figure-38: Omega Architecture Source: https://page.mi.fu-berlin.de/cbenzmueller/papers/C11.pdf

LEAN

LEAN is an open-source theorem prover developed at Microsoft Research and CMU. It is based on the logical foundations of CIC (the Calculus of Inductive Constructions) and has a small trusted kernel. It brings tools and approaches together into a common framework that allows for user interaction, bringing ATP and ITP closer. Figure-39 shows some of the key modules.

Figure-39— Source: https://leanprover.github.io/presentations/20150717_CICM/#/

ALLIGATOR

ALLIGATOR is a natural deduction theorem prover based on dependent type systems. It breaks goals down and generates proof objects (text files with assertions leading to a conclusion), an interesting part of the architecture shown in Figure-40. As noted in the paper “The ALLIGATOR Theorem Prover for Dependent Type Systems: Description and Proof Sample”, the authors identify the following key advantages of having proof objects:

  1. Reliability: Satisfies de Bruijn’s criterion that an algorithm can check the proof objects easily.
  2. Naturalness: Based on natural deduction (closer to human reasoning — less axiomatic)
  3. Relevance: Proof objects can identify proofs which are valid but spurious (do not consume their premises)
  4. Justification of behavior: Proof objects provide direct access to the justifications that an agent has for the conclusions and the interpretations that it constructs, i.e. more explainability.
Figure-40— Source: https://www.aclweb.org/anthology/W06-3918.pdf

iGeoTutor and GeoLogic

An important goal of efforts like iGeoTutor (see Automated Geometry Theorem Proving for Human-Readable Proofs, Figure-41) and GeoLogic is to generate more human-readable proofs in geometry. GeoLogic is a logic system for Euclidean geometry that lets users interact with the logic system and visualize the geometry (equal distances, a point lying on a line, and so on). There is also a detailed presentation by Pedro Quaresma on geometric automated theorem proving, going back to H. Gelernter’s 1959 effort (see Realization of a geometry-theorem proving machine).

Figure 41 — GeoTutor Architecture

Matryoshka

Matryoshka fuses two lines of research: automated theorem proving and interactive theorem proving. These systems are based on the premise that first-order (FO) provers are best suited for performing most of the logical work, but they go on to enrich superposition (resolution + rewriting) and SMT (Satisfiability Modulo Theories) with higher-order (HO) reasoning while preserving their desirable properties (Figure-42 and Figure-43).

Figure-42 — Source: https://matryoshka-project.github.io/
Figure-43- Source: https://matryoshka-project.github.io/

KeYmaera

KeYmaera is a theorem prover for differential dynamic logic, used to verify properties of hybrid systems (systems with mixed discrete and continuous dynamics). Figure-44 shows the three big parts of the system: the axiomatic core, the tactical prover and the user interface. The architecture gives us a sense of the many moving parts.

Figure-44— KeYmaera Architecture: https://www.ls.cs.cmu.edu/KeYmaeraX/documentation.html

What I find missing, though, is a single picture describing the entire design space of theorem provers: one that captures the key dimensions and situates automated and interactive theorem provers, Lean-based provers, resolution and tableau provers, SMT and SAT solvers, model checkers, equational reasoning, quantifier-free combinations of first-order theories, conflict-driven theory combination and the more recent machine-learning-based approaches. Maybe it exists, but I could not find one easily, nor am I currently qualified (as an ATP enthusiast) to weave one together (perhaps a future project). I did like the taxonomy in the 1999 paper “A taxonomy of theorem-proving strategies”, but it is dated and limited to first-order theorem proving (Figure-45).

Figure-45 — Source: http://profs.sci.univr.it/~bonacina/papers/LNAI1600-1999taxonomy.pdf

Landscape (part 2) — Distribution of efforts

One of the surveys on theorem provers showed that a significant percentage of efforts are first-order logic provers (Figure-46) and that the most popular calculus of reasoning is deduction (Figure-47).

Figure 46- Source: https://arxiv.org/pdf/1912.03028.pdf
Figure 47- Source: https://arxiv.org/pdf/1912.03028.pdf

Look at the variety of languages used across provers. They range from ML (MetaLanguage) and its dialects like OCaml (Objective Categorical Abstract Machine Language), Prolog, Pascal, Mathematica, and Lisp and its dialect Scheme, to more esoteric specification languages like Gallina (exclusive to Coq). Other theorem provers not mentioned here use Scala, C/C++, Java, SML or Haskell. As for the reasoning engines, some use functional programming, some follow a proof plan whose goals are automatically broken down into sub-goals, some are interactive, while others pull in multiple external reasoning systems or model checkers.

The same survey also broke down the programming designs (Figure-48) and languages (Figure-49) across provers, indicating that close to a quarter of the efforts use functional programming, with OCaml as the language of choice.

Figure 48— Source: https://arxiv.org/pdf/1912.03028.pdf
Figure 49 — Source: https://arxiv.org/pdf/1912.03028.pdf

In addition to the above differences, as noted in “A Survey on Theorem Provers in Formal Methods”, there are criteria like de Bruijn’s criterion and Poincaré’s principle that help determine how good a prover is. According to the de Bruijn criterion, “the correctness of the mathematics in the system should be guaranteed by a small checker”; that is, the prover should generate proofs that can be certified by a simple, separate checker known as the proof kernel. A small proof kernel improves reliability. Figure-50 shows which provers satisfy the criterion and which don’t.
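The spirit of the de Bruijn criterion can be sketched in a few lines: a tiny, independent checker that accepts a proof only if every step is justified. The tuple encoding and the modus-ponens-only logic below are toy assumptions for illustration, not any real kernel.

```python
def check_proof(axioms, steps):
    """Tiny proof kernel. Each step is (formula, justification), where
    justification is "axiom" or ("mp", premise). A step is accepted only
    if it is an axiom or follows by modus ponens from earlier steps.
    Implications are encoded as tuples: ("->", p, q) means p implies q."""
    proved = set()
    for formula, just in steps:
        if just == "axiom":
            if formula not in axioms:
                return False
        else:
            _, premise = just
            # Need both the premise and the implication premise -> formula
            if premise not in proved or ("->", premise, formula) not in proved:
                return False
        proved.add(formula)
    return True

axioms = {"p", ("->", "p", "q")}
good = [("p", "axiom"), (("->", "p", "q"), "axiom"), ("q", ("mp", "p"))]
bad = [("q", ("mp", "p"))]  # modus ponens without first proving the premise
print(check_proof(axioms, good), check_proof(axioms, bad))  # True False
```

However large and heuristic the search for a proof is, only this small checker needs to be trusted.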

Figure-50 — Source: https://arxiv.org/pdf/1912.03028.pdf

And a theorem prover satisfies the Poincaré principle if it can automatically prove the correctness of calculations and other small, trivial tasks. Figure-51 shows which provers satisfy the principle and which don’t.
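A toy illustration of the Poincaré principle: trivial computations should be discharged by calculation, not by hand-written proof steps. The restricted `eval` here is a stand-in assumption for a prover’s internal computation rule.

```python
def check_arith(claim):
    """Discharge claims like '2 + 2 == 4' by computing both sides.
    eval() is restricted to bare arithmetic (no builtins); a real
    prover evaluates inside its trusted logic instead."""
    left, right = claim.split("==")
    env = {"__builtins__": {}}
    return eval(left, env) == eval(right, env)

print(check_arith("2 + 2 == 4"))       # True
print(check_arith("3 * 37 == 111"))    # True
print(check_arith("2 ** 10 == 1000"))  # False
```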

Figure-51 — Source: https://arxiv.org/pdf/1912.03028.pdf

Here’s a comparison (Figure-52) of the 17 provers from Freek’s paper. A “+” means the feature is present and a “-” means it is not.

Figure-52 — Source: http://www.cs.ru.nl/~freek/comparison/comparison.pdf

In summary, here’s the distribution of theorem provers as per the survey (Figure-53).

Figure 53- Source: https://arxiv.org/pdf/1912.03028.pdf

Conclusion

Let’s return to the promised takeaways:

  1. Landscape: We saw a huge dump of 160 efforts out of which we looked at 17 provers with some of their proofs. We further looked at the architectures of a handful of provers noting the emerging complexity of software systems.
  2. Comparison: We compared the logical systems, the representation and reasoning engines across multiple provers calling out some success criteria and dominating efforts.
  3. Status Check: To address this, let us revisit the comparison table (Figure-) from part-1. But a quick digression first:

In the article, “The Work of Proof in the Age of Human–Machine Collaboration“, Stephanie Dick quotes Larry Wos and Steve Winker as having said this in their contribution to a 1983 conference on ATP:

“Proving theorems in mathematics and in logic is too complex a task for total automation, for it requires insight, deep thought, and much knowledge and experience.”

Larry Wos joined Argonne National Laboratory in 1957 and began using computers to prove mathematical theorems in 1963. What is amazing is that he is still working on it! Wos and his colleague at Argonne were the first to win the Automated Theorem Proving Prize in 1982.

With that, let’s get back to the original table below (Figure-54):

Figure-54: Part-1 status

And our updated table with an extra column to the right below: (Figure-55)

Figure-55 Part 2 Status

At a high level, it appears that theorem provers have gotten better with logical systems, explainability, geometric reasoning and model checking. There remain issues with graceful failure when decidability and satisfiability cannot be determined. There isn’t much yet on learning and “imagining/generating scenarios”. To borrow from Wos and Winker’s statement above, we are still missing the experience (learning), the insight and the deep-thought (what I call imagination) bits. And that is precisely what we will dig into in our next and final part. We’ll venture into the role that machine learning, transformers, deep neural networks and commonsense reasoning are beginning to play in the ATP/ITP space. Stay tuned …



The story of machine proofs — Part II was originally published in Becoming Human: Artificial Intelligence Magazine on Medium, where people are continuing the conversation by highlighting and responding to this story.


Building a small knowledge graph using NER

Here, we implement a knowledge graph from a Wikipedia actors dataset. The article [1] by Analytics Vidhya has been heavily referenced for this, but we have made improvements in the form of:

  1. Data Preprocessing — removed punctuation and stop words
  2. Named Entity Recognition — this has helped in constructing a more meaningful graph with less noise.

Using Named Entity Recognition has helped us obtain the top-occurring actors, writers, studio names, etc.

import re
import string
import pandas as pd
import spacy
from tqdm import tqdm
from spacy.matcher import Matcher
import networkx as nx
import matplotlib.pyplot as plt
import nltk
from nltk.tokenize import word_tokenize
from nltk.corpus import stopwords
# Load English tokenizer, tagger, parser, NER and word vectors
nlp = spacy.load("en_core_web_sm")
tqdm._instances.clear()
from google.colab import drive
drive.mount('/content/drive')
Mounted at /content/drive

Importing data from csv and storing in a pandas dataframe

data_dir = "/content/drive/My Drive/data/wiki_sentences_v2.csv"
sentences = pd.read_csv(data_dir)
sentences.shape
(4318, 1)
sentences.head()

Here are some sample sentences containing the words “bollywood” and “contemporary” together:


sentences[sentences['sentence'].str.contains('bollywood') & sentences['sentence'].str.contains('contemporary')]

Entity Pairs Extraction Function

def get_entity(sent):
    ent1 = ""  # subject entity
    ent2 = ""  # object entity

    prev_token_text = ""  # text of the previous token
    prev_token_dep = ""   # dependency tag of the previous token

    prefix = ""
    modifier = ""

    for tok in nlp(sent):
        # Go in only if it is not a punctuation token, else next word
        if tok.dep_ != "punct":
            # Check if the token is a compound word
            if tok.dep_ == "compound":
                prefix = tok.text
                # If the previous token was also a compound, chain them
                if prev_token_dep == "compound":
                    prefix = prev_token_text + " " + prefix
            # Check if the token is a modifier
            if tok.dep_.endswith("mod"):
                modifier = tok.text
                # If the previous token was a compound, chain it
                if prev_token_dep == "compound":
                    modifier = prev_token_text + " " + modifier

            # Checking if subject
            if tok.dep_.find("subj") == True:
                ent1 = modifier + " " + prefix + " " + tok.text
                prefix = ""
                modifier = ""
            # Checking if object
            if tok.dep_.find("obj") == True:
                ent2 = modifier + " " + prefix + " " + tok.text
                prefix = ""
                modifier = ""

            # Update state for the next iteration
            prev_token_text = tok.text
            prev_token_dep = tok.dep_

    return [ent1.strip(), ent2.strip()]

Now let’s obtain entity pairs from a sentence:

[ent1, ent2] = get_entity("the film had 200 patents")
print("Subj : {a}, obj : {b}".format(a=ent1, b=ent2))
get_entity("the film had 200 patents")
Subj : film, obj : 200  patents
['film', '200 patents']
entity_pairs = []
for i in tqdm(sentences['sentence']):
    entity_pairs.append(get_entity(i))
tqdm._instances.clear()
100%|██████████| 4318/4318 [00:41<00:00, 103.43it/s]
subjects = [x[0] for x in entity_pairs]
objects = [x[1] for x in entity_pairs]

Relation Extraction

Next, the relation between nodes is extracted. The hypothesis is that the main verb, or ROOT word, forms the relation between the subject and the object.

def get_relation(sent):
    doc = nlp(sent)
    # Initialise the matcher with the shared vocab
    matcher = Matcher(nlp.vocab)
    # Define the pattern: a ROOT verb, optionally followed by a
    # preposition, an agent and an adjective
    pattern = [{'DEP': 'ROOT'},
               {'DEP': 'prep', 'OP': '?'},
               {'DEP': 'agent', 'OP': '?'},
               {'POS': 'ADJ', 'OP': '?'}]
    # Add the pattern to the matcher (spaCy v2 API)
    matcher.add("matcher_1", None, pattern)
    # Apply the matcher to the doc
    matches = matcher(doc)

    # The matcher returns a list of (match_id, start, end) tuples; the
    # span from start to end in the doc contains the relation
    span = doc[matches[0][1]:matches[0][2]]
    return span.text

Below we observe a problem. The relation for the second sentence should be "couldn't complete" in order to capture the semantics correctly, but the extractor fails to do so.

get_entity("John completed the task"), get_relation("John completed the task")
(['John', 'task'], 'completed')
get_entity("John couldn't complete the task"), get_relation("John couldn't complete the task")
(['John', 'task'], 'complete')
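This failure can be patched with a small post-processing step: if the parse contains a token with the `neg` dependency, prefix the relation with "not". Below is a minimal sketch of the idea which, for illustration only, takes the parse as a hand-written list of (text, dependency) pairs rather than a spaCy Doc; in the pipeline above you would build that list as `[(tok.text, tok.dep_) for tok in nlp(sent)]`.

```python
def negate_relation(relation, tagged_tokens):
    """Prefix the relation with 'not' when the parse contains a `neg` token.

    `tagged_tokens` is a list of (text, dependency) pairs.
    """
    if any(dep == "neg" for _, dep in tagged_tokens):
        return "not " + relation
    return relation

# "John couldn't complete the task" -- dependency tags sketched by hand
tokens = [("John", "nsubj"), ("could", "aux"), ("n't", "neg"),
          ("complete", "ROOT"), ("the", "det"), ("task", "dobj")]
print(negate_relation("complete", tokens))  # not complete
```

This keeps `get_relation` unchanged and only adjusts its output when a negation marker is present anywhere in the sentence.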

Anyway, we move on. Let’s get the relations for the entire dataset.

relations = [get_relation(i) for i in tqdm(sentences['sentence'])]
tqdm._instances.clear()
100%|██████████| 4318/4318 [00:41<00:00, 102.82it/s]

Let’s look at the most frequently occurring subjects.

pd.Series(subjects).value_counts()[:10]
it      267
film    222
        147
he      141
they     66
i        64
this     55
she      41
we       27
films    25
dtype: int64

We observe that these words are mostly generic, hence of not much use to us.

pd.Series(relations).value_counts()[:10]
is          418
was 340
released 165
are 98
were 97
include 81
produced 51
's 50
made 46
used 43
dtype: int64

Not surprisingly, “is” and “was” form the most common relations, simply because they are the most common verbs. We want more meaningful relations to be more prominent.

Data Pre-Processing

Let’s see what happens if we pre-process the data by removing stopwords and punctuation marks.

def isNotStopWord(word):
    return word not in stopwords.words('english')

def preprocess(sent):
    # Drop any parenthesised or bracketed text
    sent = re.sub(r"[\(\[].*?[\)\]]", "", sent)
    words = word_tokenize(sent)
    # Remove punctuation marks, except '.', '?' and '!'
    punctuations = '"#$%&\'()*+,-/:;<=>@\\^_`{|}~'
    words = map(lambda x: x.translate(str.maketrans('', '', punctuations)), words)

    # Lowercase the words and remove the stopwords
    words = map(str.lower, words)
    words = filter(isNotStopWord, words)
    return ' '.join(words)

preprocessed_sentences = [preprocess(i) for i in tqdm(sentences['sentence'])]
tqdm._instances.clear()
100%|██████████| 4318/4318 [00:06<00:00, 659.03it/s]

Getting entity pairs from the preprocessed sentences

entity_pairs = []
for i in tqdm(preprocessed_sentences):
    entity_pairs.append(get_entity(i))
tqdm._instances.clear()
100%|██████████| 4318/4318 [00:38<00:00, 111.16it/s]
relations = [get_relation(i) for i in tqdm(preprocessed_sentences)]
tqdm._instances.clear()
100%|██████████| 4318/4318 [00:39<00:00, 109.65it/s]
pd.Series(relations).value_counts()[:10]
released    173
include      81
produced     62
directed     52
made         50
film         48
used         42
became       35
began        35
included     34
dtype: int64

We see above that only the important relations remain, “released” being the most common; “is”, “was” and other trivial words are gone. This fulfils our goal of eliminating noise.

What if we only want named entities to be present? Entities like actors, films, studios, composers, etc.

entity_pairs2 = entity_pairs
relations2 = relations
# Keep a relation only when both its source and target entities are present
entity_pairs3 = []
relations3 = []
for i in tqdm(range(len(entity_pairs2))):
    if entity_pairs2[i][0] != '' and entity_pairs2[i][1] != '':
        entity_pairs3.append(entity_pairs2[i])
        relations3.append(relations2[i])
tqdm._instances.clear()
100%|██████████| 2851/2851 [00:00<00:00, 482837.79it/s]

As we observe, the number of relations decreases to 2851; previously it was around 4,300.

Named Entity Recognition Using Spacy

source = []
target = []
edge = []
for i in range(len(entity_pairs)):
    # Named entities for the source, joined into a single string
    doc_source = nlp(entity_pairs[i][0]).ents
    str_source = [str(word) for word in doc_source]
    doc_source = ' '.join(str_source)
    # Named entities for the target, joined into a single string
    doc_target = nlp(entity_pairs[i][1]).ents
    str_target = [str(word) for word in doc_target]
    doc_target = ' '.join(str_target)
    # Keep the triple if either side contains a named entity
    if doc_source != '' or doc_target != '':
        edge.append(relations[i])
        source.append(entity_pairs[i][0])
        target.append(entity_pairs[i][1])

We had obtained entity pairs before, but they contained many irrelevant words, and preprocessing narrowed them down quite a bit. Now we obtain the named-entity pairs, which form the source and target respectively, so each triple looks like (source, target, edge). We prune this further by removing every pair in which neither the source nor the target is a named entity, and keep the rest. The relations extracted earlier form the edges.

Now, we find the most popular sources, targets and relations.

print("###################  Most popular source entities  ###################### \n", pd.Series(source).value_counts()[:10])
print("###################  Most popular target entities  ###################### \n", pd.Series(target).value_counts()[:10])
print("###################  Most popular relations  ###################### \n", pd.Series(relations).value_counts()[:20])
###################  Most popular source entities  ###################### 
                          165
film                       64
khan                       10
first film                  9
movie                       6
indian film                 6
two certificate awards      5
music                       5
series                      5
films                       4
dtype: int64
###################  Most popular target entities  ######################
                               242
warner bros                      7
national film awards             5
film                             4
positive box office success      4
surviving studio india           4
1980s                            3
british film institute           3
undisclosed roles                3
positive reviews                 3
dtype: int64
###################  Most popular relations  ######################
released      173
include        81
produced       62
directed       52
made           50
film           48
used           42
became         35
began          35
included       34
films          32
composed       31
become         30
written        27
shot           27
set            26
received       26
introduced     25
considered     22
called         22
dtype: int64

Constructing the Knowledge Graph

We first store the knowledge graph in a pandas DataFrame. The graph will be directed.

knowledge_graph_df = pd.DataFrame({'source': source, 'target': target, 'edge': edge})
knowledge_graph_df.head()
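As an aside, the directed (source, target, edge) structure needs nothing more exotic than a dictionary; a minimal sketch with made-up triples (not the actual dataset):

```python
from collections import defaultdict

# Hypothetical triples in the same (source, target, edge) shape as the DataFrame
triples = [
    ("khan", "big boss", "hosted"),
    ("khan", "2010 film veer", "released"),
    ("warner bros", "film", "produced"),
]

graph = defaultdict(list)  # directed: source -> [(edge, target), ...]
for src, tgt, rel in triples:
    graph[src].append((rel, tgt))

print(graph["khan"])  # [('hosted', 'big boss'), ('released', '2010 film veer')]
```

networkx gives us the same structure plus layout and drawing for free, which is why we use it below.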

Let’s see what we get when the source entity is “khan”.

knowledge_graph_df[knowledge_graph_df['source']=="khan"]
# MultiDiGraph because it is a directed graph
G = nx.from_pandas_edgelist(knowledge_graph_df[knowledge_graph_df['source'] == "khan"],
                            source='source', target='target',
                            edge_attr=True, create_using=nx.MultiDiGraph())
plt.figure(figsize=(10, 10))
pos = nx.spring_layout(G, k=1.0)  # k defines the distance between nodes
nx.draw(G, pos, with_labels=True, node_size=2500)
plt.show()

We observe that “khan” points to “actress katrina kaif”, “big boss” and “2010 film veer”.

Pretty obvious which “khan” is indicated here.

This is what we get when the target is “warner bros”:

knowledge_graph_df[knowledge_graph_df['target'] == "warner bros"]

G = nx.from_pandas_edgelist(knowledge_graph_df[knowledge_graph_df['target'] == "warner bros"],
                            source='source', target='target',
                            edge_attr=True, create_using=nx.MultiDiGraph())
plt.figure(figsize=(10, 10))
pos = nx.spring_layout(G, k=1.0)  # k defines the distance between nodes
nx.draw(G, pos, with_labels=True, node_size=2500)
plt.show()

So, even for a very small dataset, we get some interesting graph structures that match our intuition. We can develop this further by taking a larger dataset and calculating graph measures such as degree centrality.
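Degree centrality itself is simple to compute by hand: a node’s degree divided by the n - 1 other nodes it could connect to (networkx offers the same via `nx.degree_centrality`). A minimal sketch on a toy undirected edge list:

```python
def degree_centrality(edges):
    """Degree centrality for an undirected edge list: degree / (n - 1)."""
    nodes = {n for edge in edges for n in edge}
    degree = {n: 0 for n in nodes}
    for u, v in edges:
        degree[u] += 1
        degree[v] += 1
    n = len(nodes)
    return {node: d / (n - 1) for node, d in degree.items()}

# Toy graph: 'film' touches all three other nodes, so its centrality is 1.0
edges = [("film", "khan"), ("film", "warner bros"), ("film", "awards")]
print(degree_centrality(edges)["film"])  # 1.0
```

On a real knowledge graph, high-centrality nodes such as “film” would surface the hub entities of the corpus.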

References

  1. https://www.analyticsvidhya.com/blog/2019/10/how-to-build-knowledge-graph-text-using-spacy/



Building a small knowledge graph using NER was originally published in Becoming Human: Artificial Intelligence Magazine on Medium, where people are continuing the conversation by highlighting and responding to this story.

Via https://becominghuman.ai/building-a-small-knowledge-graph-using-ner-296930592bcf?source=rss—-5e5bef33608a—4

source https://365datascience.weebly.com/the-best-data-science-blog-2020/building-a-small-knowledge-graph-using-ner

The Four Jobs of the Data Scientist

So, what do you do for a living? Sometimes, the answer to that question can feel like, “everything!” Well, for the Data Scientist, an extreme sense of being a “jack of all trades” is common. In fact, four such trades can be defined that a top-quality Data Scientist will iterate through during any one project.

Originally from KDnuggets https://ift.tt/35ExFrF

source https://365datascience.weebly.com/the-best-data-science-blog-2020/the-four-jobs-of-the-data-scientist

The Best Tool for Data Blending is KNIME

These are the lessons and best practices I learned in many years of experience in data blending, and the software that became my most important tool in my day-to-day work.

Originally from KDnuggets https://ift.tt/3bv3gQq

source https://365datascience.weebly.com/the-best-data-science-blog-2020/the-best-tool-for-data-blending-is-knime

KDnuggets News 21:n02 Jan 13: Best Python IDEs and Code Editors; 10 Underappreciated Python Packages for Machine Learning Practitioners

Best Python IDEs and Code Editors You Should Know; 10 Underappreciated Python Packages for Machine Learning Practitioners; Top 10 Computer Vision Papers 2020; CatalyzeX: A must-have browser extension for machine learning engineers and researchers

Originally from KDnuggets https://ift.tt/35SDMZN

source https://365datascience.weebly.com/the-best-data-science-blog-2020/kdnuggets-news-21n02-jan-13-best-python-ides-and-code-editors-10-underappreciated-python-packages-for-machine-learning-practitioners

GPT-3 And Code Generation: AI-enabled Instant Software Development

Cooking noodles like ramen used to be a long and arduous task. Now we have instant noodles. Just add hot water into a bowl and add your…

Via https://becominghuman.ai/gpt-3-and-code-generation-ai-enabled-instant-software-development-270795077cbd?source=rss—-5e5bef33608a—4

source https://365datascience.weebly.com/the-best-data-science-blog-2020/gpt-3-and-code-generationai-enabled-instant-software-development

How to stop your chatbot giving the wrong answers

You’ve built your chatbot, you’ve carefully and tirelessly trained and tested it, and you’re finally ready to launch it to go live — hoorah! But after monitoring its performance over a period of time after go-live, you notice that some user questions return incorrect intents (so give the wrong answers), despite the fact there’s training data in your model that should result in the correct intent being returned. You also spot that some user questions return the correct intent with very low confidence — so the answer isn’t presented to the user. This leaves everyone very frustrated.

As a chatbot builder, you want to understand what influence each utterance has (and even what influence each word in each utterance has) on your model’s performance. Most of the popular NLP providers use a variety of different models and algorithms that are virtually impossible for chatbot builders to fathom. And it’s also very difficult and time-consuming to do a deep dive of your training data and analyse its learning value. But you really do need to do a thorough analysis if you want to improve the performance of your bot.

Here are three techniques: two to measure your bot’s performance and one for visualising the results of its performance.


1. Cross-validation testing

This involves preparing a separate labelled dataset of utterances that your model hasn’t been trained on (typically real user questions), and then testing it against your chatbot to assess your model’s performance and see if there are any gaps in your bot’s knowledge. Cross-validation testing has its advantages and disadvantages though:

Advantage:

  • If you have a large file from many user interactions over many months, you are likely to have a great dataset that represents all the intents/subject areas you wish to cover.

Disadvantages:

  • If you are at the early stage of your chatbot model, you may still be in the process of creating new intents, and splitting or merging some existing intents. Each time this happens, you’ll have to update your cross-validation file, and this can be time-consuming.
  • Keeping this dataset out of the training data can be a challenge. Naturally, if you know the dataset, you’ll check the test results, and where a test fails you may be tempted to train your model on it, with the potential to overfit it to the cross-validation dataset.
  • It will only be as good as the data in it.
  • Ideally, cross-validation testing will give meaningful and accurate results, but there’s no way of accurately predicting what your bot will encounter in the future, and you will constantly have to update your cross-validation dataset. You’ll also have to keep testing on your cross-validation dataset to monitor for any regression.
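The test loop itself is the simple part. A minimal sketch, with a hypothetical keyword-matching `predict_intent` stub standing in for a real NLP provider call:

```python
# Hypothetical stand-in for the chatbot's intent classifier
def predict_intent(utterance):
    return "contact_us" if "telephone" in utterance.lower() else "other"

# Held-out labelled utterances the model was NOT trained on
validation_set = [
    ("what is your telephone number", "contact_us"),
    ("do you have a phone line", "contact_us"),
    ("what are your opening hours", "other"),
]

correct = sum(1 for text, intent in validation_set
              if predict_intent(text) == intent)
accuracy = correct / len(validation_set)
print(f"accuracy: {accuracy:.2f}")  # 0.67 -- "phone line" is missed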


2. K-fold cross-validation testing

K-fold cross-validation testing solves some of the issues mentioned above. It generates test data from your training data automatically, by removing some of the training data from its intent and using it as test data. It evaluates your training data by dividing it into a number of sub-samples (or folds) and then using one fold at a time as a testing dataset against the rest of the training data. For example, you might divide your training data into 10 equal folds (you could use more or fewer, but 10 is common), and then perform 10 separate tests, each time holding back one of the folds and testing your model on the data in the held-back fold. This means that all training data becomes test data at some point. This technique helps you to see weaknesses in your data. Again, it has its advantages and disadvantages.

K-Fold visualisation

Advantages:

  • If you don’t have test data, and you are in the early stage of model building, it generates the test data itself.
  • K-fold is a known technique.

Disadvantages:

  • It’s time-consuming
  • K-fold doesn’t work well with low levels of training data (fewer than 200 samples per class or intent)
  • Each change to the training data affects the fold the initial training data was in, generating variation in your test results due to the randomisation of the folding. This then makes it difficult to understand the learning value of your new changes.
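The fold mechanics described above can be sketched without any library (scikit-learn’s `KFold` does the same job more robustly): shuffle once, slice into k folds, and hold one fold out per round.

```python
import random

def k_fold_splits(data, k=10, seed=42):
    """Yield (train, test) pairs; each item lands in the test set exactly once."""
    items = list(data)
    random.Random(seed).shuffle(items)
    for i in range(k):
        test = items[i::k]  # every k-th shuffled item, offset i
        train = [x for j, x in enumerate(items) if j % k != i]
        yield train, test

data = list(range(20))
for train, test in k_fold_splits(data, k=5):
    assert len(test) == 4 and len(train) == 16
    assert not set(train) & set(test)
```

Fixing the shuffle seed, as here, avoids the fold-randomisation variance mentioned in the last disadvantage above.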

Here are some Jupyter notebooks that show Python examples of cross-validation and K-fold.

3. Visualising your test results through a confusion matrix

This technique allows you to visualize the performance of the intent predictions in the form of a table. To build a confusion matrix, you’d use a test validation dataset. Each piece of data in your dataset needs a predicted outcome (the intent that the data should return) and an actual outcome (the intent that the data actually returns in your model). From the predicted and actual outcomes, you will get a count of the number of correct and incorrect predictions.

These numbers are then organised into a table, or matrix, where:

Confusion matrix
  • True positive (TP) for correctly predicted event values
  • False positive (FP) for incorrectly predicted event values
  • True negative (TN) for correctly predicted no-event values
  • False negative (FN) for incorrectly predicted no-event values

IMPORTANT: You’ll want to think about how you categorise the right and wrong predictions that fall below your chatbot’s confidence threshold.

These values can then be used to calculate more classification metrics like precision and recall, and F1 scores. Precision highlights any false-positive problems you may have in your model and recall highlights any false-negative problems. The F1 score is an overall measure of a model’s accuracy. The confusion matrix technique has its own advantages and disadvantages.
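The metric calculations reduce to a few ratios over the four counts; a minimal sketch with made-up counts:

```python
def classification_metrics(tp, fp, tn, fn):
    precision = tp / (tp + fp)  # how many flagged intents were right
    recall = tp / (tp + fn)     # how many right intents were flagged
    f1 = 2 * precision * recall / (precision + recall)
    accuracy = (tp + tn) / (tp + fp + tn + fn)
    return precision, recall, f1, accuracy

p, r, f1, acc = classification_metrics(tp=80, fp=20, tn=90, fn=10)
print(f"precision={p:.2f} recall={r:.2f} f1={f1:.2f}")  # precision=0.80 recall=0.89 f1=0.84
```

In a multi-intent bot you would compute these per intent and average them, but the per-cell arithmetic is the same.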

Confusion matrix with precision, recall and F1

Advantages:

  • It provides a good visual of your bot’s performance (see diagram above as an example)

Disadvantages:

  • It is a very time-consuming task
  • Calculations for these additional metrics are quite complex
  • It can be quite challenging to interpret unless you’re familiar with the statistics involved
  • It won’t necessarily help you see why an utterance isn’t working

Here’s a link to some further information on the confusion matrix approach

In summary, the cross-validation, K-fold and confusion matrix methods for diagnosing and improving a chatbot are very time-consuming, and difficult to understand if you’re not a statistician. Also, you’re likely to find a lot of issues that need fixing, and you’ll probably want to fix them all at once — but this in turn will generate challenges as you try to understand which changes worked/helped your model and which didn’t. You also need to think about the possible further regression (the ripple effect of changing data in one part of the model modifying the performance of the rest of the model) of your chatbot, and it’s very difficult to identify and unpick these newly created problems.

Modern tools are arriving on the market

Some companies choose to rely on the help of a tool such as QBox. QBox is another way of testing your chatbot to help improve its accuracy and performance in a matter of minutes. It analyses and benchmarks your chatbot training data and gives insight by visualising where your chatbot does and doesn’t perform well, and why. You can see your chatbot’s performance at model level, intent level and utterance level, even word-by-word, in clear and easy-to-understand visuals.

QBox intent diagram — Visualisation of intent utterances

For example, we built a very simple banking bot based on common customer-banking questions. But I noticed that some user questions requesting a contact telephone number didn’t always return the correct intent with good confidence — despite having training data to that effect. I was able to diagnose the problem quickly by doing a deep dive into the actual training data and getting word-by-word analysis, using the visuals that QBox provides:

QBox utterance influence analysis — training-word density

This helped me to diagnose why the training data wasn’t working as expected.

From the colour-coded words (on the left), I had a good understanding of the word density, and could see I had several instances of “telephone” in a different intent, and a lack of variations on asking for a telephone number in my contact_us intent.

QBox utterance influence analysis — Words’ influence on prediction

You can get an even deeper word-by-word analysis by using the QBox Explain feature (bottom illustration). This chart shows each word according to its influence on the prediction. In this illustration, you can easily see the biggest influencer to return an incorrect intent is “telephone”, and the lack of influence this word has on my Contact intent. This information clearly showed what the problem was and what I needed to do to fix it: add a little more training data to my Contact intent to strengthen the concept of asking for a telephone number.

Once you make changes to your chatbot, you can test your model again through QBox to validate those changes and see what effect they’ve had on your model overall — whether good or bad.

To find out more, visit QBox.ai.

Conclusion

Nobody would dispute the importance of ensuring your chatbot performs well to secure customer satisfaction. More recently, we’re seeing that it’s becoming a business requirement to understand the NLP model performance, analysing regression, ensuring clear KPIs are set and building your model development as part of a DevOps (MLOps) strategy. Whichever technique you use, it should help you on the path to improving performance, and ultimately your confidence in your model.



How to stop your chatbot giving the wrong answers was originally published in Becoming Human: Artificial Intelligence Magazine on Medium, where people are continuing the conversation by highlighting and responding to this story.

Via https://becominghuman.ai/how-to-stop-your-chatbot-giving-the-wrong-answers-50ffcd7a22c9?source=rss—-5e5bef33608a—4

source https://365datascience.weebly.com/the-best-data-science-blog-2020/how-to-stop-your-chatbot-giving-the-wrong-answers

Top 20 Most Popular Programming Languages For 2021 and Beyond

This list of the most popular programming languages will help your business make the right choice for its next project.

Below is a listing of the most popular programming languages for 2021; it will help you choose the best language for web and mobile app development.

Many programming languages exist, and new ones are constantly emerging. The major concern is which one dominates the market, and which is the most popular and best suited for web and mobile app development.

It is not so simple to rank the most popular programming languages of 2021, but the task can be done by weighing various metrics, such as technology popularity, trends, career prospects, open-source adoption, etc.

IEEE Spectrum publishes a list of the top programming languages for 2021, ranked by popularity.

List Of Top Programming Languages

Based on the metrics mentioned above, I have compiled this list of the most popular coding languages, so you can confidently pick any of them for your next project. But always remember to choose the one that suits your project profile.

Big Data Jobs

If you are finding it difficult to choose the best-suited language for your mobile app, you can consult a good mobile app development company, whose experts can help you pick the best technology for your app in just a few hours.

1. JavaScript

JavaScript is used for both backend and frontend programming. It is one of the most popular coding languages and is widely used within the Internet of Things (IoT). JavaScript works well with other languages, is very versatile, and is updated annually.

69.7% of developers prefer working with the JavaScript programming language.

Image Source: Stack Overflow

Instagram, Accenture, Airbnb and Slack are popular organizations using JavaScript.

JavaScript Vital features

  • Simple Client-side Calculations
  • Offer greater control
  • Handling Dates and Time
  • Generating HTML Content
  • Detecting the User’s Browser and OS


2. Python

Python, a widely accepted and popular language for 2021, is used for developing web applications, desktop apps, media tools, network servers, machine learning and more. It offers outstanding library support, control capabilities, and robust integration. If you are running a startup business, I recommend Python for your app.

YouTube, Instagram, and Pinterest are popular apps built using Python.

230,906 live websites employ Python, and an additional 600,600 sites have used Python historically.

Python Vital Features

  • Simple to code
  • Extensible feature
  • GUI Programming Support
  • Object-Oriented Language
  • Python is Portable language

3. Java

Java is an object-oriented programming language with in-built open-source libraries easily accessible for users to pick from. It is simple to handle and offers excellent documentation and community support. With this technology you can build cross-platform apps, games, Android apps, embedded systems, server apps, websites, and more.

Netflix, Google, Pinterest, Instagram are a few popular names using Java.

9490 firms reportedly use Java in their tech stacks.

Java vital features

  • Architecture neutral
  • Native threads
  • Great libraries
  • Dynamic compilation
  • Automatic memory management

4. PHP

It is one of the most commonly used programming languages for mobile apps that require database access. It is an open-source language employed for command-line scripting, server-side scripting, and coding applications.

PHP is widely utilized for creating eCommerce apps, dynamic web applications, content-heavy apps, and mobile apps. This language is highly flexible and can be quickly embedded with HTML or HTML5.

Wikipedia, Facebook, and Yahoo are very popular websites developed using PHP.

34,713,433 live websites make use of the PHP programming language.

PHP vital features

  • Error handling facility
  • powerful Zend Engine
  • Case Sensitive
  • Platform Independent
  • Brand-new Spaceship and Null Coalescing operators

5. Kotlin

Kotlin is the most commonly used programming language for building modern Android apps. It has the potential to overtake other languages, like Java, for making high-performing, excellent apps.

Trello, Evernote, Coursera are some popular apps built using Kotlin.

Kotlin vital features

  • Interoperable with Java
  • Backed by JetBrains
  • Expressive Syntax
  • Null Safety

6. HTML

HTML (Hypertext Markup Language) is the ideal choice for building web-fronted or location-based apps for mobile devices. It is a markup language that employs tags to structure and display the content of a webpage. HTML5, the latest update, adds media elements, multi-platform functionality across different programs, and quick market deployment features.

Google Docs and Google Drive are the famous Google apps using HTML.

HTML vital features

  • Allows adding Images, video, and audio
  • Hypertext can be added to the text
  • Support Geolocation service
  • Multi-platform support

7. C++

C++ is one of the most popular coding languages for mobile app development. It is an object-oriented, general-purpose language with generic and low-level memory manipulation features. It is recommended for developing gaming apps, GUI-based applications, real-time mathematical simulations, and more. C++ is successful in cloud computing apps, as it can swiftly adapt to changing hardware or ecosystems.

Google, Accenture, Walmart and Telegram are very popular brands using C++. If you want to make an app like Telegram, I recommend hiring mobile app developers, as they are able to offer innovative solutions.

1245 firms reportedly use C++ in their tech stacks.

C++ vital features

  • Case-sensitive
  • Compiler-Based
  • DMA (Dynamic Memory Allocation)
  • Platform or Machine Independent/ Portable
  • Mid-level programming language.
  • Structured programming language.
  • Rich Library and memory management

8. Rust

Rust is one of the most beloved programming languages, sponsored by Mozilla. Its syntax is similar to C++. It grants agility and security without reducing performance, though Rust may be more complicated than some of the other languages on this list.

Dropbox, Yelp, Sentry, and Postmates are famous names using Rust.

Rust vital features

  • Supports functional and imperative procedural paradigms
  • Safe, concurrent, and practical language
  • Algebraic data types
  • Efficient C bindings
  • Pattern matching

9. TypeScript

TypeScript is a top programming language created and maintained by Microsoft. It is used to build JavaScript apps for client-side and server-side execution (as with Node.js or Deno). TypeScript can help you avoid the bugs developers usually run into when writing JavaScript, by type-checking the code.

Slack, Vox Media, Accenture, and Stack are famous brands using TypeScript.

3079 corporations use TypeScript in their tech stacks.

TypeScript vital features

  • Supports other JS libraries
  • Build-in Support for JavaScript Packaging
  • Static Type-checking
  • Class and Module Support

10. CSS

This programming language is used for defining the presentation of Web pages, including fonts, colours, and layout. It allows you to adapt the presentation to different types of devices, such as small screens, large screens, or printers. CSS can be utilized with any XML-based markup language.

CSS vital features

  • Advanced selectors
  • Content-visibility property
  • Contain-intrinsic-size property
  • Fix layout problem

11. Swift

Swift is an open-source technology specially designed to work with OS X, iOS, and tvOS platforms. The programming language is scalable, flexible, and can easily adopt a secure programming pattern to add smart features to any app.

Lyft, LinkedIn, Hipmunk, and more are the famous names utilizing Swift.

2037 corporations reportedly use Swift in their tech stacks.

Swift vital features

  • Closures united with function pointers
  • Robust error handling built-in
  • Functional programming patterns (map and filter)
  • Structs that support extensions, methods, and protocols

12. C#

C# (C-Sharp) is a programming language developed by Microsoft that runs on the .NET Framework. It is used to perform a broad range of tasks and objectives, mainly on Windows, and to create web apps, mobile apps, desktop apps, games and more.

Delivery Hero, Microsoft, and Accenture are the most popular companies using C#.

2049 businesses reportedly use C# in their tech stacks.

C# vital features

  • Structured programming language
  • Component oriented
  • Interoperability
  • Rich Library
  • Object-oriented

13. Perl

Perl remains one of the most popular programming languages in 2021. It is a general-purpose language originally produced for text manipulation and now employed for a broad range of tasks including web development, system administration, GUI development, network programming, and more.

Amazon, Booking.com, MIT are popular companies using Perl.

335,008 live websites are using Perl. Additionally, 1,123,725 sites used Perl historically.

Perl vital features

  • Database integration
  • C/C++ library interface
  • Easily extendible and embeddable
  • Unicode support
  • Text manipulation

14. Scala

It is one of the top programming languages, combining both the object-oriented and functional programming paradigm. Scala enables developers to make great use of usual JVM features and Java libraries. A top-rated mobile app development company prefers using Scala for building robust apps.

Twitter, Airbnb, Thatcham, Tumblr, Netflix are famous organizations using Scala.

Scala vital features

  • Lazy computation
  • Singleton object
  • String interpolation
  • Higher-order function
  • Concurrency control

15. Scheme

Scheme is one of the oldest multi-purpose programming languages. It takes a minimalistic approach to systems application development, aiming to enlarge a small core with compelling language extensions. Scheme’s syntax is simple to learn and ideal for teaching functional programming.

Scheme is utilized by large internet entities such as Reddit and Google.

16. SQL

Structured Query Language (SQL) is employed for communicating with, assessing, and manipulating the relational databases behind most applications. SQL is designed to meet particular standards, both ISO and ANSI, covering referential integrity, the relational data model, data manipulation, data query, and data access control.

SQL vital features

  • Data Manipulation Language (DML)
  • Transaction Control Language
  • Embedded SQL
  • Advanced SQL
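The DML statements listed above can be tried out directly. Below is a minimal sketch that runs `INSERT`, `UPDATE`, and `SELECT` against an in-memory SQLite database using Python's standard `sqlite3` module; the `users` table and its contents are illustrative assumptions, not part of the article.

```python
import sqlite3

# In-memory SQLite database for demonstration purposes.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE users (id INTEGER PRIMARY KEY, name TEXT)")

# DML: INSERT new rows, then UPDATE one of them.
conn.execute("INSERT INTO users (name) VALUES (?)", ("Ada",))
conn.execute("INSERT INTO users (name) VALUES (?)", ("Grace",))
conn.execute("UPDATE users SET name = 'Ada Lovelace' WHERE name = 'Ada'")

# DML: query the result back in insertion order.
rows = conn.execute("SELECT name FROM users ORDER BY id").fetchall()
print(rows)  # [('Ada Lovelace',), ('Grace',)]
```

The same statements would work, with minor dialect differences, on any ISO/ANSI-conforming database.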

17. R Programming Language

Designed for statistical analysis and for producing high-quality graphics, R is one of the top programming languages for ad hoc analysis and for examining large datasets. You can also use R for open-source data mining projects.

R Programming Language vital features

  • Adequate data handling and storage facility
  • Offers an integrated collection of tools for data analysis
  • Proffers graphical facilities
  • Provides operators for calculations on lists, arrays, vectors and matrices

18. Golang (Go)

Go is one of the newest programming languages to achieve rapid growth, as it tackles some of the most challenging computational problems with a comparatively straightforward approach. Golang combines the advantages of C, such as being a compiled, statically typed language.

Uber, Google, and Pinterest are recognized names using Go.

2304 firms employ Go in their tech stacks.

Go vital features

  • Garbage collection
  • Structural typing
  • CSP-style concurrency
  • Built-in Testing

19. Ruby

Ruby is an object-oriented, back-end scripting language used in web application development, system utilities, servers, and standard libraries. It is designed as a high-level, multi-paradigm, general-purpose, interpreted programming language.

Twitter, Bloomberg, Airbnb, and Shopify are well-known companies using Ruby.

4957 corporations reportedly use Ruby in their tech stacks.

Ruby vital features

  • Supports MVC Architecture
  • Automated Deployment
  • Active Record
  • Simple Programming Language
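The "simple programming language" point above is easiest to see in code. Below is a minimal sketch of Ruby's expressive object-oriented style and its block syntax; the `Greeter` class and the sample values are illustrative assumptions, not part of the article.

```ruby
# A small class showing Ruby's concise object-oriented syntax.
class Greeter
  def initialize(name)
    @name = name          # instance variable set in the constructor
  end

  def greet
    "Hello, #{@name}!"    # string interpolation; last expression is returned
  end
end

puts Greeter.new("Ruby").greet        # prints "Hello, Ruby!"

# Blocks and Enumerable keep collection code short:
squares = [1, 2, 3].map { |n| n * n } # => [1, 4, 9]
```

Features such as MVC architecture and Active Record come from the Rails framework, which builds on this same syntax.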

20. Elixir

Elixir is one of the best programming languages built entirely on Erlang; it uses the Erlang runtime environment (BEAM) to run its code. The language supports modern features such as macros, metaprogramming, and polymorphism.

Vox Media, Stack, and Postmates are among the best-known companies using Elixir.

Elixir vital features

  • Fault-tolerant
  • Highly concurrent
  • Pattern-matching

Ending Words

There is constant progress in the world of programming languages. Some evergreen technologies such as Java, PHP, and JavaScript have earned an enduring place on the list, while other programming languages like R and Kotlin have grown at an exceptional pace and have made it onto the list of the most popular programming languages.

The languages on this list offer leading features and functionality that can help you build robust web and mobile apps. Now choose the right technology to develop agile web and mobile apps for your business.

To make efficient use of your chosen programming language in mobile app development, get connected with a top mobile app development company and hire mobile app developers from there; this will help your business get a full-fledged mobile application.



Top 20 Most Popular Programming Languages For 2021 and Beyond was originally published in Becoming Human: Artificial Intelligence Magazine on Medium, where people are continuing the conversation by highlighting and responding to this story.


source https://365datascience.weebly.com/the-best-data-science-blog-2020/top-20-most-popular-programming-languages-for-2021-and-beyond

Working With Sparse Features In Machine Learning Models

Sparse features can cause problems like overfitting and suboptimal results in learning models, and understanding why this happens is crucial when developing models. Multiple methods, including dimensionality reduction, are available to overcome issues due to sparse features.

Originally from KDnuggets https://ift.tt/3bydpfi

source https://365datascience.weebly.com/the-best-data-science-blog-2020/working-with-sparse-features-in-machine-learning-models
