Practical Introduction to Machine Learning with TensorFlow and Keras


#1

Machine learning enables a system to learn from examples and improve itself without being explicitly programmed. The idea is that a machine can learn from data/examples to produce accurate results in the future. Machine learning combines data with statistical tools to predict an output that can be turned into actionable insights. The machine receives data as input and uses an appropriate algorithm to formulate answers. Machine learning rests on the idea that patterns exist in the data which, once identified, can be used for future predictions.

Definition

Machine learning is the study of algorithms that learn and improve from examples and experience.

A typical machine learning task is to provide recommendations. For example, for those who have social media accounts such as Facebook, Twitter, and Instagram, the recommendations and advertisements that pop up when you visit these sites are based on your historical data. Technology companies use machine learning techniques to improve the user experience with personalized recommendations. Machine learning is also used for tasks such as fraud detection, predictive maintenance, portfolio optimization, and many other kinds of automation.

Traditional Programming versus Machine Learning

In traditional programming, a programmer codes all the rules for the software being developed: a logical, step-by-step set of instructions that the computer executes to produce an output. In contrast, in machine learning the machine learns how the input and output data are correlated and writes the rule itself. Programmers do not need to write new rules each time there is new data; the algorithms adapt and respond to new data and new experiences to improve effectiveness over time. The more data and experience the machine accumulates, the more accurately it is likely to predict when given new data.
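To make the contrast concrete, here is a minimal sketch (it uses Keras, which is introduced later in this topic; the Celsius-to-Fahrenheit task is our own illustrative choice, not part of the original text). The traditional function encodes the conversion rule explicitly; the learned model sees only input-output examples and recovers the rule itself:

  # Traditional programming: the programmer writes the rule explicitly
  def fahrenheit(celsius):
      return celsius * 9.0 / 5.0 + 32.0

  # Machine learning: only (input, output) examples are given; the rule is learned
  import numpy as np
  from tensorflow import keras

  celsius = np.array([-40.0, -10.0, 0.0, 8.0, 15.0, 22.0, 38.0])
  fahr = np.array([-40.0, 14.0, 32.0, 46.4, 59.0, 71.6, 100.4])

  model = keras.Sequential([keras.layers.Dense(1, input_shape=[1])])
  model.compile(optimizer=keras.optimizers.Adam(0.1), loss='mean_squared_error')
  model.fit(celsius, fahr, epochs=500, verbose=0)

  print(fahrenheit(100.0))                   # 212.0, from the coded rule
  print(model.predict(np.array([[100.0]])))  # close to 212, from the learned rule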

Within machine learning are neural networks, inspired by the brain (neuroscience), and then deep learning. In principle, deep learning is a subset/sub-field of machine learning, and machine learning is one part of Artificial Intelligence (AI). An example of a deep learning network is the convolutional neural network (CNN), developed by Yann LeCun. Deep learning has been around since the 1990s, but its recent advancements and ever-growing popularity are propelled by more powerful hardware, greater data availability, and more sophisticated, accurate algorithms.

Deep Learning

Deep learning is a sub-field of machine learning. Deep learning in the context of AI means the machine uses different layers to learn and improve from the data. The depth of the model is represented by the number of layers in the model. The learning phase in deep learning takes place through a neural network. A neural network is an architecture where the layers are stacked on top of each other.
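As a small sketch of this idea (using the Keras API covered later in this topic; the layer sizes here are arbitrary), the depth is simply the number of stacked layers:

  from tensorflow import keras

  # A 4-layer ("deep") network: each layer learns from the previous one's output
  model = keras.Sequential([
      keras.layers.Dense(64, activation='relu', input_shape=(16,)),
      keras.layers.Dense(64, activation='relu'),
      keras.layers.Dense(32, activation='relu'),
      keras.layers.Dense(1)
  ])
  print(len(model.layers))  # 4: the depth of this model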

AI versus machine learning

Although machine learning is one part of AI, in everyday usage AI connotes the science of training machines to apply cognitive functions such as perceiving, learning, and reasoning in order to perform human tasks. AI research started in the 1950s, when scientists began to explore ways to make computers mimic human aptitudes and solve problems on their own. Their primary intention, which drove the pursuit of intelligent machines/computers, was to develop an ability to "tackle every aspect of learning or any other feature of intelligence that can in principle be so precisely described, that the machine can be made to simulate it."

AI has three different levels:
  • Narrow AI, when it can perform a specific task better than a human
  • General AI, when it can perform any intellectual task with the same accuracy level as a human
  • Super AI, when it can beat humans in many tasks

On the other hand, machine learning (ML) is a distinct sub-field/subset of AI that trains a machine how to learn. ML is based on the idea that machines can learn on their own from examples/data to recognize patterns and develop rules they can use for future predictions. Recently, applications of AI have been very successful; examples of areas where AI shines are reducing or avoiding repetitive tasks and improving existing products by adding new features or enhancing their functionality. AI is used in almost every industry, from marketing to supply chain, finance, and food processing. Recent surveys show that financial services and high-tech communications are at the forefront of AI adoption. AI is a cutting-edge technology that can deal with complex data that would be impossible for a human being to handle.

How does it work?

Back in the 1950s, engineers decided to write computer programs that would try to imitate human intelligence. This attempt led to the emergence of a new field of study within AI, called machine learning. This is the field of study where machines are trained through examples and experiences: they learn through lots of data about something so that they can automate a process that picks out various features/behaviors/characteristics and makes accurate predictions. Just as with humans, the more we know or experience about a situation, the more easily we can predict similar situations in the future, and vice versa. Machines are trained the same way.

In summary, a machine learns through examples; when we feed it an example similar to what it has seen, it can figure out the outcome and make an accurate prediction. On the other hand, the machine has difficulty predicting if you give it an example that it has never experienced before.

The machine learns by discovering patterns in the given data and making an inference -- a conclusion reached on the basis of evidence and reasoning. Machine learning uses mathematically sophisticated algorithms to simplify the physical reality of some phenomenon and transform this discovery into a model.

Feature Vector

The data used to train the machine is crucial and should be chosen carefully. This data is usually associated with a list of attributes called a feature vector -- a vector containing information that characterizes an object's important attributes/features. We can think of a feature vector as the subset of the data that is used to tackle a problem. A feature vector is simply a collection of attributes (features) extracted from the input data; a set of feature vectors is usually arranged as a matrix, one vector per example.
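For instance, here is a minimal sketch of a feature vector (the house attributes are invented for illustration):

  import numpy as np

  # One object (a house) described by three features: [size_m2, bedrooms, age_years]
  x = np.array([120.0, 3.0, 15.0])   # a single feature vector

  # A dataset stacks one feature vector per row, forming a matrix
  X = np.array([[120.0, 3.0, 15.0],
                [85.0, 2.0, 30.0],
                [200.0, 4.0, 5.0]])
  print(X.shape)  # (3, 3): 3 examples, each with 3 features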

Machine Learning Phase

The machine learning pipeline proceeds, in order, through: training data, feature vector, algorithm, and then model.

We have seen that machine learning is the general term that implies computers learn from data. Computers learn in different ways (algorithms), which can be grouped into supervised, unsupervised, and reinforcement learning:

A. Supervised learning

The data that is fed to a machine learning algorithm can be in the form of input-output pairs or just inputs. Supervised learning algorithms require input-output pairs for their training: they use labeled data to train the model. The Celsius-to-Fahrenheit sketch above is an example of supervised learning.

B. Unsupervised learning

Here the machine only requires input data. An unsupervised learning algorithm takes example inputs repeatedly, learns from the data, and clusters/classifies the inputs into groups, eventually picking up the patterns that distinguish the inputs from one another. The algorithm will then be able to assign a brand-new input to one of the discovered groups.
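Here is a minimal clustering sketch. It uses scikit-learn's KMeans, which is an assumption on our part (the library is not part of this TensorFlow tutorial); the points are invented so they form two obvious groups:

  import numpy as np
  from sklearn.cluster import KMeans

  # Unlabeled inputs only: two loose groups of 2-D points, no outputs given
  X = np.array([[1.0, 1.1], [0.9, 1.0], [1.2, 0.8],
                [8.0, 8.2], [7.9, 8.1], [8.3, 7.8]])

  # The algorithm discovers the grouping on its own
  km = KMeans(n_clusters=2, n_init=10).fit(X)
  print(km.labels_)                          # e.g. [0 0 0 1 1 1]
  print(km.predict(np.array([[8.1, 8.0]])))  # a new input joins the nearest group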

C. Reinforcement algorithms

Here the machine learns and improves through feedback. Reinforcement learning concerns training an agent to interact with its environment so as to maximize its reward. This mode of learning, in combination with deep learning, is on the bleeding edge and massively drives AI. Example applications in this area include self-driving cars and machines that can play video games.
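As a minimal sketch of the feedback loop (a toy tabular Q-learning example we add for illustration; real systems such as self-driving cars use far richer state and deep networks):

  import numpy as np

  # Toy world: states 0..4 in a line; reward only for reaching the last state.
  # The agent learns from reward feedback alone that "move right" pays off.
  n_states, n_actions = 5, 2            # actions: 0 = left, 1 = right
  Q = np.zeros((n_states, n_actions))   # learned value of each action per state
  alpha, gamma, eps = 0.1, 0.9, 0.2     # learning rate, discount, exploration

  rng = np.random.default_rng(0)
  for episode in range(500):
      s = 0
      while s != n_states - 1:
          # Explore occasionally, otherwise take the best-known action
          a = rng.integers(n_actions) if rng.random() < eps else int(np.argmax(Q[s]))
          s_next = max(0, s - 1) if a == 0 else s + 1
          r = 1.0 if s_next == n_states - 1 else 0.0
          # Q-learning update: nudge the estimate toward reward + future value
          Q[s, a] += alpha * (r + gamma * np.max(Q[s_next]) - Q[s, a])
          s = s_next

  print(np.argmax(Q[:-1], axis=1))  # learned policy: [1 1 1 1], always move right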

TensorFlow

Google's TensorFlow is currently the world's most popular open-source ML (deep learning) framework/library for research and production. TensorFlow has a very large API stack and powers almost every machine learning project at Google; that is, it runs at scale as well as on small devices like mobile phones. This very large ML framework has been developed rapidly, has a lot of different pieces, and runs on devices ranging from CPUs, GPUs, and Cloud TPUs (tensor processing units) to Android, iOS, and embedded systems. According to this post, "for the most part, the TensorFlow core is written in a combination of highly-optimized C++ and CUDA (Nvidia's language for programming GPUs). Much of that happens, in turn, by using Eigen (a high-performance C++ and CUDA numerical library) and NVidia's cuDNN (a very optimized DNN library for NVidia GPUs, for functions such as convolutions)."

Google uses AI and machine learning to take advantage of its massive data sets to improve efficiency and give users, such as researchers, scientists, and programmers/developers, the best experience. Most Google products use machine learning to improve the search engine, translation, image captioning, recommendations, and email communications. The TensorFlow library was initially developed by the Google Brain Team to accelerate machine learning and deep neural network research. TensorFlow has wrappers in a number of languages such as Python, C++, and Java, and is built to run on CPUs or GPUs.

TensorFlow has been around for several years: it was first made public in late 2015, with the first stable version appearing in 2017. By July 2018 it had around 1,500 active developers, with about 1,000 contributors coming from outside Google. TensorFlow is licensed under the Apache License 2.0.

At a high level, almost every programmer writes TensorFlow code in Python, while TensorFlow itself is powered by a C++ back-end. In this TensorFlow introduction, we will focus only on the Python front-end. There is also TensorFlow.js, which is fantastic for writing really useful TensorFlow projects in JavaScript. TensorFlow thus supports several other languages, such as Java, JavaScript, and R.

TensorFlow is not just a Python API for training certain models; it contains a massive collection of projects -- a giant codebase that includes everything from research code to putting models into production. According to Google, TensorFlow has no secret sauce: the open-source release is exactly what is used internally by Google.

Why the term TensorFlow?

TensorFlow takes inputs as multi-dimensional arrays, known as tensors. A tensor is an N-dimensional array -- a generalization of vectors and matrices that can represent all types of data. The values in a tensor all carry the same data type, with a known (or partially known) dimensionality (the shape of the array).
In TensorFlow it is possible to construct a set of operations on the input that proceed in a flowchart-like model, also called a graph -- a set of successive computations. The input tensors enter the system at one end, flow through a series of operations, and come out at the other side as output; hence the name TensorFlow.

A tensor may originate from the input data or result from a computation. Thus, computations in TensorFlow involve tensors, and all operations are carried out inside a graph (by connecting tensors together). Each operation is called an "op node", and the nodes are connected to each other. The edges between nodes are the tensors, which is how operations are populated with data.
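A small sketch of these ideas (the particular values are arbitrary):

  import tensorflow as tf

  # Tensors of increasing rank; each has a dtype and a known shape
  scalar = tf.constant(3.0)                    # rank 0, shape ()
  vector = tf.constant([1.0, 2.0])             # rank 1, shape (2,)
  matrix = tf.constant([[1.0, 2.0],
                        [3.0, 4.0]])           # rank 2, shape (2, 2)

  # Each op node consumes tensors along its edges and produces new tensors
  y = tf.add(tf.multiply(matrix, scalar), vector)
  print(y.shape, y.dtype)  # (2, 2) <dtype: 'float32'>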

Deep learning relies on a lot of algebraic operations/computations, especially matrix multiplication.
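For example, a fully-connected (Dense) layer -- which we will meet shortly -- boils down to one matrix multiplication plus a bias. A sketch with arbitrary random weights:

  import tensorflow as tf

  inputs = tf.random.normal([32, 784])   # a batch of 32 flattened 28x28 images
  W = tf.random.normal([784, 128])       # weight matrix (learned during training)
  b = tf.zeros([128])                    # bias vector

  # outputs = activation(inputs x W + b): the core computation of a Dense layer
  outputs = tf.nn.relu(tf.matmul(inputs, W) + b)
  print(outputs.shape)  # (32, 128)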

API Styles

There are several API styles; they include: 1. Keras, 2. Estimators, 3. Eager execution, 4. Deferred execution. Here we will focus on tf.keras, which is the TensorFlow (tf) implementation of the Keras API.

When you define a neural network, one task you have to do is lay down a sequence of layers. Keras is, broadly, an API for defining and training neural networks using Lego-like building blocks. Keras is an API specification that traditionally runs on top of other deep learning libraries: you can run Keras on top of TensorFlow, or on top of other libraries such as MXNet and CNTK. In addition, Keras is incorporated into TensorFlow as tf.keras, which means we can write Keras code directly inside TensorFlow.

Keras: A high-level API specification

Keras is used to build and train neural networks using Lego-like building blocks. If we want to use Keras outside TensorFlow, this is how we would install and import it:

  pip install keras
  import keras


By default, this will also install tf behind the scenes and use it as a back-end.

tf.keras: TensorFlow's implementation of the Keras API

This is a superset of the Keras API and is useful in the following ways:
  • Adds support for tf-specific functionality, e.g., eager execution
  • Makes tf much easier to use
To use Keras inside tf, install TensorFlow and then import Keras from it:

  pip install tensorflow
  from tensorflow import keras


In this way, there is no need to install Keras separately; everything is bundled in.

In this machine learning introductory topic, we will use and fully adopt some materials from Google Colaboratory, a free web-based Jupyter notebook environment which runs entirely in the cloud. It comes with TensorFlow pre-installed, so anyone can jump in and start writing code. Colaboratory is great for educational demos and comes with a free GPU and lots of other resources. With Colaboratory, one can write and run code, save and share analyses, and access powerful computing resources, all for free, right from the browser. We introduce how TensorFlow works with the image-classification "Hello World" program.

Image classification (MNIST)

We will use tf.keras, a high-level API for building and training models in TensorFlow, to train a neural network model to classify images of clothing, like sneakers and shirts, from the Fashion MNIST dataset. Here are a few things to note:

To run or execute a cell, use Shift/Ctrl + Enter. Cells need to be run in order, just as with any Jupyter notebook. To restore a notebook to a clean start, select Runtime -> Restart runtime. To enable the GPU, go to Edit -> Notebook settings, set the Hardware accelerator to GPU, and save the changes. Click Connect in the top right corner to connect to a Google Compute Engine backend (GPU, TPU, etc.). On the left bar near the top, you can navigate the table of contents, show the code-snippet panel and filter/find code snippets, upload files, refresh, or even mount Google Drive. We take advantage of Colaboratory in the sense that we do not need to install anything because all requirements are satisfied; however, everything we are going to show can also be done with TensorFlow locally. The notebook can be saved to Google Drive or GitHub.

We first need to import the following libraries/routines:

  from __future__ import absolute_import, division, print_function, unicode_literals

  # TensorFlow and tf.keras
  import tensorflow as tf
  from tensorflow import keras

  # Helper libraries
  import numpy as np
  import matplotlib.pyplot as plt

  print(tf.__version__)


It is important to print the TensorFlow version to verify that you are using the required version; as of this writing, print(tf.__version__) yields version 2.1.0-rc1.

Importing the Fashion MNIST dataset

The Fashion MNIST dataset contains 70,000 grayscale images in 10 categories. The images show individual articles of clothing at low resolution (28 by 28 pixels), see here.

Here, 60,000 images are used to train the network and 10,000 images to evaluate how accurately the network learned to classify the images. We can access the Fashion MNIST directly from TensorFlow. Importing and loading the Fashion MNIST data directly from TensorFlow goes as follows:

  fashion_mnist = keras.datasets.fashion_mnist
  (train_images, train_labels), (test_images, test_labels) = fashion_mnist.load_data()


Executing this downloads and loads the dataset:

  Downloading data from https://storage.googleapis.com/tensorflow/tf-keras-datasets/train-labels-idx1-ubyte.gz
  32768/29515 [=================================] - 0s 0us/step
  Downloading data from https://storage.googleapis.com/tensorflow/tf-keras-datasets/train-images-idx3-ubyte.gz
  26427392/26421880 [==============================] - 0s 0us/step
  Downloading data from https://storage.googleapis.com/tensorflow/tf-keras-datasets/t10k-labels-idx1-ubyte.gz
  8192/5148 [===============================================] - 0s 0us/step
  Downloading data from https://storage.googleapis.com/tensorflow/tf-keras-datasets/t10k-images-idx3-ubyte.gz
  4423680/4422102 [==============================] - 0s 0us/step


Loading the dataset returns four NumPy arrays:
  • The train_images and train_labels arrays are the training set -- the data the model uses to learn.
  • The model is tested against the test set, the test_images, and test_labels arrays.
The images are 28 x 28 NumPy arrays, with pixel values ranging from 0 to 255. The labels are an array of integers, ranging from 0 to 9. These correspond to the class of clothing the image represents:

Label   Class
0       T-shirt/top
1       Trouser
2       Pullover
3       Dress
4       Coat
5       Sandal
6       Shirt
7       Sneaker
8       Bag
9       Ankle boot

Each image is mapped to a single label. Since the class names are not included with the dataset, we store them here for a later use when plotting the images:

  class_names = ['T-shirt/top', 'Trouser', 'Pullover', 'Dress', 'Coat',
                 'Sandal', 'Shirt', 'Sneaker', 'Bag', 'Ankle boot']


Exploring the data

Let's explore the format of the dataset before training the model. Executing

  train_images.shape


yields (60000, 28, 28), which shows that there are 60,000 images in the training set, with each image represented as 28 x 28 pixels.

Similarly, there are 60,000 labels in the training set:

  len(train_labels)
  60000


Each label is an integer between 0 and 9:

  train_labels
  array([9, 0, 0, ..., 3, 0, 5], dtype=uint8)


There are 10,000 images in the test set. Again, each image is represented as 28 x 28 pixels:

  test_images.shape
  (10000, 28, 28)


And the test set contains 10,000 image labels:

  len(test_labels)
  10000


TensorFlow Architecture

The TensorFlow workflow consists of three steps:
  • Preprocessing the data
  • Building the model
  • Training and estimating the model
We explore each of these steps sequentially.

Preprocessing the Data

The data must be preprocessed before training the network. If we inspect the first image in the training set, we will notice that the pixel values fall in the range of 0 to 255:

  plt.figure()
  plt.imshow(train_images[0])
  plt.colorbar()
  plt.grid(False)
  plt.show()

[Image: image_inspec.png -- Preprocessed data image]

We scale these values to a range of 0 to 1 before feeding them to the neural network model. To do so, we divide the values by 255. It's important that the training set and the testing set be preprocessed in the same way:

  train_images = train_images / 255.0
  test_images = test_images / 255.0


To verify that the data is in the correct format and that we're ready to build and train the network, let's display the first 25 images from the training set and display the class name below each image:

  plt.figure(figsize=(10,10))
  for i in range(25):
      plt.subplot(5,5,i+1)
      plt.xticks([])
      plt.yticks([])
      plt.grid(False)
      plt.imshow(train_images[i], cmap=plt.cm.binary)
      plt.xlabel(class_names[train_labels[i]])
  plt.show()

[Image: 25_images.png -- TensorFlow MNIST images]

Building the Model

Building the neural network requires configuring the layers of the model, then compiling the model.

Setting up the layers

The basic building block of a neural network is the layer. Layers extract representations from the data fed into them. Hopefully, these representations are meaningful for the problem at hand.

Most of deep learning consists of chaining together simple layers. Most layers, such as tf.keras.layers.Dense, have parameters that are learned during training.

  model = keras.Sequential([
      keras.layers.Flatten(input_shape=(28, 28)),
      keras.layers.Dense(128, activation='relu'),
      keras.layers.Dense(10, activation='softmax')
  ])


The first layer in this network, tf.keras.layers.Flatten, transforms the format of the images from a two-dimensional array (of 28 by 28 pixels) to a one-dimensional array (of 28 * 28 = 784 pixels). Think of this layer as unstacking rows of pixels in the image and lining them up. This layer has no parameters to learn; it only reformats the data.

After the pixels are flattened, the network consists of a sequence of two tf.keras.layers.Dense layers. These are densely connected, or fully-connected, neural layers. The first Dense layer has 128 nodes (or neurons). The second (and last) layer is a 10-node softmax layer that returns an array of 10 probability scores that sum to 1. Each node contains a score that indicates the probability that the current image belongs to one of the 10 classes.
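We can confirm the layer shapes and the parameters to be learned with model.summary(); the counts below follow directly from the architecture defined above:

  model.summary()
  # Flatten: 0 parameters (it only reshapes the data)
  # Dense(128): 784 * 128 weights + 128 biases = 100,480 parameters
  # Dense(10): 128 * 10 weights + 10 biases = 1,290 parameters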

As previously introduced, a computation graph consists of nodes and edges. A node carries a mathematical operation and produces output endpoints; the edges describe the input/output relationships between nodes.

Compiling the model

Before the model is ready for training, it needs a few more settings. These are added during the model's compile step:
  • Loss function -- This measures how accurate the model is during training. We want to minimize this function to "steer" the model in the right direction.
  • Optimizer -- This is how the model is updated based on the data it sees and its loss function.
  • Metrics -- Used to monitor the training and testing steps. The following example uses accuracy, the fraction of the images that are correctly classified.
  model.compile(optimizer='adam',
                loss='sparse_categorical_crossentropy',
                metrics=['accuracy'])


Training the Model

Training the neural network model requires the following steps:
  1. Feed the training data to the model. In this example, the training data is in the train_images and train_labels arrays.
  2. The model learns to associate images and labels.
  3. You ask the model to make predictions about a test set -- in this example, the test_images array.
  4. Verify that the predictions match the labels from the test_labels array.
Feeding the model

To start training, call the model.fit method -- so called because it "fits" the model to the training data:

  model.fit(train_images, train_labels, epochs=10)
  Train on 60000 samples
  Epoch 1/10
  60000/60000 [==============================] - 7s 109us/sample - loss: 0.4965 - accuracy: 0.8271
  Epoch 2/10
  60000/60000 [==============================] - 4s 73us/sample - loss: 0.3744 - accuracy: 0.8655
  Epoch 3/10
  60000/60000 [==============================] - 4s 73us/sample - loss: 0.3374 - accuracy: 0.8771
  Epoch 4/10
  60000/60000 [==============================] - 4s 72us/sample - loss: 0.3106 - accuracy: 0.8858
  Epoch 5/10
  60000/60000 [==============================] - 4s 72us/sample - loss: 0.2927 - accuracy: 0.8919
  Epoch 6/10
  60000/60000 [==============================] - 5s 75us/sample - loss: 0.2812 - accuracy: 0.8962
  Epoch 7/10
  60000/60000 [==============================] - 4s 74us/sample - loss: 0.2686 - accuracy: 0.8996
  Epoch 8/10
  60000/60000 [==============================] - 4s 73us/sample - loss: 0.2566 - accuracy: 0.9043
  Epoch 9/10
  60000/60000 [==============================] - 4s 72us/sample - loss: 0.2486 - accuracy: 0.9064
  Epoch 10/10
  60000/60000 [==============================] - 4s 72us/sample - loss: 0.2377 - accuracy: 0.9113

  <tensorflow.python.keras.callbacks.History at 0x7f4be233ac88>


As the model trains, the loss and accuracy metrics are displayed. This model reaches an accuracy of about 0.91 (or 91%) on the training data.

Evaluating accuracy

Next, we compare how the model performs on the test dataset:

  test_loss, test_acc = model.evaluate(test_images, test_labels, verbose=2)
  print('\nTest accuracy:', test_acc)

  10000/10000 - 1s - loss: 0.3385 - accuracy: 0.8786

  Test accuracy: 0.8786


It turns out that the accuracy on the test dataset is a little lower than the accuracy on the training dataset. This gap between training accuracy and test accuracy represents overfitting. Overfitting is when a machine learning model performs worse on new, previously unseen inputs than on the training data; an overfitted model "memorizes" the training data and generalizes poorly to the testing data. For more information, see Overfit and underfit, and strategies to prevent overfitting.
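One common mitigation, sketched here as an aside (not part of this tutorial's original flow), is to hold out part of the training data for validation and stop training once the validation loss stops improving:

  # Assumes the compiled model and training arrays from above
  early_stop = keras.callbacks.EarlyStopping(monitor='val_loss', patience=2,
                                             restore_best_weights=True)
  model.fit(train_images, train_labels, epochs=50,
            validation_split=0.1, callbacks=[early_stop])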

Making Predictions

With the model trained, we can now use it to make predictions about some images.

  predictions = model.predict(test_images)


Here, the model has predicted the label for each image in the testing set. Let's take a look at the first prediction:

  predictions[0]
  array([3.5061245e-09, 1.7832109e-12, 2.8170513e-09, 7.0614859e-12,
         4.2359991e-09, 3.8344726e-05, 2.7043097e-09, 3.8953245e-04,
         5.1749037e-08, 9.9957210e-01], dtype=float32)


A prediction is an array of 10 numbers. They represent the model's "confidence" that the image corresponds to each of the 10 different articles of clothing. We can see which label has the highest confidence value:

  np.argmax(predictions[0])
  9


So, the model is most confident that this image is an ankle boot, or class_names[9]. Examining the test label shows that this classification is correct:

  test_labels[0]
  9


We graph this to look at the full set of 10 class predictions:

  def plot_image(i, predictions_array, true_label, img):
    predictions_array, true_label, img = predictions_array, true_label[i], img[i]
    plt.grid(False)
    plt.xticks([])
    plt.yticks([])

    plt.imshow(img, cmap=plt.cm.binary)

    predicted_label = np.argmax(predictions_array)
    if predicted_label == true_label:
      color = 'blue'
    else:
      color = 'red'

    plt.xlabel("{} {:2.0f}% ({})".format(class_names[predicted_label],
                                  100*np.max(predictions_array),
                                  class_names[true_label]),
                                  color=color)

  def plot_value_array(i, predictions_array, true_label):
    predictions_array, true_label = predictions_array, true_label[i]
    plt.grid(False)
    plt.xticks(range(10))
    plt.yticks([])
    thisplot = plt.bar(range(10), predictions_array, color="#777777")
    plt.ylim([0, 1])
    predicted_label = np.argmax(predictions_array)

    thisplot[predicted_label].set_color('red')
    thisplot[true_label].set_color('blue')


Verifying predictions

Let's look at the 0th image, predictions, and prediction array. Correct prediction labels are blue and incorrect prediction labels are red. The number gives the percentage (out of 100) for the predicted label:

  i = 0
  plt.figure(figsize=(6,3))
  plt.subplot(1,2,1)
  plot_image(i, predictions[i], test_labels, test_images)
  plt.subplot(1,2,2)
  plot_value_array(i, predictions[i], test_labels)
  plt.show()

[Image: Ankle_boot.png -- Prediction with TensorFlow]

  i = 12
  plt.figure(figsize=(6,3))
  plt.subplot(1,2,1)
  plot_image(i, predictions[i], test_labels, test_images)
  plt.subplot(1,2,2)
  plot_value_array(i, predictions[i], test_labels)
  plt.show()

[Image: Sneaker.png -- Prediction with TensorFlow]

Let's plot several images with their predictions. Note that the model can be wrong even when very confident.

  # Plot the first X test images, their predicted labels, and the true labels.
  # Color correct predictions in blue and incorrect predictions in red.
  num_rows = 5
  num_cols = 3
  num_images = num_rows*num_cols
  plt.figure(figsize=(2*2*num_cols, 2*num_rows))
  for i in range(num_images):
    plt.subplot(num_rows, 2*num_cols, 2*i+1)
    plot_image(i, predictions[i], test_labels, test_images)
    plt.subplot(num_rows, 2*num_cols, 2*i+2)
    plot_value_array(i, predictions[i], test_labels)
  plt.tight_layout()
  plt.show()


Using the Trained Model

Finally, we can use the trained model to make a prediction about a single image.

  # Grab an image from the test dataset.
  img = test_images[1]
  print(img.shape)
  (28, 28)


tf.keras models are optimized to make predictions on a batch, or collection, of examples at once. Accordingly, even though we're using a single image, we need to add it to a batch (a list/array):

  # Add the image to a batch where it's the only member.
  img = (np.expand_dims(img,0))
  print(img.shape)
  (1, 28, 28)


Now, let's predict the correct label for this image:

  predictions_single = model.predict(img)
  print(predictions_single)
  [[6.8552431e-04 4.7085687e-14 9.9778843e-01 2.8135696e-09 1.3103909e-03
    2.6281535e-09 2.1459218e-04 2.8115356e-17 1.1111447e-06 7.8943376e-14]]
  # Plot
  plot_value_array(1, predictions_single[0], test_labels)
  _ = plt.xticks(range(10), class_names, rotation=45)

[Image: Image_prediction.png -- Verifying image prediction]
[Image: Road_map.png]

model.predict returns a list of lists -- one list for each image in the batch of data. We grab the predictions for our (only) image in the batch:

  np.argmax(predictions_single[0])
  2


We see that the model predicts the label as expected.
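As a quick extra check (a one-liner we add here), the predicted label can be mapped back to its class name:

  print(class_names[np.argmax(predictions_single[0])])  # Pullover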

In our next lesson, we will learn about Text Classification with TensorFlow.

#2

Here is a Google Colaboratory notebook for machine learning education. See also how to optimize its usage. We can create and embed as many notebooks as we want.


#3

Artificial Intelligence, Machine Learning, DeepLearning, and Data Science; what are the differences?




What constitutes a data science course content?



The YouTuber Nathamon Mynd textually visualizes everything that has been mentioned:

1 Statistics
2 Linear algebra
3 Calculus -- differential calculus, differential geometry, tensor analysis?

Web Scraping

1 Beautiful soup
2 Scrapy
3 urllib

Programming Language

1 Python
2 R
3 Java

Data Visualisation

1 Tableau
2 Power BI
3 Matplotlib, ggplot2, Seaborn

Data Analysis

1 Feature Engineering (FE)
2 Data Wrangling
3 Exploratory Data Analysis (EDA)

Machine Learning

1 Classification
2 Regression
3 Reinforcement Learning
4 Deep Learning
5 Dimensionality Reduction
6 Clustering

IDE

1 Pycharm
2 Jupyter
3 Spyder

Deployments

AWS
Azure

I would add:

Excel (Excel programming, reading and writing data from/to Excel) -- this is widely used
LaTeX for scientific writing
Pandas - for data manipulation and analysis
SciPy
NumPy
Plotly
Git and GitHub
Gnuplot - for visualization
Linux - At least introduction
Databases


Machine Learning Course:



Also, check:

how-to-become-a-data-scientist-5564

#4

Building an Artificial Neural Network (ML) Model


#5

Execute the below Colab notebook snippets one by one here:



#6

Convolutional Neural Networks by MIT


#7

Recurrent Neural Networks (Deep Sequence Modeling)

#10

Deep Learning Part 1



Deep Learning Part 2



Deep Learning & CNN Practicals


