Skip to main content

Do you know Reinforcement Learning?

 Reinforcement Learning is different from the other 2 kinds as here an agent interacts with the environment. The agent initially selects action by random. But after performing action it receives positive or negative reinforcement (reward or punishment) and ultimately the agent tries to select those actions which will earn it most rewards.


The agent selects action based on a policy. 

A policy is just a probability function that takes in the state or the observed environment as input and gives a probability for each valid action (Summing to one). As the agent learns or observes the relation between state and action and rewards it updates its policy function. 

There are 2 kinds of value functions: State-value function and Actions-value function.
The state-value function takes the state as input and gives value or expectation of reward associated with the state.
The action-value function takes state and action as input and gives value or expected reward of taking that action in that state.

Along with the policy function, the value function too needs to be updated as the experience increases.

The interaction between agent and environment may or may not be continuous. It means a task may have a terminal or an end state after which a new episode of interaction may start. eg. Self-driving car-episode ends when the car reaches the destination. eg. Robot in a factory, the task never ends but the robot is just switched off when shutting down the factory at night. Therefore, the rewards may be given at the end of an episode. There are several algorithms that account for this delayed reward.

Most of the game playing systems are created using Reinforcement Learning. Some of the famous examples are Samuel's Checkers Playing program, Deep Blue, AlphaGo (It recently made headlines when it won against the world champion in the game of Go).

Some of the Reinforcement learning algorithms are:
Monte-Carlo 
Sarsa
Expected Sarsa
Q-learning
Actor-Critic 
TD (lambda)







Comments

Popular posts from this blog

What is AI?

 It actually seems funny to write answer to this question (as it's so unusual to find an article about this these days 🤔).  AI is short for Artificial Intelligence or intelligence which is created by humans. But what is intelligence then? Intelligence breaks into the tasks that beings are capable of doing. Like thinking, memorising, remembering, deciding, reasoning, predicting, recognising, improving, inventing, reproducing, dreaming, assuming, surviving, feeling, hoping, coping, all these tasks ending with an 'ing' reminds us that they will never end until life (except for reproducing😉). The thing that makes beings actually alive is knowing that they are.  But are all the beings intelligent? Not all of them carry out all those tasks. As being smartest of all we humans still don't know if a mouse dreams or not (atleast Jerry does😏). But we do know, beings with a smaller brain or number of brain cells cannot carry out complex tasks. I'm sure ameoba can't recog...

Do you know Machine Learning?

 Machine Learning is like Jesus, It's everywhere... From pizzerias to Notco (a company which uses AI to make vegan food that tastes like meat) and from banks to Netflix all are using Machine Learning. But can machines actually learn something? 🧐 There are several algorithms that improve performance on a particular task with experience, that's it. By the way, if anyone asked, that was the definition of Machine Learning. The thing that computer systems can actually increase their performance or learn tasks is what AI is driven by.  Machine Learning is basically divided into 3 categories, viz, Supervised Learning, Unsupervised Learning, and Reinforcement Learning.       Supervised Learning is learning from a training set of labeled examples provided by a knowledgeable external supervisor. Each example is a description of a situation together with a specification—the label—of the correct action the system should take to that situation, which is often to ident...

Do you know steps in building a full Machine Learning model?

1. Data Collection In Machine Learning the data is the most important thing, unlike humans who look at a person's face a few times and recognize him/her, ML needs tons of data. The 2001's paper from Microsoft showed that moderate and complex models performed almost the same given sufficient data.  Apart from it, the quality of data is also important, data that does not represent appropriate relation between features and their label is of no use.  2. Data Preprocessing The preprocessing of data is very essential before feeding it to the algorithm, removing irrelevant features, merging highly correlated features, removing or manually adding missing values, and converting data to numeric values, suppose the data contains a feature representing the country and your dataset consists of many countries which might be moderately correlated to your output so you might not wanna remove it, you can convert it into a one-hot encoding  (a zero vector of length equal to the number of c...