Karma Yoga

How often do we yield a difficult situation or person over to the karma gods? With the belief that the law of cause & effect will eventually catch up over some kind of selfish or inappropriate…

Smartphone

独家优惠奖金 100% 高达 1 BTC + 180 免费旋转




GELU activation

GELUs full form is GAUSSIAN ERROR LINEAR UNIT

Activations like ReLU, ELU and PReLU have enabled faster and better convergence of Neural Networks than sigmoids.

Also, Dropout regularizes the model by randomly multiplying a few activations by 0.

Both of the above methods together decide a neuron’s output. Yet, the two work independently from each other. GELU aims to combine them.

Also, a new RNN regularizer called Zoneout stochastically multiplies the input by 1.

We want to merge all 3 functionalities by stochastically multiplying the input by 0 or 1 and getting the output value (of the activation function) deterministically.

We chose this distribution since neuron’s input follow a normal distribution, especially after Batch Normalization.

But the output of any activation function should be deterministic, not stochastic. So, we find the expected value of our transformation.

Since Φ(x) is a cumulative distribution of Gaussian distribution and is often computed with the error function, hence we define Gaussian Error Linear Unit (GELU) as-

GELU (μ=0, σ=1), ReLU and ELU (α=1)

References-

Add a comment

Related posts:

El paciente inmortal

Algunos afirman que la corrupción es un fenómeno social, unos consideran que es el síntoma y otros la reconocen como la enfermedad misma.

51 Tips to help you live with dementia

What a lot of people don’t know is that Dementia is not a disease, but rather a syndrome. It’s basically a general term used to describe different disorders that affect the brain. Dementia occurs…

Can You Imagine Living And Working In The Same Building? Seattle Can!

There are plans for a new skyscraper in Seattle. Yes, a new skyscraper. And this may be the tallest one in the city yet! People are planning for a bright future ahead and they are looking forward to…