Mirror of https://github.com/labmlai/annotated_deep_learning_paper_implementations.git, synced 2025-08-26 08:41:23 +08:00
✍️ typos
@@ -16,13 +16,13 @@ This is a [PyTorch](https://pytorch.org) implementation of
 * Tricky for RNNs. Do you need different normalizations for each step?
 * Doesn't work with small batch sizes;
 large NLP models are usually trained with small batch sizes.
-* Need to compute means and variances across devices in distributed training
+* Need to compute means and variances across devices in distributed training.
 
 ## Layer Normalization
 
 Layer normalization is a simpler normalization method that works
 on a wider range of settings.
-Layer normalization transformers the inputs to have zero mean and unit variance
+Layer normalization transforms the inputs to have zero mean and unit variance
 across the features.
 *Note that batch normalization fixes the zero mean and unit variance for each element.*
 Layer normalization does it for each batch across all elements.
@@ -9,13 +9,13 @@ This is a [PyTorch](https://pytorch.org) implementation of
 * Tricky for RNNs. Do you need different normalizations for each step?
 * Doesn't work with small batch sizes;
 large NLP models are usually trained with small batch sizes.
-* Need to compute means and variances across devices in distributed training
+* Need to compute means and variances across devices in distributed training.
 
 ## Layer Normalization
 
 Layer normalization is a simpler normalization method that works
 on a wider range of settings.
-Layer normalization transformers the inputs to have zero mean and unit variance
+Layer normalization transforms the inputs to have zero mean and unit variance
 across the features.
 *Note that batch normalization fixes the zero mean and unit variance for each element.*
 Layer normalization does it for each batch across all elements.
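For context on the sentence this commit corrects: layer normalization computes the mean and variance over the features of each individual element, while batch normalization computes them over the batch for each feature. Below is a minimal PyTorch sketch of that distinction; it is not part of the commit, and the tensor shapes are chosen purely for illustration.

```python
import torch

# A batch of 4 elements, each with 8 features.
x = torch.randn(4, 8)
eps = 1e-5

# Layer normalization: statistics over the features (dim=-1),
# computed independently for each element in the batch.
ln_mean = x.mean(dim=-1, keepdim=True)                # shape (4, 1)
ln_var = x.var(dim=-1, keepdim=True, unbiased=False)  # shape (4, 1)
x_ln = (x - ln_mean) / torch.sqrt(ln_var + eps)

# Batch normalization: statistics over the batch (dim=0),
# computed independently for each feature.
bn_mean = x.mean(dim=0, keepdim=True)                 # shape (1, 8)
bn_var = x.var(dim=0, keepdim=True, unbiased=False)   # shape (1, 8)
x_bn = (x - bn_mean) / torch.sqrt(bn_var + eps)

# Each row of x_ln, and each column of x_bn, has roughly
# zero mean and unit variance.
print(x_ln.mean(dim=-1), x_ln.var(dim=-1, unbiased=False))
print(x_bn.mean(dim=0), x_bn.var(dim=0, unbiased=False))
```

PyTorch's built-in `torch.nn.LayerNorm(8)` performs the same per-element normalization over the last dimension, with an additional learnable gain and bias.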