🚀 Work 1:1 with a Software Engineer and let AI handle the busywork → https://www.skool.com/ai-academy-with-robby-6849/about
Why Does AI Need a Safety Net?
Hi! I’m Robby. I spend my days building real-world AI systems, and one thing I’ve learned is that teaching a computer to "think" is hard. If you give a model too much new information at once, it can get lost and forget what it already knew.
This is where the Add & Norm step comes in. It’s like a stabilizer for the model’s brain.
What is Add & Norm?
In a transformer (the technology behind tools like ChatGPT), the model processes information in layers. As it digs deeper into the data, we use a trick called Add & Norm to keep things steady.
Think of it as two separate steps working together:
1. The "Add" (The Shortcut)
Imagine you are reading a long book. Every time you start a new chapter, you want to keep the main idea of the story in mind.
In the model, the "Add" step takes the information from before and adds it to the new information the model just learned. By adding the original input back in, we make sure the AI doesn't lose the important stuff it already figured out.
2. The "Norm" (The Balance)
"Norm" stands for Normalization. If the numbers inside the AI get too big or too small, the math stops working properly.
Normalization keeps those numbers in a healthy range. It’s like keeping your car on the road instead of swerving into the grass.
Why It Matters
Without Add & Norm, deep learning models would be very unstable. They would "forget" their training as they got deeper.
By combining these two steps, we get:
- Better Memory: The AI holds onto important details longer.
- Stable Learning: The math stays smooth, so the model learns faster.
- Reliability: It prevents the model from getting confused as it processes complex ideas.
Next time you hear about transformers, remember that it’s not just about the big, fancy math—it’s also about these clever little tricks that keep the model balanced and ready to learn!