Roadmap to get into Generative AI in 2024

Top-down approach

This curriculum follows a top-down approach — code first, theory later.

I like to learn out of necessity. So, if I have to figure out something, a problem to solve, or a prototype to make, I will reach far and wide for the information I need, study, make sense of it, and then act on it.

For example, I aim to be an AI engineer who understands LLMs at a fundamental level, which involves having the skill to code transformers from scratch and fine-tuning LLMs on GPUs, etc. I can’t do that now because there are gaps in my knowledge, and I aim to fill in those gaps.

It is also NLP-focused; if you’re looking for other AI specializations like computer vision or reinforcement learning, DM me on Linkedin. I will pass you some recommendations.

Before I dump a bunch of links on you, I wish somebody had told me two important things before I started learning anything.

Learn in Public

There’s a lot to learn, and you will never be done learning, especially with AI, when new revolutionary papers and ideas are released weekly.

The biggest mistake you can make is to learn in private. You don’t create any opportunities for yourself if you do that. You don’t have anything to show for it besides being able to say you completed something. What matters more is what you made of the information, how you turned it into knowledge to be shared with the public, and what novel ideas and solutions came from that information.

So, you should learn in public.

That means having a habit of creating.

This can mean:

writing blogs and tutorials
join hackathons and collaborate with others
ask and answer questions in Discord communities
work on side projects you’re passionate about
tweeting about something interesting you discovered new

Use twitter, LinkedIn, Discord for learning in public and sharing your work

Mathematics
Tools
∘ Python
∘ PyTorch
Machine Learning
∘ Write from Scratch
∘ Compete
∘ Do side projects
∘ Deploy them
∘ Supplementary
Deep Learning
∘ Fast.ai
∘ Do more competitions
∘ Implement papers
∘ Computer Vision
∘ NLP
Large Language Models
∘ Watch Neural Networks: Zero to Hero
∘ Free LLM boot camp
∘ Build with LLMs
∘ Participate in hackathons
∘ Read papers
∘ Write Transformers from scratch.
∘ Some good blogs
∘ Watch Umar Jamil
∘ Learn how to run open-source models.
∘ Prompt Engineering
∘ Fine-tuning LLMs
∘ RAG
How to stay updated
Other curriculums/listicles you may find useful

Mathematics

Machine learning relies heavily on three pillars of mathematics: linear algebra, calculus, probability, and statistics. Each plays a unique role in enabling algorithms to function effectively.

Linear Algebra: the mathematical toolkit for data representation and manipulation, where matrices and vectors form the language for algorithms to interpret and process information
Calculus: The engine for optimization in machine learning, enabling algorithms to learn and improve by understanding gradients and rates of change.
Probability and Statistics: The foundation for decision-making under uncertainty, allowing algorithms to predict outcomes and learn from data through models of randomness and variability.

This is a great series on Math for ML from a programmer’s perspective: Math for Machine Learning by Weights & Biases (code)

If you want a code-first approach to Linear Algebra, do Computational Linear Algebra (video, code) by the creators of fast.ai.

Read Introduction to Linear Algebra for Applied Machine Learning with Python alongside the course.

If you want something more traditional, look at Imperial College London lectures — Linear Algebra & Multivariate Calculus.

Watch 3Blue1Brown’s Essence of Linear Algebra and Essence of Calculus.

Watch Statistics Fundamentals by StatQuest for statistics

Supplementary