← Topics

Machine Learning

members
15 tweets
Columns:
# Tweet User Followers Views Ratio Engagement Posted
1
[video] 🧊 Los servidores de IA consumen cantidades brutales de agua para no sobrecalentarse. Una sola instalación como la del video usa 30 millones de litros de agua al mes solo para enfriar los servidores. Mientras tanto, en muchas partes del mundo falta agua potable… ¿Vale la pena el
@tendenciaytuits 229.9K 25.5K 0.1x 98 Jun 5
2
[image] day 4/60 of summer break - almost completed the attention chapter - wrote a casual attention mask - learnt about multi head attention - extended single head attention to multi head attention once am done with this, i'll spend some more time into revising all this
@sharpeye_wnl 8.0K 25.3K 3.2x 165 May 30
3
[image] day 9/60 of summer break - wanted to learn about RAG so started reading this paper - read about categories of rag - pausing the book for now, done with what i needed from the book - will start working on embeddings and cosine similarity after i complete the first 3 sections
@sharpeye_wnl 8.1K 24.3K 3.0x 118 Jun 4
4
[image] day 3/60 of summer break - started chapter of the book - started reading about attention mechanisms - wrote a small attention mechanism without trainable weights and then another with trainable weights i have completed half of the chapter and the other half i'll complete today
@sharpeye_wnl 7.9K 23.3K 2.9x 123 May 29
5
[image] > revising all the basic LLM concepts by rebuilding nanogpt today will be a fun ride
@sharpeye_wnl 8.1K 18.2K 2.2x 110 Jun 12
6
[image] day 5/60 of summer break - pulled an allnighter - completely understood masked attention, casual attention and multi head attention - started with implementing a gpt model - will try to cover chapter 4 today rerevising was really worth it, watched rasbt's video and iterated a
@sharpeye_wnl 8.0K 16.1K 2.0x 135 May 31
7
[image] day 1/60 of summer break - completed chapter 1 of building a large language model from scratch - started chapter 2, completed 25% of it - morning workout + 45 min cardio the book has a lot of depth and i want to cover it whole. i dont wanna rush it so i'll try to give it 5-6
@sharpeye_wnl 7.9K 15.1K 1.9x 135 May 27
8
[image] day 11/60 of summer break > continued learning about building rag pipelines > read about document loaders > read about types of chunking > why semantic and late chunking are used if you wanna read more about late chunking use this :
@sharpeye_wnl 8.1K 8.7K 1.1x 121 Jun 6