AI Unplugged
AI Unplugged 23: nGPT normalised transformer, LAUREL, TokenFormer
Insights over information
Nov 21, 2024 • Datta and Ashwanth
AI Unplugged 22: RoPE internals explained, Stuffed Mamba, Planning with Transformer: Chess.
Insights over information.
Nov 7, 2024 • Datta and Ashwanth
October 2024
AI Unplugged 21: TPI LLM, Differential Transformer, ARIA, Ministral 3B and 8B
Insights over information.
Oct 17, 2024 • Datta, Ashwanth, and Amit Saha
September 2024
AI Unplugged 20: SCoRE Self Correction via RL, OpenAI o1 models, Qwen 2.5 coder, Spectrum Fine Tuning.
Insights over information
Sep 26, 2024 • Datta, Ashwanth, and Amit Saha
AI Unplugged 19: KTO for model alignment, OLMoE, Mamba in the LlaMa, Plan Search
Insights over Information
Sep 12, 2024 • Datta, Ashwanth, and Amit Saha
August 2024
AI Unplugged 18: MiniTron and Llama-MiniTron, 1.5 Pints, Jamba 1.5 and FocusLLM.
Insights over Information
Aug 29, 2024 • Datta, Ashwanth, and Gavrish Prabhu
AI Unplugged 17: The AI Scientist, Apple Foundation Models, MiniCPM-V and more.
Insights over Information. Happy Independence Day India 🇮🇳
Aug 15, 2024 • Datta and Ashwanth
AI Unplugged 16: Llama 3, AIMO winners, Segment Anything Model 2, LazyLLM
Insights over Information
Aug 1, 2024 • Datta, Ashwanth, and Gavrish Prabhu
July 2024
AIUnplugged 15: Gemma 2, Flash Attention 3, QGaLoRE, MathΣtral and Codestral Mamba
Insights over Information
Jul 18, 2024 • Datta, Ashwanth, and Gavrish Prabhu
AI Unplugged 14: Adam mini, GrokFast, MobileLLM, JEST
Insights over information
Jul 11, 2024 • Datta, Ashwanth, and Gavrish Prabhu
June 2024
AI Unplugged 13: Qwen2, DiscoPOP, Mixture of Agents, YOLO v10, Grokked Transformers
Insights over Information
Jun 20, 2024 • Datta, Ashwanth, and Gavrish Prabhu
AI Unplugged 12: MoRA. DPO vs PPO. CoPE Contextual Position Encoding. S3D Self Speculative Decoding.
Insights over Information
Jun 6, 2024 • Datta, Ashwanth, and Gavrish Prabhu