↓
Skip to main content
Shital Shah’s Chain of Thought
Home
Blog
About
Home
Blog
About
Blog
Phi-4: The Most Powerful smol Model!
13 December 2024
·
279 words
·
2 mins
Announcement: PhD Internship for AI for Reasoning
11 December 2024
·
109 words
·
1 min
Some Hand Wavy Estimates on FLOPs and Computational Reducibility
9 December 2024
·
253 words
·
2 mins
Data Efficiency with LoRA
4 December 2024
·
54 words
·
1 min
ChatGPT and GR: An All-Night Adventure
30 November 2024
·
40 words
·
1 min
Erdős has A Great Benchmark for ASI
29 November 2024
·
40 words
·
1 min
Transformer Weak Links: Tokenization and Decoding
21 November 2024
·
224 words
·
2 mins
One Neat trick in FrontierMath
20 November 2024
·
42 words
·
1 min
Scaling Laws and Data Wall
15 November 2024
·
341 words
·
2 mins
OpenCoder Uses 3X Less Data
11 November 2024
·
133 words
·
1 min
←
1
2
3
⋯
100
→