↓
Skip to main content
Shital Shah’s Chain of Thought
Home
Blog
About
Home
Blog
About
Blog
Minor Tweaks, Major Leaps: Glimpsing Frontier Models
6 March 2024
·
157 words
·
1 min
Challenging Claude 3: Extinct Codes, Maps, and Puzzles
5 March 2024
·
83 words
·
1 min
A Lifetime in 1T Tokens: AI's Human Experience
22 February 2024
·
32 words
·
1 min
Byte Feeding Frenzy: Spiky Training with Token Overdose
16 February 2024
·
48 words
·
1 min
Random Success: When Sampling Weights Generalizes
13 February 2024
·
47 words
·
1 min
When Random Sampling Packs a Punch: The Balls-in-Bins Problem
8 February 2024
·
194 words
·
1 min
PyTorch Gold Rush: Unearthing Distributed Debugging Tips
22 January 2024
·
10 words
·
1 min
Paperception: Unveiling Post-Hoc EMA Tuning
19 January 2024
·
43 words
·
1 min
Phixtral Fusion: Phi-2 and Pre-trained Experts Smash the Leaderboard
15 January 2024
·
73 words
·
1 min
Searching 'I Cannot Fulfill' Opens AI Spam Floodgates
14 January 2024
·
52 words
·
1 min
←
1
⋯
8
9
10
⋯
100
→