Language Models for Hackers

Language Models for Hackers This is not an exhaustive list of LLM literature. This is an opinionated collection of papers from the LLM landscape useful for hackers. This document will keep getting updated. If you have any questions, DM me on Twitter at: @nishantiam and follow for general updates. I presume you already know Attention is all you need, GPT-3 and GPT-4. Prompting We can ask LLMs questions, and get answers....

May 25, 2024 · 13 min · Nishant Nikhil