Accidental Factors

Needless text

Home MITM Categories About

Thinking about high quality human data

· July 14, 2024

(Updated: Nov 24, 2024)

links-and-recs

https://lilianweng.github.io/posts/2024-02-05-human-data-quality/

The alignment stage of LLM post-training requires lots of human-generated data, and the quality is bounded on how good that data is. Lots of techniques here, with citations.