Related Notes
- Deliberative Alignment - Reasoning Enables Safer Language Models
- G-Eval - NLG Evaluation using GPT-4 with Better Human Alignment
- Compressed Chain of Thought - Efficient Reasoning Through Dense Representations
- Molmo and PixMo
- $τ$-bench - A Benchmark for Tool-Agent-User Interaction in Real-World Domains