Extrinsic Evaluation

Extrinsic evaluation evaluates how the LLM is good for a downstream task like summarization, QnA.

Example:

  1. BLEU Score
  2. ROUGE-N Score
  3. ROUGE-LSUM Score
  4. Meteor Score
  5. F1 Score
  6. Word Error Rate

References


Related Notes