Summary

Language modeling is the task of predicting the next word.

  • NTP (Next Token Prediction).
  • A representative benchmark task.
  • A subcomponent of many NLP tasks:
    • predictive typing
    • speech recognition
    • handwriting recognition
    • spelling/grammar correction
    • machine translation
    • etc.

Formally, it is defined as follows.

NOTE

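Given a sequence of words $x^{(1)}, x^{(2)}, \dots, x^{(t)}$,
compute the probability distribution of the next word $x^{(t+1)}$:

$P(x^{(t+1)} \mid x^{(t)}, \dots, x^{(1)})$

where $x^{(t+1)}$ can be any word in the vocabulary $V = \{w_1, \dots, w_{|V|}\}$.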

A system that does this is called a Language Model.
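
To make the definition concrete, here is a minimal sketch of a count-based model that returns a probability distribution over the next word. It rests on a simplifying assumption that is not part of the definition above: the model conditions only on the last word of the context (a bigram approximation). The function names, the toy corpus, and the uniform fallback for unseen contexts are all hypothetical choices made for illustration.

```python
from collections import Counter

def train_bigram_counts(corpus):
    """Count how often each word follows each preceding word."""
    counts = {}
    for sentence in corpus:
        tokens = sentence.split()
        for prev, nxt in zip(tokens, tokens[1:]):
            counts.setdefault(prev, Counter())[nxt] += 1
    return counts

def next_word_distribution(counts, context, vocab):
    """Return P(next word | last word of context) for every word in vocab.

    Assumption: only the final context word is used (bigram approximation).
    """
    followers = counts.get(context[-1], Counter())
    total = sum(followers.values())
    if total == 0:
        # Unseen context: fall back to a uniform distribution (illustrative choice).
        return {w: 1.0 / len(vocab) for w in vocab}
    # Counter returns 0 for words never seen after this context word.
    return {w: followers[w] / total for w in vocab}

# Toy corpus, invented for this example.
corpus = [
    "the students opened their books",
    "the students opened their laptops",
    "the students closed their books",
]
vocab = sorted({w for s in corpus for w in s.split()})
counts = train_bigram_counts(corpus)

dist = next_word_distribution(counts, ["the", "students", "opened", "their"], vocab)
for word, prob in sorted(dist.items(), key=lambda kv: -kv[1]):
    print(f"{word:10s} {prob:.3f}")
```

The returned dictionary has one entry per vocabulary word and its values sum to 1, which is what "probability distribution of the next word" requires. Neural language models replace the counting step with a learned function but keep the same output contract.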

  • Language model: a system that assigns a probability to a piece of text.
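
The two descriptions of a language model are connected by the chain rule of probability. As a short worked equation, writing a piece of text as the words $x^{(1)}, \dots, x^{(T)}$ (notation assumed here for illustration), its probability factors into a product of next-word probabilities:

```latex
P\big(x^{(1)}, \dots, x^{(T)}\big)
  = P\big(x^{(1)}\big)\, P\big(x^{(2)} \mid x^{(1)}\big) \cdots
    P\big(x^{(T)} \mid x^{(T-1)}, \dots, x^{(1)}\big)
  = \prod_{t=1}^{T} P\big(x^{(t)} \mid x^{(t-1)}, \dots, x^{(1)}\big)
```

Each factor is exactly the quantity a next-word predictor provides, so a system that predicts the next word can also assign a probability to a whole piece of text.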