본문으로 건너뛰기

Juhyeon's Blog

태그: NeurIPS

7건의 항목

  • 2026년 4월 13일

    Training Compute-Optimal Large Language Models

    • paper
    • scaling_law
    • compute_optimal
    • chinchilla
    • LLM
    • DeepMind
    • NeurIPS
  • 2026년 4월 13일

    Judging LLM-as-a-Judge with MT-Bench and Chatbot Arena

    • paper
    • benchmark
    • LLM_judge
    • MT_Bench
    • chatbot
    • multi_turn
    • NeurIPS
    • LMSYS
  • 2026년 4월 13일

    MMLU-Pro - A More Robust and Challenging Multi-Task Language Understanding Benchmark

    • paper
    • benchmark
    • MMLU_Pro
    • knowledge
    • reasoning
    • 10_choice
    • NeurIPS
  • 2026년 4월 13일

    Measuring Mathematical Problem Solving with the MATH Dataset

    • paper
    • benchmark
    • mathematics
    • MATH
    • competition_math
    • reasoning
    • NeurIPS
  • 2026년 4월 13일

    WebShop - Towards Scalable Real-World Web Interaction with Grounded Language Agents

    • paper
    • benchmark
    • web_agent
    • WebShop
    • web_shopping
    • sim_to_real
    • NeurIPS
    • Princeton
  • 2026년 4월 13일

    Are Emergent Abilities of Large Language Models a Mirage?

    • paper
    • emergent_abilities
    • scaling_laws
    • measurement
    • metric_choice
    • BIG-Bench
    • LLM_evaluation
    • NeurIPS
    • outstanding_paper
  • 2026년 4월 13일

    Visual Instruction Tuning

    • paper
    • multimodal
    • instruction-tuning
    • LLaVA
    • vision-language
    • NeurIPS

키보드 단축키

/ 또는 Ctrl+K검색
?단축키 도움말
Esc모달 닫기

Created with Quartz v4.5.2 © 2026

  • GitHub
  • Blog