publications

Publications by category, in reverse chronological order.

2024

  1. NeurIPS 2024
    MMLU-Pro: A More Robust and Challenging Multi-Task Language Understanding Benchmark
    Yubo Wang, Xueguang Ma, Ge Zhang, and 14 more authors
    2024
  2. NeurIPS 2024
    GenAI Arena: An Open Evaluation Platform for Generative Models
    Dongfu Jiang, Max Ku, Tianle Li, and 4 more authors
    2024
  3. EMNLP 2024
    VideoScore: Building Automatic Metrics to Simulate Fine-grained Human Feedback for Video Generation
    Xuan He, Dongfu Jiang, Ge Zhang, Max Ku, and 15 more authors
    2024

2023

  1. ICCV 2023 Workshop
    DeepfakeArt Challenge: A Benchmark Dataset for Generative AI Art Forgery and Data Poisoning Detection
    Hossein Aboutalebi, Dayou Mao, Rongqi Fan, and 3 more authors
    2023