LLM-Blender: Ensembling Large Language Models with Pairwise Ranking and Generative Fusion D Jiang, X Ren, BY Lin ACL 2023, 2023 | 76* | 2023 |
MMMU: A Massive Multi-discipline Multimodal Understanding and Reasoning Benchmark for Expert AGI X Yue, Y Ni, K Zhang, T Zheng, R Liu, G Zhang, S Stevens, D Jiang, ... CVPR 2024 Oral, 2023 | 66 | 2023 |
TIGERScore: Towards Building Explainable Metric for All Text Generation Tasks D Jiang, Y Li, G Zhang, W Huang, BY Lin, W Chen arXiv preprint arXiv:2310.00752, 2023 | 12 | 2023 |
VIEScore: Towards Explainable Metrics for Conditional Image Synthesis Evaluation M Ku, D Jiang, C Wei, X Yue, W Chen arXiv preprint arXiv:2312.14867, 2023 | 3 | 2023 |
MANTIS: Interleaved Multi-Image Instruction Tuning D Jiang, X He, H Zeng, C Wei, M Ku, Q Liu, W Chen arXiv preprint arXiv:2405.01483, 2024 | | 2024 |