AgentBench: Evaluating LLMs as agents X Liu, H Yu, H Zhang, Y Xu, X Lei, H Lai, Y Gu, H Ding, K Men, K Yang, ... arXiv preprint arXiv:2308.03688, 2023 | 221 | 2023 |
ChatGLM: A family of large language models from GLM-130B to GLM-4 All Tools T GLM, A Zeng, B Xu, B Wang, C Zhang, D Yin, D Zhang, D Rojas, G Feng, ... arXiv preprint arXiv:2406.12793, 2024 | 117 | 2024 |
WebGLM: Towards an efficient web-enhanced question answering system with human preferences X Liu, H Lai, H Yu, Y Xu, A Zeng, Z Du, P Zhang, Y Dong, J Tang Proceedings of the 29th ACM SIGKDD Conference on Knowledge Discovery and …, 2023 | 69 | 2023 |
Middleware for LLMs: Tools are instrumental for language agents in complex environments Y Gu, Y Shu, H Yu, X Liu, Y Dong, J Tang, J Srinivasa, H Latapie, Y Su arXiv preprint arXiv:2402.14672, 2024 | 15 | 2024 |
AutoWebGLM: Bootstrap And Reinforce A Large Language Model-based Web Navigating Agent H Lai, X Liu, IL Iong, S Yao, Y Chen, P Shen, H Yu, H Zhang, X Zhang, ... arXiv preprint arXiv:2404.03648, 2024 | 7 | 2024 |
VisualAgentBench: Towards large multimodal models as visual foundation agents X Liu, T Zhang, Y Gu, IL Iong, Y Xu, X Song, S Zhang, H Lai, X Liu, H Zhao, ... arXiv preprint arXiv:2408.06327, 2024 | 6 | 2024 |
OpenWebAgent: An open toolkit to enable web agents on large language models IL Iong, X Liu, Y Chen, H Lai, S Yao, P Shen, H Yu, Y Dong, J Tang Proceedings of the 62nd Annual Meeting of the Association for Computational …, 2024 | 4 | 2024 |
AutoWebGLM: A Large Language Model-based Web Navigating Agent H Lai, X Liu, IL Iong, S Yao, Y Chen, P Shen, H Yu, H Zhang, X Zhang, ... Proceedings of the 30th ACM SIGKDD Conference on Knowledge Discovery and …, 2024 | 1 | 2024 |
AndroidLab: Training and Systematic Benchmarking of Android Autonomous Agents Y Xu, X Liu, X Sun, S Cheng, H Yu, H Lai, S Zhang, D Zhang, J Tang, ... arXiv preprint arXiv:2410.24024, 2024 | | 2024 |