Program

Time Agenda Speaker
December 3 - Morning
Main Forum (Chair: Prof. Lin Zou, Civil Aviation Flight University of China)
9:00-9:05 Opening Ceremony Prof. Weiping Li, Chief Scientist, Civil Aviation Flight University of China
9:05-9:10 Release of the English Monograph on Evaluatology / Establishment of the Expert Committee for the Evaluatology Series BenchCouncil (International Open Benchmark Council)
9:10-9:55 ICICLE: Intelligent Cyberinfrastructure for Next-Generation AI Applications using Computing Continuum Prof. D. K. Panda, IEEE/ACM Fellow
9:55-10:40 AI: Automation of Intelligence Powered by Data Prof. Aoying Zhou, East China Normal University
10:40-10:50 Tea Break
10:50-10:55 BenchCouncil Press Release BenchCouncil (International Open Benchmark Council)
10:55-11:40 Release of Achievements by the International Aviation Evaluatology Research Center, Civil Aviation Flight University of China Prof. Weiping Li, Prof. Lin Zou
11:40-12:00 Release of CPU Evaluation Standards Achievements BenchCouncil CPU Evaluation Standards Working Group
12:00-12:15 Development of Large Language Models in China and Abroad in 2025, Comparison of Computing Infrastructure, and Practical Applications Beijing An-link Technology Co., Ltd.
December 3 - Afternoon
Main Forum (Chair: Dr. Guoxin Kang, Institute of Computing Technology, Chinese Academy of Sciences)
14:00-14:45 Release of Achievements from the Open-Source Evaluatology Research Center (East China Normal University) Prof. Wei Wang
14:45-15:30 Latest Developments in European Open-Source AI Systems and Applications Hajdi Cenan & Davor Runje (European AI experts)
15:30-15:40 Tea Break
15:40-16:15 Panel on Topic Selection for the Evaluatology Studies Book Series Prof. Jianfeng Zhan, Prof. Maodeng Li, and Other Experts
16:15-16:30 Benchmark Evaluation System for Mobile-Side Large Language Models China Certification & Inspection (Group) Co., Ltd. (CCIC) & China Academy of Information and Communications Technology
16:30-16:45 Release of Achievements from the Large Language Models Evaluatology Standards Group BenchCouncil Large Language Models Evaluatology Standards Group
16:45-17:00 Database Evaluatology – Evaluation of Vector Databases Dr. Guoxin Kang
17:00-17:30 Science and Technology Evaluation Panel Dr. Fanda Fan, Dr. Guoxin Kang, and Domain Experts
December 4 - Morning
9:45-12:00 AI-based Publishing Seminar - BenchCouncil Press: AI-based Publishing Open Discussion
December 4 Afternoon - December 5 Morning
Dec 4, 14:00-17:00 Evaluatology Tutorial — Part I Open Tutorial
Dec 5, 9:00-12:00 Evaluatology Tutorial — Part II Open Tutorial
December 4 - All Day: Bench Conference Report I
9:00-9:45 Keynote: Challenges to Evaluation from a LET Perspective Prof. Weining Qian, East China Normal University
9:45-10:05 Meta Evaluation Hongxiao Li, ICT, CAS
10:05-10:25 Leveraging Network and Content Features for Open Source Software Value Assessment Wentong Dai, East China Normal University
10:25-10:45 Compiler Tuning Method Based on Program Feature Extraction and Model Prediction Chenghua Xu, University of Science and Technology of China
10:45-11:05 Examining TPC-C Characteristics on Modern E-Commerce Applications Xueyuan Ren, The Ohio State University
11:05-11:25 Multidimensional Identification and Complex System Transmission Pathway Analysis of Scale-up Risks for Sustainable Aviation Fuel (SAF) in China Zhujun Liu, Civil Aviation Flight University of China
11:25-11:45 An Empirical Analysis of Contribution Evaluation in Open Source Courses Using OpenRank Wentong Dai, East China Normal University
11:45-12:05 Relationship Evaluation for Developer Recommendation in Open Source Communities Xuanhao Zhao, East China Normal University
Break
14:00-14:20 Current Status and Future Trends of Evaluation Methods for Artificial Intelligence Chips Xiaotong Yu, Aerospace Science & Industry Defense Technology Research and Test Center
14:20-14:40 Phys-TSGAIN: A Physics-Informed Generative Imputation for Data Completeness Governance of Lithium-Ion Battery Time Series Xinyue Jin, Shanghai University
14:40-15:00 Empirical Bias in Theoretical Frameworks: Validation of Distance and Load Assumptions in ICAO Aviation Carbon Emissions Calculation Methodologies Jianxiong Chen, Civil Aviation Flight University of China
15:00-15:20 Auto-tuning Compiler Flags with Pretrained Language Models and Surrogate-guided Search Yinjun Pan, University of Science and Technology of China
15:20-15:40 PerfMamba: Performance Analysis and Pruning of Selective State Space Models Abdullah Al Asif, Iowa State University
15:40-16:00 Review of LLM Jailbreaks: White-Box and Black-Box Perspectives on Attacks, Defenses, and Critical Metrics Shuyuan Liu, East China Normal University
16:00-16:20 OceanNNEval: Benchmark for Three-Dimensional Temperature and Salinity Reconstruction Guohang Peng, Sun Yat-sen University
December 4 - All Day: Bench Conference Report II
9:45-10:05 iDATA: An Open-source Vectorization Dataset for AI-EDA Xingquan Li, Pengcheng Laboratory
10:05-10:25 M-CORE: A Dual-Axis Grading Framework for Evaluating the Completeness and Openness of Model Release Units Zhen Zhang, East China Normal University
10:25-10:45 The Global Pulse of Code: A Framework for Evaluating the Globalization of Open Source Projects Jiaheng Peng, East China Normal University
10:45-11:05 An On-Device Evaluation Framework for LLMs with Budget-Constrained Subsets Minghao Wang, China Academy of Information and Communications Technology
11:05-11:25 Open Source Development Goals: A Comprehensive Framework for Evaluating and Guiding Global Open Source Initiatives Fanyu Han, East China Normal University
11:25-11:45 GeoClaim: Programmable Geoscientific Fact Verification and Judge-Guided Evaluation for Open-Ended Mineral Exploration QA Yuang Zhang, China University of Geosciences
11:45-12:05 AC Bench: An Open Artificial Intelligence Chip Performance Benchmark Tool Qian Zhang, China Academy of Information and Communications Technology
Break
14:00-14:20 OpenChartInsight: A Lightweight Framework for Automatic Interpretation of GitHub Repository Charts Xie Siyi, East China Normal University
14:20-14:40 An Evaluation Framework for the Museum and Cultural Heritage Sector from the Perspective of Evaluatology Xiang Li, The Palace Museum
14:40-15:00 Dynamic Multi-View RAG Mitigating Hallucinations of Large Language Models in Education Weijun Zhao, China Academy of Information and Communications Technology
15:00-15:20 MicroGen: Agent-Driven Automated Extraction of Realistic Microbenchmarks from Complex Software Systems Fei Tang, Inspur Data Co., Ltd.
15:20-15:40 Systematic Evaluation of Miniaturized Lunar Navigation and Communication Satellite Systems Siyuan Han, Lunar Exploration and Aerospace Engineering Center
15:40-16:00 FashionAtlas: Enhancing Semantics and Control in Multimodal Fashion Image Editing Enzhen Gu, Beijing Institute of Fashion Technology
16:00-16:20 Concurrent Priority Queues on GPUs: A Systematic Evaluation of Designs, Workloads, and Relaxation Semantics Jingwei Sun, University of Science and Technology of China
16:20-16:40 Research on the Effectiveness Evaluation System of Lunar-Based Near-Earth Asteroid Monitoring Systems Zhiliu Lu, Deep Space Exploration Lab