| 9:00-9:05 |
Opening Ceremony |
Prof. Weiping Li, Chief Scientist, Civil Aviation Flight University of China |
| 9:05-9:10 |
Release of the English Monograph on Evaluatology / Establishment of the Expert Committee for the Evaluatology Series |
BenchCouncil (International Open Benchmark Council) |
| 9:10-9:55 |
ICICLE: Intelligent Cyberinfrastructure for Next-Generation AI Applications using Computing Continuum |
Prof. D. K. Panda, IEEE/ACM Fellow |
| 9:55-10:40 |
AI: Automation of Intelligence Powered by Data |
Prof. Aoying Zhou, East China Normal University |
| 10:40-10:50 |
Tea Break |
| 10:50-10:55 |
BenchCouncil Press Release |
BenchCouncil (International Open Benchmark Council) |
| 10:55-11:40 |
Release of Achievements by the International Aviation Evaluatology Research Center, Civil Aviation Flight University of China |
Prof. Weiping Li, Prof. Lin Zou |
| 11:40-12:00 |
Release of CPU Evaluation Standards Achievements |
BenchCouncil CPU Evaluation Standards Working Group |
| 12:00-12:15 |
Development of Large Language Models in China and Abroad in 2025, Comparison of Computing Infrastructure, and Practical Applications |
Beijing An-link Technology Co., Ltd |
| 14:00-14:45 |
Release of Achievements from the Open-Source Evaluatology Research Center (East China Normal University) |
Prof. Wei Wang |
| 14:45-15:30 |
Latest Developments in European Open-Source AI Systems and Applications |
Hajdi Cenan & Davor Runje (European AI experts) |
| 15:30-15:40 |
Tea Break |
| 15:40-16:15 |
Panel on Topic Selection for the Evaluatology Studies Book Series |
Prof. Jianfeng Zhan, Prof. Maodeng Li, and Other Experts |
| 16:15-16:30 |
Benchmark Evaluation System for Mobile-Side Large Language Models |
China Certification & Inspection (Group) Co., Ltd., CCIC & China Academy of Information and Communications Technology |
| 16:30-16:45 |
Release of Achievements from the Large Language Models Evaluatology Standards Group |
BenchCouncil Large Language Models Evaluatology Standards Group |
| 16:45-17:00 |
Database Evaluatology – Evaluation of Vector Databases |
Dr. Guoxin Kang |
| 17:00-17:30 |
Science and Technology Evaluation Panel |
Dr. Fanda Fan, Dr. Guoxin Kang, and Domain Experts |
| 9:45-12:00 |
AI-based Publishing Seminar - BenchCouncil Press: AI-based Publishing Open Discussion |
|
| Dec 4, 14:00-17:00 |
Evaluatology Tutorial — Part I |
Open Tutorial |
| Dec 5, 9:00-12:00 |
Evaluatology Tutorial — Part II |
Open Tutorial |
| 9:00-9:45 |
Keynote: Challenges to Evaluation from a LET Perspective |
Prof. Weining Qian, East China Normal University |
| 9:45-10:05 |
Meta Evaluation |
Hongxiao Li, ICT, CAS |
| 10:05-10:25 |
Leveraging Network and Content Features for Open Source Software Value Assessment |
Wentong Dai, East China Normal University |
| 10:25-10:45 |
Compiler Tuning Method Based on Program Feature Extraction and Model Prediction |
Chenghua Xu, University of Science and Technology of China |
| 10:45-11:05 |
Examining TPC-C Characteristics on Modern E-Commerce Applications |
Xueyuan Ren, The Ohio State University |
| 11:05-11:25 |
Multidimensional Identification and Complex System Transmission Pathway Analysis of Scale-up Risks for Sustainable Aviation Fuel (SAF) in China |
Zhujun Liu, Civil Aviation Flight University of China |
| 11:25-11:45 |
An Empirical Analysis of Contribution Evaluation in Open Source Courses Using OpenRank |
Wentong Dai, East China Normal University |
| 11:45-12:05 |
Relationship Evaluation for Developer Recommendation in Open Source Communities |
Xuanhao Zhao, East China Normal University |
| Break |
| 14:00-14:20 |
Current Status and Future Trends of Evaluation Methods for Artificial Intelligence Chips |
Xiaotong Yu, Aerospace Science & Industry Defense Technology Research And Test Center |
| 14:20-14:40 |
Phys-TSGAIN: A Physics-Informed Generative Imputation for Data Completeness Governance of Lithium-Ion Battery Time Series |
Xinyue Jin, Shanghai University |
| 14:40-15:00 |
Empirical Bias in Theoretical Frameworks: Validation of Distance and Load Assumptions in ICAO Aviation Carbon Emissions Calculation Methodologies |
Jianxiong Chen, Civil Aviation Flight University of China |
| 15:00-15:20 |
Auto-tuning Compiler Flags with Pretrained Language Models and Surrogate-guided Search |
Yinjun Pan, University of Science and Technology of China |
| 15:20-15:40 |
PerfMamba: Performance Analysis and Pruning of Selective State Space Models |
Abdullah Al Asif, Iowa State University |
| 15:40-16:00 |
Review of LLM Jailbreaks: White-Box and Black-Box Perspectives on Attacks, Defenses, and Critical Metrics |
Shuyuan Liu, East China Normal University |
| 16:00-16:20 |
OceanNNEval: Benchmark for Three-Dimensional Temperature and Salinity Reconstruction |
Guohang Peng, Sun Yat-sen University |
| 9:45-10:05 |
iDATA: An Open-source Vectorization Dataset for AI-EDA |
Xingquan Li, Pengcheng Laboratory |
| 10:05-10:25 |
M-CORE: A Dual-Axis Grading Framework for Evaluating the Completeness and Openness of Model Release Units |
Zhen Zhang, East China Normal University |
| 10:25-10:45 |
The Global Pulse of Code: A Framework for Evaluating the Globalization of Open Source Projects |
Jiaheng Peng, East China Normal University |
| 10:45-11:05 |
An On-Device Evaluation Framework for LLMs with Budget-Constrained Subsets |
Minghao Wang, China Academy of Information and Communications Technology |
| 11:05-11:25 |
Open Source Development Goals: A Comprehensive Framework for Evaluating and Guiding Global Open Source Initiatives |
Fanyu Han, East China Normal University |
| 11:25-11:45 |
GeoClaim: Programmable Geoscientific Fact Verification and Judge-Guided Evaluation for Open-Ended Mineral Exploration QA |
Yuang Zhang, China University of Geosciences |
| 11:45-12:05 |
AC Bench: An open Artificial intelligence chip performance benchmark tool |
Qian Zhang, China Academy of Information and Communications Technology |
| Break |
| 14:00-14:20 |
OpenChartInsight: A Lightweight Framework for Automatic Interpretation of GitHub Repository Charts |
Xie Siyi, East China Normal University |
| 14:20-14:40 |
An Evaluation Framework for the Museum and Cultural Heritage Sector from the Perspective of Evaluatology |
Xiang Li, The Palace Museum |
| 14:40-15:00 |
Dynamic Multi-View RAG Mitigating Hallucinations of Large Language Models in Education |
Weijun Zhao, China Academy of Information and Communications Technology |
| 15:00-15:20 |
MicroGen: Agent-Driven Automated Extraction of Realistic Microbenchmarks from Complex Software Systems |
Fei Tang, Inspur Data Co.,Ltd. |
| 15:20-15:40 |
Systematic Evaluation of Miniaturized Lunar Navigation and Communication Satellite Systems |
Siyuan Han, Lunar Exploration and Aerospace Engineering Center |
| 15:40-16:00 |
FashionAtlas: Enhancing Semantics and Control in Multimodal Fashion Image Editing |
Enzhen Gu, Beijing Institute of Fashion Technology |
| 16:00-16:20 |
Concurrent Priority Queues on GPUs: A Systematic Evaluation of Designs, Workloads, and Relaxation Semantics |
Jingwei Sun, University of Science and Technology of China |
| 16:20-16:40 |
Research on the Effectiveness Evaluation System of Lunar-Based Near-Earth Asteroid Monitoring Systems |
Zhiliu Lu, Deep Space Exploration Lab |