| Conference on Evaluation Science and Engineering & Conference on Open Source and Artificial Intelligence |
                                    
| Time | Agenda | Speaker |
                                    
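| Dec. 4, morning: Main Forum (Hosts: Dr. Fanda Fan, Simin Chen, Jiayuan Ge), Nanhua Hall, 3rd floor |
| 9:00-9:10 | Opening ceremony | Prof. Jianfeng Zhan (BenchCouncil Founding Chair), Prof. Geoffrey Fox (BenchCouncil Steering Committee, ACM/IEEE Fellow) |
| 9:10-9:15 | Release of the Open Source Contribution Century Ranking | Prof. Weiwei Lin (South China University of Technology) |
| 9:15-9:55 | Keynote: Benchmarking AI for Science | Prof. Geoffrey Fox (ACM/IEEE Fellow) |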
                                    
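| 9:55-10:00 | Evaluatology Research Center plaque-awarding ceremony: Civil Aviation Flight University of China; East China Normal University; Institute of Computing Technology, Chinese Academy of Sciences; Institute of Software, Chinese Academy of Sciences; Guangxi Normal University; Beijing Benchi Kangshe Research Institute | Prof. Geoffrey Fox (BenchCouncil Steering Committee), Prof. Tilmann Rabl, Hajdi Cenan, Davor Runje, Prof. Jianfeng Zhan |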
                                    
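| 10:00-10:05 | Release of the Open Source AI Contribution Century Ranking | TBD |
| 10:05-10:45 | Keynote: Evaluatology in Aviation | Prof. Weiping Li (Chief Scientist, Civil Aviation Flight University of China) |
| 10:45-10:55 | Tea break |
| 10:55-11:00 | Presentation of the Open Source Contribution Certificate (Project: EasyGraph; Contributor: Yang Chen, Professor at Fudan University) | Prof. Geoffrey Fox (BenchCouncil Steering Committee Member), Hajdi Cenan (European AI expert) |
| 11:00-11:40 | Keynote: Introducing FastAgency - the fastest way to bring AutoGen workflows to production | Hajdi Cenan & Davor Runje (European AI experts) |
| 11:40-12:10 | Open Source Evaluatology: Towards a Global Standard for Contribution Evaluation | Prof. Wei Wang (East China Normal University) |
| 12:10-12:15 | Release of the Open Source Country and Region Rankings | Prof. Wei Wang (East China Normal University) |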
                                    
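| Dec. 4, afternoon: Main Forum (Hosts: Dr. Fanda Fan, Simin Chen, Jiayuan Ge), Nanhua Hall, 3rd floor |
| 14:00-14:40 | Keynote: Challenges in Modern Benchmarking | Prof. Tilmann Rabl (University of Potsdam) |
| 14:40-14:45 | Release of the Open Source Large Model Contribution Century Ranking | Prof. Saiqin Long (Jinan University) |
| 14:45-15:15 | Invited talk | Prof. Quanshi Zhang (Shanghai Jiao Tong University) |
| 15:15-15:45 | Invited talk: Evaluatology: The Science and Engineering of Evaluation | Assoc. Prof. Wanling Gao (Institute of Computing Technology, Chinese Academy of Sciences) |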
                                    
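| 15:45-15:50 | Release of the Open Source Drone Contribution Ranking | Assoc. Prof. Jianhua Yang (Guangdong Polytechnic Normal University) |
| 15:50-16:00 | BenchCouncil Standards Working Group Rules | Prof. Jianfeng Zhan (BenchCouncil Founding Chair) |
| 16:00-16:02 | Open Source Standards Working Group plaque-awarding ceremony | Prof. Geoffrey Fox (BenchCouncil Steering Committee), Hajdi Cenan |
| 16:02-16:20 | Open Source Standards Working Group: Methods, Approach, and Vision | Prof. Aoying Zhou, Prof. Wei Wang (East China Normal University) |
| 16:20-16:25 | Release of the Open Source Medical AI Contribution Century Ranking | TBD |
| 16:25-16:27 | Low-Altitude Economy Standards Working Group plaque-awarding ceremony | Prof. Geoffrey Fox (BenchCouncil Steering Committee), Hajdi Cenan |
| 16:27-16:45 | Low-Altitude Economy Standards Working Group: Methods, Approach, and Vision | Prof. Weiping Li (Chief Scientist, Civil Aviation Flight University of China); Senior Engineer Xinguo Chen (Institute of Software, Chinese Academy of Sciences) |
| 16:45-16:50 | Release of the Open Source Financial AI Contribution Century Ranking | TBD |
| 16:50-17:10 | AI Computing Power: Current Status and Technical Practice | Shihai Zhu (CTO, Beijing Anliantong) |
| 17:10-17:12 | Large Model Standards Working Group plaque-awarding ceremony | Prof. Geoffrey Fox (BenchCouncil Steering Committee), Hajdi Cenan |
| 17:12-17:30 | Large Model Standards Working Group: Methods, Approach, and Vision | Prof. Jianfeng Zhan, Assoc. Prof. Wanling Gao, Assoc. Prof. Chunjie Luo (Institute of Computing Technology, Chinese Academy of Sciences) |
| 17:30-17:40 | Invited talk: Research on the Fundamental Theory of Evaluation Science | Prof. Jianmin Tang (Hangzhou Dianzi University) |
| 17:40-18:00 | Release of the Open Source Domain-Specific Contribution Rankings: Security / RISC-V / Embodied Intelligence | TBD |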
                                    
| Dec. 5, morning |
| Parallel Session I: Bench conference paper presentations, Juyi Hall, 3rd floor |
| 9:00-9:40 | Invited talk | Weining Qian (East China Normal University) |
| 9:40-10:00 | LWMEval: Evaluating Large-Scale Neural Networks for Six-Hour Weather Nowcasting | Chaochong Zhang (Sun Yat-sen University) |
| 10:00-10:20 | DNN-schedule: A Predictive Scheduler for Minimizing Interference of Co-located DNN Workload | Jiamin Lu (University of Science and Technology of China) |
| 10:20-10:40 | Benchmarking Distributed Transactional Database Systems | Hailin He (East China Normal University) |
| 10:40-11:00 | CaloBench: A Benchmark Study of Generative Models for Calorimeter Showers | Geoffrey Fox (University of Virginia) |
| 11:00-11:20 | StockNNEval: Evaluating Neural Network Methods for Predicting Stock Trend | Zikai Liao (Sun Yat-sen University) |
| 11:20-11:40 | StellarTop: An Integrated Multi-Topic Dataset on GitHub Repositories | Zhiwei Zhu (East China Normal University) |
| 11:40-12:00 | Evaluating Kernel Anti-Exploitation Capabilities: A Evaluatology-based Scalable and General Framework | Simin Chen (Zhongguancun Laboratory) |
| 12:00-12:20 | Benchmarking Edge Computing System for Autonomous Vehicle via CAV Motifs | Yifan Wang (Institute of Computing Technology, Chinese Academy of Sciences) |
                                    
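| Parallel Session II: SimAI Forum: A High-Precision Simulator for Large-Model Cluster Training, Nanhu Hall, 3rd floor |
| 9:00-9:30 | SimAI: A Simulator for Large-Scale Cluster Training | Alibaba technical expert |
| 9:30-9:50 | AICB communication benchmark in practice | Alibaba technical expert |
| 9:50-10:30 | SimAI-Analytical simulation in practice | Alibaba technical expert |
| 10:30-11:00 | Tea break |  |
| 11:00-11:30 | SimAI-Simulation full-stack simulation in practice | Alibaba technical expert |
| 11:30-12:00 | SimAI-Physical CPU-NCCL physical traffic generation in practice; open discussion | All participants |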
                                    
| Parallel Session III: IC conference paper presentations, Nanshan Hall, 3rd floor |
| 9:00-9:20 | Invited talk: Three-Body Computing Constellation: Space Computing Infrastructure | Luqi Gong (Zhejiang Lab) |
| 9:20-9:40 | Parallel Computing on RTEMS Operating System | Zeyu Liang (Northeastern University, China) |
| 9:40-10:00 | GRAC: a method for cancer drug response prediction based on graph residual attention and contrastive learning similarity | Na Luo (Northeast Normal University) |
| 10:00-10:20 | Construction and Application of a Semantic Linked Network for Space Weather Data Based on Metadata | Ci-Feng Wang (National Space Science Center) |
| 10:20-10:40 | Personalized Exercise Recommendations: Federated Learning with Hierarchical Attention for School-Specific Needs | Ye Zhang (Northeast Normal University) |
| 10:40-11:00 | Patent Information Extraction Based on Teacher Student Model - A Case Study of Zinc Battery Patent Dataset | Lingchen Cai (Sichuan University) |
| 11:00-11:20 | Artificial Intelligence Modelling Paths for Reasoning Argumentation Methods for Criminal Evidence | Xiaohan Shao (Zhejiang University) |
| 11:20-11:40 | Parallel Decomposition Method for Deep Learning Models Based on Improved Dual Population Genetic Algorithm | Zi Han (Shandong Normal University) |
| 11:40-12:00 | Integrating CNNs and Transformers for Mid-Price Prediction in High-Frequency Trading | Yuqing Tang (Xi'an Jiaotong-Liverpool University) |
| 12:00-12:20 | Hierarchical Recurrent Network for Active Stereo Matching | Yuan Liu (Zhejiang Lab) |
| 12:20-12:40 | FewNovelBench: A Benchmark for Few-Shot Learning with Many Novel Classes | Zhipeng Lin (Academy of Military Sciences of the Chinese People's Liberation Army) |
                                    
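| Dec. 5, afternoon |
| Parallel Session I: Open Source Contribution Standards Working Group preparatory meeting and discussion (Prof. Aoying Zhou, East China Normal University; Wei Wang, East China Normal University), Juyi Hall, 3rd floor |
| Parallel Session II: Low-Altitude Economy Standards Working Group preparatory meeting and discussion (Prof. Weiping Li, Chief Scientist, Civil Aviation Flight University of China; Xinguo Chen, Institute of Software, Chinese Academy of Sciences), Nanhu Hall, 3rd floor |
| Parallel Session III: Large Model Standards Working Group preparatory meeting and discussion (Prof. Jianfeng Zhan, Chair of the International Open Benchmark Council (BenchCouncil)), Nanshan Hall, 3rd floor |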
                                    
| Dec. 6, morning |
| Parallel Session I: Evaluatology Forum Presentations 1 - Evaluatology Foundations and Frameworks, Juyi Hall, 3rd floor |
| 9:00-9:20 | Open Source Evaluatology: Theoretical Framework and Practical Pathways for Systematic Evaluation of Open Source Ecosystem | Fanyu Han (East China Normal University) |
| 9:20-9:40 | Constructing Benchmarks for Open Source Ecosystems: A Stakeholder Needs-Driven Approach | Zhen Zhang (Hubei University) |
| 9:40-10:00 | Open Source Informetrics: Theoretical Framework and Practical Path of Open Source Ecosystem | Zehua Lou (East China Normal University) |
| 10:00-10:20 | Evaluatology's Perspective on AI Evaluation in Critical Scenarios: From Tail Quality to The Landscape | Zhengxin Yang (Institute of Computing Technology, Chinese Academy of Sciences) |
| 10:20-10:40 | Evaluating Long-Term Usage Patterns of Open Source Datasets: A Citation Network Approach | Jiaheng Peng (East China Normal University) |
| 10:40-11:00 | Evaluating Large Language Models on the Edge: A Use Case of Evaluatology | Zhikun Dong (Institute of Computing Technology, Chinese Academy of Sciences) |
| 11:00-11:20 | A Benchmark Dataset and Evaluation of Collaboration Network in Open Source Software Community | Fan Huang (East China Normal University) |
| 11:20-11:40 | The Theory of Computational Evaluatology | Hedong Yan (Institute of Computing Technology, Chinese Academy of Sciences) |
                                    
| Parallel Session II: Evaluatology Forum Presentations 2 - Benchmarkology and Performance Evaluation, Nanhu Hall, 3rd floor |
| 9:00-9:20 | Evaluating the Performance of Complex Textual Tasks Generated by Large Language Models | Fenglin Bi (East China Normal University) |
| 9:20-9:40 | A Framework for Evaluating Cultural Bias and Historical Misconceptions in LLM Outputs | Moon-Kuen Mak (Chinese Academy of Sciences) |
| 9:40-10:00 | Patrick Star: A Comprehensive Benchmark for Multi-Modal Image Editing | Di Cheng (Beijing Institute of Fashion Technology) |
| 10:00-10:20 | AICB: a Benchmark Suite for Evaluating the Communication Subsystem of LLM Training Clusters | Gang Lu (Alibaba) |
| 10:20-10:40 | A Performance Evaluation Method for Recommendation Model Training on Heterogeneous NPUs | Qiang Liu (Tencent) |
| 10:40-11:00 | Design and Practice of Performance Evaluation System for High Performance General-purpose CPU | Weijun Zhong (China Electronics Standardization Institute) |
| 11:00-11:20 | A Context-Driven Benchmark for Evaluating Task Management Capabilities of Digital Assistants | Jiachen Du (Tsinghua University) |
                                    
                                    
| Dec. 6, afternoon |
| Parallel Session I: Evaluatology Forum Presentations 3 - Evaluatology Applications Across Multi-Disciplines, Juyi Hall, 3rd floor |
| 14:00-14:20 | Missing materials data imputation workflow towards improving the prediction performance of machine learning | Yue Liu (Shanghai University) |
| 14:20-14:40 | Research on intelligent traffic surveillance video compression quality assessment method | Xiangnan Zhao (National Institute of Metrology, China) |
| 14:40-15:00 | Research on Multidimensional Evaluation Technology of Teachers' Digital Literacy Based on Large Language Models | Di Fan (Northeastern University, China) |
| 15:00-15:20 | Knuth Test: Enhancing Assessment Accuracy in Introductory Computer Science Education | Yu Du (Institute of Computing Technology, Chinese Academy of Sciences) |
| 15:20-15:40 | MixSchedSim: A Simulator for Mixed Workload Scheduling in Heterogeneous Computing Environments | Fei Tang (Inspur) |
| 15:40-16:00 | BigTensorDB-Coupled Artificial Intelligence for Science: A Retrosynthetic Analysis Case Study | Xueya Zhang (University of Chinese Academy of Sciences) |
| 16:00-16:20 | An Experimental study on Evaluating Senior High School Gifted Talented Students’ Academic Literacy | Ping Lei |
| 16:20-16:40 | Real-World Drug Clinical Research Based on Artificial Intelligence | Kunqian Yu (Chinese Academy of Sciences) |
| 16:40-17:00 | CodeAgent - Collaborative Agents for Software Engineering | Daniel Tang (University of Luxembourg) |
                                    
| Parallel Session II: Tbench Forum, Nanhu Hall, 3rd floor |
| 14:00-14:20 | Could bibliometrics reveal top science and technology achievements and researchers? The case for evaluatology-based science and technology evaluation | Wanling Gao (University of Chinese Academy of Sciences) |
| 14:20-14:40 | BinCodex: A comprehensive and multi-level dataset for evaluating binary code similarity detection techniques | Peihua Zhang (Tencent) |
| 14:40-15:00 | An approach to workload generation for modern data centers: A view from Alibaba trace | Yi Liang (Beijing University of Technology) |
| 15:00-15:20 | TensorTable: Extending PyTorch for mixed relational and linear algebra pipelines | Xu Wen (Huawei) |
| 15:20-15:40 | Evaluation of mechanical properties of natural fiber based polymer composite | Tarikur Jaman Pramanik (Khulna University of Engineering & Technology) |
| 15:40-16:00 | Enhanced deep learning based decision support system for kidney tumour detection | Taha ETEM (Çankırı Karatekin University) |
| 16:00-16:20 | Analyzing the impact of opportunistic maintenance optimization on manufacturing industries in Bangladesh: An empirical study | Md. Ariful Alam (Bangladesh Army University of Science and Technology) |