250709-250716 Reading
Math Reasoning
Datasets
- arXiv’2501 URSA: Understanding and Verifying Chain-of-Thought Reasoning in Multimodal Mathematics
- CIKM’24 InfinityMATH: A Scalable Instruction Tuning Dataset in Programmatic Mathematical Reasoning
- ⭐⭐⭐⭐⭐ arXiv’2505 Reasoning with OmniThought: A Large CoT Dataset with Verbosity and Cognitive Difficulty Annotations
- ⭐⭐⭐⭐ ACL’24 CHAMP: A Competition-level Dataset for Fine-Grained Analyses of LLMs’ Mathematical Reasoning Capabilities
- ⭐⭐⭐ arXiv’2506 SciDA: Scientific Dynamic Assessor of LLMs