Projects

Open models, datasets, benchmarks, and developer tools that I share through GitHub and HuggingFace. GitHub entries below exclude forked repositories.

DatasetMath reasoning8,237 downloads last month

AIME 1983-2024

A curated AIME benchmark collection for olympiad-style mathematical reasoning and test-time search evaluation.

ModelScientific intelligence1,742 downloads last month

ChemVLM

Multimodal large language models and instruction data for chemistry reasoning across text, molecular, and visual inputs.

ModelChemistry LLM1,658 downloads last month

ChemLLM

Chemical large language models released for chemistry question answering, molecular reasoning, and scientific language tasks.

GitHubLLM reasoning1,034 stars

MathBlackBox

An open-source project around mathematical reasoning workflows and black-box problem solving.

HuggingFace Resources

Reasoning Data

Science Data

Models

GitHub Projects

Python1,034 stars

MathBlackBox

Mathematical reasoning project for black-box problem solving workflows.

XML5 stars

citeclaw

Bun-migrated Citoid-style citation pipeline work for reference-first agent workflows.

Python3 stars

CodeMem

Code memory experiments for agentic software development workflows.

Python3 stars

EZTinker

A compact RL-as-a-service demo inspired by Tinker-style post-training workflows.

Python2 stars

PaddleOCR-MCP

Fast PaddleOCR MCP server for extracting text from images.

Rust1 star

ra

Rust-native agent work with ACP, A2A, shell tools, file editing, search, MCP, and remote-agent integration.

TypeScript0 stars

openlsp

OpenLSP CLI for coding-agent language intelligence.