Skip to content

Navigation Menu

Sign in
Appearance settings

Search code, repositories, users, issues, pull requests...

Provide feedback

We read every piece of feedback, and take your input very seriously.

Saved searches

Use saved searches to filter your results more quickly

Sign up
Appearance settings

Releases: openprose/press

OOLONG Eval Data v1

12 Feb 17:32
@irl-dan irl-dan

Choose a tag to compare

Pre-built OOLONG eval dataset (trec_coarse split from oolongbench/oolong-synth).

Contents: 550 rows of trec_coarse validation data across 11 context lengths (1K, 2K, 4K, 8K, 16K, 32K, 64K, 128K, 256K, 512K, 1M) — matching the RLM paper's evaluation configuration.

Format: Gzipped JSONL (validation.jsonl.gz), ~134 MB compressed, ~535 MB decompressed.

Usage: The eval harness downloads this automatically via npx tsx eval/download.ts (default --from-release mode). Use --from-hf to regenerate from HuggingFace instead.

Assets 4
Loading

AltStyle によって変換されたページ (->オリジナル) /