What's happening.

Papers, awards, and talks — newest first.

w.
LatestINSIGHT

Why we distill 30B into 2B instead of serving the big model

Cost, latency, and where the data lives — the case for small, on-device models from our WigtnOCR work.

2026年05月22日Read
More notes

AltStyle によって変換されたページ (->オリジナル) /