Youngchae (James) Chee 지영채 litcoderr
KAIST EE IVYLAB
- litcoderr.github.io
Organizations
@HanyangTechAI
Starred repositories
Collection of evaluation code for natural language generation.
HYUFA is an AI finance assistant for university students and young professionals, covering everything from basic financial knowledge to personalized portfolio guidance.
[ICCV 2023] Official PyTorch implementation of FocalFormer3D
[CVPR 2025 Best Paper Award] VGGT: Visual Geometry Grounded Transformer
An additive cyclic noise shader that makes a cool effect.
[AAAI 2024] NuScenes-QA: A Multi-modal Visual Question Answering Benchmark for Autonomous Driving Scenario.
VS-TDX benchmark, assessing VLMs on their capacity for sensor-specific reasoning.
[INTERSPEECH 2025] Official code for "SEED: Speaker Embedding Enhancement Diffusion Model"
A web-based collaborative LaTeX editor
Boost LaTeX typesetting efficiency with preview, compile, autocomplete, colorize, and more.
🤗 LeRobot: Making AI for Robotics more accessible with end-to-end learning
Accelerated First Order Parallel Associative Scan
[ICLR 2024] Fine-tuning LLaMA to follow Instructions within 1 Hour and 1.2M Parameters
[ACL 2025] Rethinking Step-by-step Visual Reasoning in LLMs
Unofficial implementation of Titans, SOTA memory for transformers, in PyTorch
High-contrast, Futuristic & Vibrant Neovim Colorscheme
Extremely fast Query Engine for DataFrames, written in Rust
[Technical Report] Official PyTorch implementation code for realizing the technical part of Phantom of Latent representing equipped with enlarged hidden dimension to build super frontier vision lan...
g1: Using Llama-3.1 70b on Groq to create o1-like reasoning chains
An API wrapper for Discord written in Python.
Open-source multimodal large language model that can hear and talk while thinking, featuring real-time end-to-end speech input and streaming audio output conversational capabilities.
[ECCV 2024] FlexAttention for Efficient High-Resolution Vision-Language Models
A self-supervised learning framework for audio-visual speech
[AAAI-24] VVS: Video-to-Video Retrieval With Irrelevant Frame Suppression
Official Implementation of RoboCLIP (NeurIPS 2023)
Build your own AI-powered collaborative markdown editor in just 5 minutes
Empower your CLI experience with a command search tool driven by LLM magic!
S3D Text-Video model trained on HowTo100M using MIL-NCE