Skip to content

Navigation Menu

Sign in
Appearance settings

Search code, repositories, users, issues, pull requests...

Provide feedback

We read every piece of feedback, and take your input very seriously.

Saved searches

Use saved searches to filter your results more quickly

Sign up
Appearance settings
@mookiezi
mookiezi
Follow
View mookiezi's full-sized avatar
💭
Thinking about thinking

Jason mookiezi

💭
Thinking about thinking
🌱 Developing software tools for archiving online platforms, with a focus on NLP

Block or report mookiezi

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Pinned Loading

  1. interface interface Public

    A Python-based interactive CLI interface for chatting with Hugging Face language models, optimized for casual, Discord-style conversation using ChatML. Supports both quantized and full-precision mo...

    Python 2

  2. dataset-cleaning-toolkit dataset-cleaning-toolkit Public

    A dataset toolbox for preparing and analyzing conversational datasets, including CSV splitting, CSV → Parquet conversion, dataset statistics, Parquet cleaning and sorting, HuggingFace–style metadat...

    Python 3

  3. dataset-pipeline dataset-pipeline Public

    A full Discord dataset pipeline with end-to-end flow from raw Discord data to final Parquet dataset with full statistics — every stage independant, idempotent, and CLI-driven for ease of automation.

    2

  4. dataset-toolbox dataset-toolbox Public

    A dataset toolbox for preparing and analyzing conversational datasets, including CSV splitting, CSV → Parquet conversion, dataset statistics, dialogue-turn filtering, turn-based filtering, token an...

    Python

AltStyle によって変換されたページ (->オリジナル) /