CanhuaChen v01cano
- Beijing, China
-
23:26
(UTC -12:00)
Highlights
- Pro
Stars
本项目旨在分享大模型相关技术原理以及实战经验(大模型工程化、大模型应用落地)
总结梳理自然语言处理工程师(NLP)需要积累的各方面知识,包括面试题,各种基础知识,工程能力等等,提升核心竞争力
中英文敏感词、语言检测、中外手机/电话归属地/运营商查询、名字推断性别、手机号抽取、身份证抽取、邮箱抽取、中日文人名库、中文缩写库、拆字词典、词汇情感值、停用词、反动词表、暴恐词表、繁简体转换、英文模拟中文发音、汪峰歌词生成器、职业名称词库、同义词库、反义词库、否定词库、汽车品牌词库、汽车零件词库、连续英文切割、各种中文词向量、公司名字大全、古诗词库、IT词库、财经词库、成语词库、地名词库、...
快速上手AI理论及应用实战:基础知识、Transformer、NLP、ML、DL、竞赛。含大量注释及数据集,力求每一位能看懂并复现。
零知识证明入门教程。Comprehensive Zero-Knowledge Proofs Tutorial. #zk #WIP
《Pytorch实用教程》(第二版)无论是零基础入门,还是CV、NLP、LLM项目应用,或是进阶工程化部署落地,在这里都有。相信在本书的帮助下,读者将能够轻松掌握 PyTorch 的使用,成为一名优秀的深度学习工程师。
中文LLaMA-2 & Alpaca-2大模型二期项目 + 64K超长上下文模型 (Chinese LLaMA-2 & Alpaca-2 LLMs with 64K long context models)
本『ChatGPT资源库(原理/微调/代码/论文)』的初始版本来自July CSDN博客上阅读量高达50万的ChatGPT系列,联合发起人:七月ChatGPT原理课学员,6月初正式对外发布
Academic Papers about LLM Application on Security
A collection of important graph embedding, classification and representation learning papers with implementations.
Flash-IDS is an open-source system developed by the DART Laboratory for advanced intrusion detection using provenance graph representation learning. It implements the techniques presented in our IE...
🔥🌟《Machine Learning 格物志》: ML + DL + RL basic codes and notes by sklearn, PyTorch, TensorFlow, Keras & the most important, from scratch!💪 This repository is ALL You Need!
Implementation and experiments of graph embedding algorithms.
A Python Library for Graph Outlier Detection (Anomaly Detection)
《开源大模型食用指南》针对中国宝宝量身打造的基于Linux环境快速微调(全参数/Lora)、部署国内外开源大模型(LLM)/多模态大模型(MLLM)教程
log anomaly detection toolkit including DeepLog
Log Parsing with Prompt-based Few-shot Learning (ICSE 2023, Technical Track)
A large collection of system log datasets for AI-driven log analytics [ISSRE'23]
A machine learning toolkit for log parsing [ICSE'19, DSN'16]
Log-based Anomaly Detection with Deep Learning: How Far Are We? (ICSE 2022, Technical Track)
A comprehensive guide to understanding and implementing large language models with hands-on examples using LangChain for GenAI applications.
LangChain 的中文入门教程
[TMLR] A curated list of language modeling researches for code (and other software engineering activities), plus related datasets.
Graph Representation Approach for Streaming Graph