bfcl

Here are 6 public repositories matching this topic...

oleksandr-shyshchuk / tool-probe

Pre-generation tool-call gating via linear probes on LLM hidden states. F1 ≈ 0.91–0.94 on BFCL v4, and faster than full generation. Cross-architecture transfer across Llama / Qwen / Phi / Mistral (3B–7B) with ≥96% retention.

reproducible-research ai-safety hidden-states interpretability probing tool-use llm mechanistic-interpretability function-calling activation-patching representation-engineering bfcl linear-probe

Updated May 8, 2026
Jupyter Notebook

sukhrobnurali / tooltuned-qwen

A bf16 LoRA fine-tune of Qwen 3.5 4B for function calling on xLAM. v1.0 ships below the BFCL gate with full per-category failure analysis.

lora fine-tuning peft tool-use huggingface function-calling qwen unsloth bfcl qwen3

Updated May 16, 2026
Python

HenryMorganDibie / llm-audit-poc

Independent audit of a fine-tuned LLM tool-calling PoC — BFCL regression decomposition, inference stack risk assessment, and production recommendation for a FinTech client. Qwen-2.5, LoRA, SGLang, H100.

machine-learning inference fintech lora model-evaluation fine-tuning mlops huggingface llm vllm function-calling qwen sglang bfcl toolace

Updated Apr 27, 2026
Python

butiploka / mimo-bench-arena

Open LLM leaderboard featuring Xiaomi MiMo v2.5 & MiMo 100T head-to-head with GPT-5, Claude, Gemini, DeepSeek, Llama 4. ARC-AGI · SWE-Bench · MMLU-Pro · GPQA · HumanEval · BFCL.