Skip to content

Navigation Menu

Sign in
Appearance settings

Search code, repositories, users, issues, pull requests...

Provide feedback

We read every piece of feedback, and take your input very seriously.

Saved searches

Use saved searches to filter your results more quickly

Sign up
Appearance settings

gaowatch/veyranova

Folders and files

NameName
Last commit message
Last commit date

Latest commit

History

10 Commits

Repository files navigation

Blind Spatial Protocol (BSP)

Pure-Text LLM Desktop GUI Control via ElementMap & UIA Element ID

πŸ“„ Preprint Paper

✨ Core Innovations (Eliminate Coordinate Hallucination at Root)

  1. ElementMap Blind Operation Protocol: A structured element mapping mechanism that lets pure-text LLMs directly reference UIA element IDs to locate GUI components
  2. Zero Coordinate Guessing: Fundamentally eliminates the coordinate hallucination problem that plagues all vision-based agents, ensuring 100% operation accuracy
  3. Vision-Free & Privilege-Free: No screenshots, no elevated system privileges, accessible to all ordinary users out of the box
  4. Complete Tool-Call Parsing Pipeline: End-to-end parsing of LLM instructions to ensure accurate execution of operations
  5. Constitutional-Level Security Pre-Check: Built-in inviolable security rules to prevent malicious operations and protect system safety
  6. Pure-Text LLM Native Support: Compatible with any pure-text large language model, no multimodal model required

🎯 Related Project

This paper corresponds to the open-source desktop agent project VeyraNova, which is under active development, with its core implementation based on the BSP protocol.

About

No description, website, or topics provided.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

Contributors

AltStyle γ«γ‚ˆγ£γ¦ε€‰ζ›γ•γ‚ŒγŸγƒšγƒΌγ‚Έ (->γ‚ͺγƒͺγ‚ΈγƒŠγƒ«) /