Skip to content

Navigation Menu

Sign in
Appearance settings

Search code, repositories, users, issues, pull requests...

Provide feedback

We read every piece of feedback, and take your input very seriously.

Saved searches

Use saved searches to filter your results more quickly

Sign up
Appearance settings

Transform your camera captures into immersive audio-visual experiences using cutting-edge AI

Notifications You must be signed in to change notification settings

bilsimaging/soundsnapper

Repository files navigation

๐ŸŽต SoundSnapper: AI-Powered Reality Remix

Transform your camera captures into immersive audio-visual experiences using cutting-edge AI

Nano Banana Hackathon Gemini 2.5 Fal AI ElevenLabs

Transform your camera captures โ€” AI banner


โ“ The Problem

Creating engaging audio-visual content typically requires expensive software, technical skills, and hours of editing. Most people can't instantly transform everyday objects into creative, shareable experiences.


๐Ÿ’ก Our Solution

SoundSnapper makes creativity one-tap simple:
๐Ÿ“ท Snap โ†’ ๐Ÿง  Analyze โ†’ ๐ŸŽจ Transform โ†’ ๐ŸŽต Generate โ†’ โœจ Share

A seamless fusion of reality and AI-powered imagination.


๐ŸŒŸ Key Features

  • ๐Ÿ“ธ Instant Camera Capture - Intuitive mobile-first interface
  • ๐Ÿง  AI Scene Intelligence - Gemini 2.5 Flash understands your photos
  • ๐ŸŽจ Artistic Transformations - Anime, Cyberpunk, Watercolor & more
  • ๐ŸŽต Immersive Soundscapes - ElevenLabs generates matching audio
  • ๐Ÿ”Š Interactive Controls - Volume, zoom, and playback options
  • ๐Ÿ“ฑ Responsive Design - Works perfectly on any device
  • โšก No Setup Required - Try instantly without API keys

๐ŸŽฏ Who It's For

๐ŸŽฌ Content Creators - Turn mundane objects into viral TikTok moments
๐Ÿ“š Educators - Help kids discover the "sounds" of everyday items
๐ŸŽถ Musicians - Find inspiration in unexpected visual-audio combinations
๐Ÿข Brands - Create interactive campaigns with object-to-sound experiences


๐Ÿš€ Real-World Examples

  • ๐Ÿ“ฑ Social Media: Snap your coffee โ†’ Get cyberpunk visuals + cafรฉ ambiance
  • ๐ŸŽ“ Education: Kids explore how different materials "sound" in their imagination
  • ๐ŸŽต Music Production: Random objects spark new ambient textures
  • ๐Ÿ›๏ธ Marketing: Product scans generate branded soundscapes

๐ŸŽฅ Live Demo

๐ŸŒ Try SoundSnapper Now (No Setup Required)

๐ŸŽฌ Watch Demo Video
SoundSnapper Demo


๐Ÿ”ฎ Roadmap

  • ๐Ÿ“ฑ TikTok/Reels Export - Vertical video output with audio sync
  • ๐ŸŽฏ Multi-Object Mode - Layer multiple items for complex soundscapes
  • ๐ŸŽญ Style Packs - Premium themes (Retro, Minimal, Sci-Fi)
  • ๐Ÿ—‚๏ธ Personal Gallery - Save and revisit your creations
  • ๐ŸŒ Community Hub - Share and remix with others
  • ๐Ÿ›ก๏ธ Privacy-First - Zero data retention, ephemeral processing

๐Ÿ› ๏ธ Tech Stack

Frontend: React 19 + TypeScript + Vite
AI Vision: Google Gemini 2.5 Transformations: Fal AI (gemini-25-flash-image/edit)
Audio Generation: ElevenLabs API
UI/UX: Custom CSS with Glassmorphism
Deployment: Vercel + Serverless Functions


โšก Quick Start

Prerequisites

Setup

# Clone & Install
git clone https://github.com/bilsimaging/soundsnapper.git
cd soundsnapper
npm install
# Configure Environment
cp .env.example .env.local
# Add your API keys to .env.local
# Launch
npm run dev
# Open http://localhost:5173

โš ๏ธ Security Note: Use serverless functions to proxy API calls and protect keys.


๐ŸŽฎ How to Use

  1. ๐Ÿ“ท Grant camera access when prompted
  2. ๐Ÿ“ธ Snap a photo of any object
  3. โณ Wait for AI magic (analysis + audio generation)
  4. ๐ŸŽจ Choose your style (Anime, Cyberpunk, etc.)
  5. โœจ Apply transformation and enjoy the result
  6. ๐Ÿ”Š Adjust volume or zoom to view full-size
  7. ๐Ÿ“ค Share your creation with the world

๐Ÿ† Competition Entry - Google Nano Banana Hackathon 2025 ๐ŸŒ

๐ŸŽฏ Judging Criteria Alignment

โœจ Innovation & "Wow" Factor (40%)
SoundSnapper pioneers a new creative medium: instant reality-to-art transformation with synchronized soundscapes. This multi-modal AI pipeline (vision โ†’ transformation โ†’ audio) creates magical experiences impossible before Gemini 2.5 Flash.

โš™๏ธ Technical Excellence (30%)
Modern React 19 architecture with TypeScript, secure serverless API proxying, mobile-optimized responsive design, and seamless integration of three AI services.

๐ŸŒ Real Impact (20%)
Democratizes creative content creation for millions - from TikTok creators to classroom teachers to music producers. Removes technical barriers to artistic expression.

๐ŸŽฅ Presentation Quality (10%)
Professional live demo, clear documentation, and engaging video showcase demonstrate the full potential.


๐Ÿง  Gemini 2.5 Flash Integration

Gemini 2.5 Flash Image ("nano banana" technology) is SoundSnapper's intelligent core, accessed via Fal AI's fal-ai/gemini-25-flash-image/edit endpoint.

Core Capabilities:

  • ๐Ÿ” Scene Understanding - Recognizes objects, materials, environments, and context
  • ๐ŸŽจ Style Generation - Creates artistic transformations (Anime, Cyberpunk, Watercolor)
  • ๐Ÿง  Smart Context - Provides rich descriptions for audio generation

The Magic Flow:

  1. Photo captured โ†’ Gemini analyzes visual elements
  2. Gemini generates artistic style variants via Fal AI
  3. Scene understanding informs ElevenLabs audio creation
  4. Result: Perfectly matched visual + audio experience

Gemini 2.5 Flash is the "brain" that makes everything possible - understanding your photos and transforming them into creative art while providing context for matching soundscapes. Without nano banana technology, SoundSnapper couldn't bridge the gap between visual input and meaningful audio-visual output.


๐Ÿค Contributing

While this is a hackathon project, contributions are welcome:

  • ๐Ÿ› Report bugs via GitHub Issues
  • ๐Ÿ’ก Suggest features for future versions
  • โญ Star the repo if you love the concept!

๐Ÿ“„ License

MIT License

Copyright (c) 2025 Bilsimaging


๐Ÿ™ Acknowledgments

  • Google for Gemini 2.5 Flash Image technology
  • Fal for providing seamless API access
  • ElevenLabs for revolutionary audio generation
  • Nano Banana Hackathon organizers for this amazing opportunity

Made with โค๏ธ by Bilsimaging for the Nano Banana Hackathon 2025 ๐ŸŒ

About

Transform your camera captures into immersive audio-visual experiences using cutting-edge AI

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

AltStyle ใซใ‚ˆใฃใฆๅค‰ๆ›ใ•ใ‚ŒใŸใƒšใƒผใ‚ธ (->ใ‚ชใƒชใ‚ธใƒŠใƒซ) /