Skip to content

Navigation Menu

Sign in
Appearance settings

Search code, repositories, users, issues, pull requests...

Provide feedback

We read every piece of feedback, and take your input very seriously.

Saved searches

Use saved searches to filter your results more quickly

Sign up
Appearance settings

abbinavv/AmazonML

Repository files navigation

πŸ† AmazonML Price Prediction Project

An advanced machine learning project for predicting product prices using ensemble methods, meta-learning, and computer vision techniques. Achieved 50.12% SMAPE using sophisticated stacking approaches.

πŸš€ Quick Start

# Clone the repository
git clone <repository-url>
cd amazonml-price-prediction
# Install dependencies
pip install -r requirements.txt
# Run the best model (Meta-Learning)
cd models
python meta_learning_model.py

πŸ“ Project Structure

β”œβ”€β”€ models/ # 🎯 Main optimized models (BEST)
β”‚ β”œβ”€β”€ meta_learning_model.py # πŸ₯‡ Best: 50.12% SMAPE
β”‚ β”œβ”€β”€ neural_enhanced_model.py # πŸ₯ˆ 50.45% SMAPE 
β”‚ └── computer_vision_model.py # πŸ₯‰ ~50% SMAPE
β”‚
β”œβ”€β”€ experiments/ # πŸ§ͺ Research & development
β”‚ β”œβ”€β”€ ensemble/ # Ensemble approaches
β”‚ β”œβ”€β”€ optimization/ # Advanced optimizations
β”‚ └── legacy/ # Earlier experiments
β”‚
β”œβ”€β”€ results/ # πŸ“Š Model outputs & analysis
β”œβ”€β”€ scripts/ # πŸ”§ Utility scripts
β”œβ”€β”€ src/ # πŸ“š Core utilities
β”œβ”€β”€ dataset/ # πŸ’Ύ Training/test data
└── image_cache/ # πŸ–ΌοΈ Downloaded images (2,183 files)

πŸ† Model Performance

Model SMAPE Status Features
Meta-Learning 50.12% βœ… Best Advanced stacking + comprehensive features
Neural Enhanced 50.45% πŸ₯ˆ Second Deep learning + feature interactions
Computer Vision ~50% πŸ§ͺ Experimental Image features + text analysis
Ensemble Models 70-80% ❌ Need work Various ensemble attempts

🎯 Target: <48% SMAPE | πŸ“ˆ Best Achievement: 50.12% SMAPE

πŸ”§ Key Features

  • 🧠 Meta-Learning: Advanced stacking with 7 base models
  • πŸ–ΌοΈ Computer Vision: Real image feature extraction (2,183+ images)
  • πŸ“ NLP: Comprehensive text feature engineering
  • ⚑ Ensemble Methods: Multiple ensemble approaches tested
  • πŸ“Š Robust Validation: Cross-validation with proper SMAPE optimization

πŸ“‹ Requirements

  • Python 3.8+
  • pandas, numpy, scikit-learn
  • lightgbm, xgboost
  • PIL (for image processing)
  • See requirements.txt for full list

πŸš€ Usage Examples

Run Best Model

cd models
python meta_learning_model.py

Run Computer Vision Model

cd models 
python computer_vision_model.py

Resume Image Downloads

cd scripts
python download_computer_vision_images.py

πŸ“Š Results & Analysis

  • Training Data: 75,000 samples
  • Test Data: 75,000 samples
  • Image Dataset: 140,587 unique URLs (2,183+ downloaded)
  • Feature Engineering: 600+ features per model
  • Cross-Validation: 5-fold stratified

See results/ folder for detailed performance analysis.

πŸ§ͺ Experimental Work

The experiments/ folder contains extensive research:

  • Ensemble Methods: Gradient boosting combinations
  • Optimization: Advanced hyperparameter tuning
  • Legacy Models: Early development iterations

🀝 Contributing

  1. Fork the repository
  2. Create feature branch (git checkout -b feature/amazing-feature)
  3. Commit changes (git commit -m 'Add amazing feature')
  4. Push to branch (git push origin feature/amazing-feature)
  5. Open Pull Request

πŸ“„ License

This project is licensed under the MIT License - see the LICENSE file for details.

πŸ™ Acknowledgments

  • Advanced machine learning techniques
  • Meta-learning and stacking approaches
  • Computer vision for e-commerce
  • Ensemble method research

⭐ Star this repo if it helped you!

About

No description, website, or topics provided.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

AltStyle γ«γ‚ˆγ£γ¦ε€‰ζ›γ•γ‚ŒγŸγƒšγƒΌγ‚Έ (->γ‚ͺγƒͺγ‚ΈγƒŠγƒ«) /