
CLIP the Landscape: Automated Tagging of Crowdsourced Landscape Images


This repo contains our CLIP-based, multi-modal classifiers for the Kaggle 'Predict Geographic Context from Landscape Photographs' challenge on the Geograph dataset. It provides scripts to:

  • Download and preprocess training and test sets
  • Train MLP and linear classifiers on CLIP image, title, and location embeddings, alone or in combination (sketched below)
  • Evaluate model performance and generate Kaggle-ready submission files (.csv.zip)
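
For a concrete sense of the embed-then-classify step, here is a minimal sketch assuming the Hugging Face transformers CLIP API; the model checkpoint, tag count, and toy batch are illustrative placeholders, not the repo's actual configuration.

# Minimal sketch: embed a photo and its title with CLIP, then train a
# linear multi-label head on the concatenated embeddings. The checkpoint,
# tag count, and toy batch below are placeholders, not the repo's config.
import torch
import torch.nn as nn
from PIL import Image
from transformers import CLIPModel, CLIPProcessor

clip = CLIPModel.from_pretrained("openai/clip-vit-base-patch32")
processor = CLIPProcessor.from_pretrained("openai/clip-vit-base-patch32")

def embed(image_path: str, title: str) -> torch.Tensor:
    """Return the concatenated CLIP image + title embedding for one photo."""
    inputs = processor(text=[title], images=Image.open(image_path),
                       return_tensors="pt", padding=True)
    with torch.no_grad():
        img = clip.get_image_features(pixel_values=inputs["pixel_values"])
        txt = clip.get_text_features(input_ids=inputs["input_ids"],
                                     attention_mask=inputs["attention_mask"])
    return torch.cat([img, txt], dim=-1).squeeze(0)  # 512 + 512 = 1024 dims

NUM_TAGS = 20                       # placeholder; the challenge fixes the tag set
head = nn.Linear(1024, NUM_TAGS)    # linear variant; an MLP head also works
criterion = nn.BCEWithLogitsLoss()  # multi-label: each tag is an independent yes/no
optimizer = torch.optim.Adam(head.parameters(), lr=1e-3)

# One training step on a toy batch (X: precomputed embeddings, Y: 0/1 tags).
X = torch.randn(8, 1024)
Y = torch.randint(0, 2, (8, NUM_TAGS)).float()
optimizer.zero_grad()
loss = criterion(head(X), Y)
loss.backward()
optimizer.step()

In practice the embeddings would be precomputed once for the whole dataset and the head trained over many epochs; concatenating image and title vectors is one simple way to combine modalities, mirroring the "alone or in combination" options above.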

πŸ–ΌοΈ The paper is published in Remote Sensing Applications: Society and Environment [https://doi.org/10.1016/j.rsase.2025.101824].

📃 The preprint is available on arXiv [https://arxiv.org/pdf/2506.12214].

✍️ Authors: Ilya Ilyankou*, Natchapon Jongwiriyanurak*, Tao Cheng, and James Haworth

*Equal contribution

Setup

We suggest running the notebooks in a separate virtual environment. Using miniconda:

# Navigate to the project folder
cd ClipTheLandscape
# Create a new virtual environment
conda env create -f environment.yml
# Activate that new virtual environment
conda activate clip-the-landscape
# Run Jupyter (will open in your default browser) or use VSCode instead
jupyter lab
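
Once the environment is active, a quick sanity check in a notebook cell confirms the core dependency resolved correctly (this assumes the environment provides PyTorch; the exact CLIP package is pinned by environment.yml and is not guessed here):

# Sanity check: confirm PyTorch imports and report whether a GPU is visible.
import torch
print("torch", torch.__version__)
print("CUDA available:", torch.cuda.is_available())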

Examples of misclassified images

This section illustrates the subjectivity of labelling: our model's predicted tags are often as appropriate as (or even more appropriate than) the original annotations. Tags like Canals, Air transport, Railways, and Burial ground, which represent distinct and objective features, achieve high $F_1$ scores; less visually pronounced tags like Flat landscapes and Lowlands perform poorly.
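
Per-tag $F_1$ scores of this kind can be computed directly from the multi-label prediction matrix; the sketch below uses scikit-learn, with an illustrative tag subset and made-up predictions rather than the repo's actual outputs.

# Per-tag F1 for multi-label predictions, using scikit-learn.
import numpy as np
from sklearn.metrics import f1_score

tags = ["Canals", "Air transport", "Railways", "Flat landscapes"]  # illustrative subset
y_true = np.array([[1, 0, 0, 1],
                   [0, 1, 0, 0],
                   [0, 0, 1, 1]])
y_pred = np.array([[1, 0, 0, 0],
                   [0, 1, 0, 1],
                   [0, 0, 1, 0]])
# average=None returns one F1 per column, i.e. per tag.
for tag, f1 in zip(tags, f1_score(y_true, y_pred, average=None)):
    print(f"{tag}: {f1:.2f}")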

[Figure: misclassified images]

Cite

@article{clip-the-landscape,
 title = {CLIP the landscape: Automated tagging of crowdsourced landscape images},
 journal = {Remote Sensing Applications: Society and Environment},
 volume = {41},
 pages = {101824},
 year = {2026},
 issn = {2352-9385},
 doi = {10.1016/j.rsase.2025.101824},
 url = {https://www.sciencedirect.com/science/article/pii/S2352938525003775},
 author = {Ilya Ilyankou and Natchapon Jongwiriyanurak and Tao Cheng and James Haworth}
}

License

The code is released under the MIT license. The Geograph images are available under the CC-BY-SA 2.0 license.
