Commit ba9653a

Authored by clampr, ngehrsitz, LairdStreak, and andersoncarlosfs

Version 1.7.0 (#184)

* Change numpy.NaN to numpy.nan for compatibility with numpy>2.0.0 (#160)
* Version 1.7.0: Initial commit
* Adds developer quality-of-life enhancements (#138), TAG: #HACKTOBERFEST2023
  1. Adds a Makefile for running tasks
  2. Adds make targets for black formatting and pylint + flake8
  3. Creates a requirements_dev file for dev setup
  4. Uses pip-tools to generate the appropriate requirements file
  * Update Python file formatting via black
  * Fix up some flake8 and pylint issues (ignore E203 space before ":" and W503 newline before binary operator)
  * Update README with developer instructions; add class and package diagrams [pyreverse]
* Adding proxy configuration (#143): add support for proxies, reshape the code, decompress the response, rename variables, fix errors
* Initial commit for hourly time series
* feat: rename diverging columns
* feat: virtual columns
* feat: enable daily/monthly interfaces for new data access
* feat: retain normals behavior
* fix: broken inventory mask
* fix: series normalization
* chore: update requirements
* fix: formatting and linting (ignore import errors when linting)
* fix: install pytest
* fix: filter cols before interpolation
* fix: pass fill_value
* fix: convert to numpy float64 for interpolation
* feat: cleanup; update README
* feat: add default start & end date for daily data
* fix: use correct freq
* feat: order cols, use joined flags
* fix: silence future warnings

Co-authored-by: Christian Lamprecht <christian.lamprecht@aol.de>
Co-authored-by: ngehrsitz <45375059+ngehrsitz@users.noreply.github.com>
Co-authored-by: Laird Streak <laird.streak@bentley.com>
Co-authored-by: Anderson Carlos Ferreira da Silva <andersoncarlosfs@outlook.com>
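The first listed change can be illustrated in isolation: numpy 2.0 removed the `np.NaN` alias, so the lowercase `np.nan` spelling is the one that works across versions. A minimal sketch (not code from this repository):

```python
import numpy as np

# numpy 2.0 removed the `np.NaN` alias; `np.nan` works on both
# numpy < 2.0 and numpy >= 2.0, which is what this release adopts.
value = np.nan
print(np.isnan(value))  # True
```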
1 parent 3556cca commit ba9653a

File tree

26 files changed: +456 −321 lines changed


.pylintrc

Lines changed: 3 additions & 7 deletions

```diff
@@ -1,16 +1,12 @@
 [MESSAGES CONTROL]
 
-disable=print-statement,
-    singleton-comparison,
-    no-member,
+disable=no-member,
     too-few-public-methods,
     protected-access,
-    inconsistent-return-statements,
     import-outside-toplevel,
     duplicate-code,
-    import-error,
-    nan-comparison,
-    consider-using-set-comprehension
+    too-many-positional-arguments,
+    import-error
 
 [BASIC]
 
```

README.md

Lines changed: 1 addition & 1 deletion

```diff
@@ -2,7 +2,7 @@
 
 The Meteostat Python library provides a simple API for accessing open weather and climate data. The historical observations and statistics are collected by [Meteostat](https://meteostat.net) from different public interfaces, most of which are governmental.
 
-Among the data sources are national weather services like the National Oceanic and Atmospheric Administration (NOAA) and Germany's national meteorological service (DWD).
+Among the data sources are national weather services like the National Oceanic and Atmospheric Administration (NOAA) and Germany's national weather service (DWD).
 
 Are you looking for a **hosted solution**? Try our [JSON API](https://rapidapi.com/meteostat/api/meteostat/).
 
```

examples/daily/compare_aggregate.py

Lines changed: 1 addition & 1 deletion

```diff
@@ -28,7 +28,7 @@
 data = Daily(stations, start, end)
 
 # Aggregate annually
-data = data.aggregate(freq="1Y").fetch()
+data = data.aggregate(freq="1YE").fetch()
 
 # Plot chart
 fig, ax = plt.subplots(figsize=(8, 6))
```

examples/monthly/aggregate.py

Lines changed: 1 addition & 1 deletion

```diff
@@ -19,7 +19,7 @@
 # Get monthly data
 # Then, aggregate annually
 data = Monthly("72202", start, end)
-data = data.normalize().aggregate(freq="1Y").fetch()
+data = data.normalize().aggregate(freq="1YE").fetch()
 
 # Plot chart
 data.plot(y="tavg")
```

meteostat/__init__.py

Lines changed: 12 additions & 1 deletion

```diff
@@ -12,7 +12,7 @@
 """
 
 __appname__ = "meteostat"
-__version__ = "1.6.8"
+__version__ = "1.7.0"
 
 from .interface.base import Base
 from .interface.timeseries import TimeSeries
@@ -22,3 +22,14 @@
 from .interface.daily import Daily
 from .interface.monthly import Monthly
 from .interface.normals import Normals
+
+__all__ = [
+    "Base",
+    "TimeSeries",
+    "Stations",
+    "Point",
+    "Hourly",
+    "Daily",
+    "Monthly",
+    "Normals",
+]
```
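What the new `__all__` list buys: `from meteostat import *` now binds only the listed public names. A minimal, self-contained illustration using a throwaway module (not the real package):

```python
import sys
import types

# Build a fake module with two classes but only one name in __all__.
demo = types.ModuleType("meteostat_demo")
demo.Daily = type("Daily", (), {})
demo.Hourly = type("Hourly", (), {})
demo.__all__ = ["Daily"]
sys.modules["meteostat_demo"] = demo

# A star-import only pulls in the names listed in __all__.
namespace = {}
exec("from meteostat_demo import *", namespace)
public = sorted(k for k in namespace if not k.startswith("__"))
print(public)  # ['Daily']
```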

meteostat/core/cache.py

Lines changed: 0 additions & 2 deletions

```diff
@@ -53,7 +53,6 @@ def clear_cache(cls, max_age: int = None) -> None:
         """
 
         if os.path.exists(cls.cache_dir + os.sep + cls.cache_subdir):
-
             # Set max_age
             if max_age is None:
                 max_age = cls.max_age
@@ -63,7 +62,6 @@ def clear_cache(cls, max_age: int = None) -> None:
 
             # Go through all files
             for file in os.listdir(cls.cache_dir + os.sep + cls.cache_subdir):
-
                 # Get full path
                 path = os.path.join(cls.cache_dir + os.sep + cls.cache_subdir, file)
 
```
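The idea behind `clear_cache` can be shown standalone: delete cached files whose modification time exceeds `max_age` seconds. A hedged sketch with simplified names (not the library's actual implementation):

```python
import os
import tempfile
import time


def clear_old_files(directory: str, max_age: int) -> int:
    """Remove files older than max_age seconds; return how many were removed."""
    removed = 0
    now = time.time()
    for name in os.listdir(directory):
        path = os.path.join(directory, name)
        if os.path.isfile(path) and now - os.path.getmtime(path) > max_age:
            os.remove(path)
            removed += 1
    return removed


with tempfile.TemporaryDirectory() as cache_dir:
    stale = os.path.join(cache_dir, "stale.csv")
    fresh = os.path.join(cache_dir, "fresh.csv")
    for path in (stale, fresh):
        with open(path, "w") as f:
            f.write("data")
    # Age the first file artificially by two days
    old = time.time() - 2 * 24 * 60 * 60
    os.utime(stale, (old, old))
    removed = clear_old_files(cache_dir, 24 * 60 * 60)
    print(removed)  # 1
```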

meteostat/core/loader.py

Lines changed: 26 additions & 28 deletions

```diff
@@ -8,16 +8,19 @@
 The code is licensed under the MIT license.
 """
 
+from io import BytesIO
+from gzip import GzipFile
+from urllib.request import Request, ProxyHandler, build_opener
 from urllib.error import HTTPError
 from multiprocessing import Pool
 from multiprocessing.pool import ThreadPool
-from typing import Callable, Union
+from typing import Callable, List, Optional
 import pandas as pd
 from meteostat.core.warn import warn
 
 
 def processing_handler(
-    datasets: list, load: Callable[[dict], None], cores: int, threads: int
+    datasets: List, load: Callable[[dict], None], cores: int, threads: int
 ) -> None:
     """
     Load multiple datasets (simultaneously)
@@ -28,10 +31,8 @@ def processing_handler(
 
     # Multi-core processing
     if cores > 1 and len(datasets) > 1:
-
         # Create process pool
         with Pool(cores) as pool:
-
             # Process datasets in pool
             output = pool.starmap(load, datasets)
 
@@ -41,10 +42,8 @@ def processing_handler(
 
     # Multi-thread processing
     elif threads > 1 and len(datasets) > 1:
-
         # Create process pool
         with ThreadPool(threads) as pool:
-
             # Process datasets in pool
             output = pool.starmap(load, datasets)
 
@@ -54,49 +53,48 @@ def processing_handler(
 
     # Single-thread processing
     else:
-
         for dataset in datasets:
             output.append(load(*dataset))
 
     # Remove empty DataFrames
-    filtered = list(filter(lambda df: df.index.size > 0, output))
+    filtered = list(filter(lambda df: not df.empty, output))
 
     return pd.concat(filtered) if len(filtered) > 0 else output[0]
 
 
 def load_handler(
     endpoint: str,
     path: str,
-    columns: list,
-    types: Union[dict, None],
-    parse_dates: list,
-    coerce_dates: bool = False,
+    proxy: Optional[str] = None,
+    names: Optional[List] = None,
+    dtype: Optional[dict] = None,
+    parse_dates: Optional[List] = None,
+    default_df: Optional[pd.DataFrame] = None,
 ) -> pd.DataFrame:
     """
     Load a single CSV file into a DataFrame
     """
 
     try:
+        handlers = []
+
+        # Set a proxy
+        if proxy:
+            handlers.append(ProxyHandler({"http": proxy, "https": proxy}))
 
         # Read CSV file from Meteostat endpoint
-        df = pd.read_csv(
-            endpoint + path,
-            compression="gzip",
-            names=columns,
-            dtype=types,
-            parse_dates=parse_dates,
-        )
-
-        # Force datetime conversion
-        if coerce_dates:
-            df.iloc[:, parse_dates] = df.iloc[:, parse_dates].apply(
-                pd.to_datetime, errors="coerce"
-            )
+        with build_opener(*handlers).open(Request(endpoint + path)) as response:
+            # Decompress the content
+            with GzipFile(fileobj=BytesIO(response.read()), mode="rb") as file:
+                df = pd.read_csv(
+                    file,
+                    names=names,
+                    dtype=dtype,
+                    parse_dates=parse_dates,
+                )
 
     except (FileNotFoundError, HTTPError):
-
-        # Create empty DataFrame
-        df = pd.DataFrame(columns=[*types])
+        df = default_df if default_df is not None else pd.DataFrame(columns=names)
 
     # Display warning
     warn(f"Cannot load {path} from {endpoint}")
```
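The new download path in `load_handler` registers an optional `ProxyHandler` with `build_opener` and decompresses the gzipped response body by hand before passing it to pandas. The sketch below exercises exactly that decompression path, but with the HTTP response simulated in memory so it runs offline (the CSV payload and column names are made up for illustration):

```python
import gzip
from io import BytesIO
from urllib.request import ProxyHandler, build_opener

import pandas as pd

# Simulate a gzipped CSV body as it would arrive from the bulk endpoint.
payload = gzip.compress(b"2024,1,1,5.0\n2024,1,2,6.5\n")

# Optional proxy, as in the new load_handler signature (None disables it).
handlers = []
proxy = None  # e.g. "http://localhost:3128"
if proxy:
    handlers.append(ProxyHandler({"http": proxy, "https": proxy}))
opener = build_opener(*handlers)  # would fetch endpoint + path in real use

# Decompress the (simulated) response and hand the stream to pandas.
with gzip.GzipFile(fileobj=BytesIO(payload), mode="rb") as file:
    df = pd.read_csv(file, names=["year", "month", "day", "tavg"])

print(df.shape)  # (2, 4)
```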

meteostat/core/warn.py

Lines changed: 1 addition & 1 deletion

```diff
@@ -16,7 +16,7 @@ def _format(message, category, _filename, _lineno, _line=None) -> str:
     Print warning on a single line
     """
 
-    return "%s: %s\n" % (category.__name__, message)
+    return f"{category.__name__}: {message}\n"
 
 
 # Set warning format
```
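The rewritten formatter behaves identically to the old %-formatting version; shown standalone with an example message (the message text here is illustrative, not from the library):

```python
import warnings


def _format(message, category, _filename, _lineno, _line=None) -> str:
    """Print warning on a single line, e.g. 'UserWarning: message'."""
    return f"{category.__name__}: {message}\n"


# Install as the global warning format, as the module does.
warnings.formatwarning = _format

line = _format("Cannot load file", UserWarning, "loader.py", 99)
print(line)  # UserWarning: Cannot load file
```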

meteostat/interface/base.py

Lines changed: 10 additions & 7 deletions

```diff
@@ -9,28 +9,31 @@
 """
 
 import os
+from typing import Optional
 
 
 class Base:
-
     """
     Base class that provides features which are used across the package
     """
 
     # Base URL of the Meteostat bulk data interface
-    endpoint: str = "https://bulk.meteostat.net/v2/"
+    endpoint = "https://bulk.meteostat.net/v2/"
+
+    # Proxy URL for the Meteostat (bulk) data interface
+    proxy: Optional[str] = None
 
     # Location of the cache directory
-    cache_dir: str = os.path.expanduser("~") + os.sep + ".meteostat" + os.sep + "cache"
+    cache_dir = os.path.expanduser("~") + os.sep + ".meteostat" + os.sep + "cache"
 
     # Auto clean cache directories?
-    autoclean: bool = True
+    autoclean = True
 
     # Maximum age of a cached file in seconds
-    max_age: int = 24 * 60 * 60
+    max_age = 24 * 60 * 60
 
     # Number of processes used for processing files
-    processes: int = 1
+    processes = 1
 
     # Number of threads used for processing files
-    threads: int = 1
+    threads = 1
```
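Because `proxy` is a class attribute on `Base`, setting it once makes it visible to every interface that inherits from `Base`. A minimal re-creation of that attribute pattern (a sketch, not the installed package; the proxy URL is a placeholder):

```python
from typing import Optional


class Base:
    """Stand-in for meteostat's Base class attributes."""

    endpoint = "https://bulk.meteostat.net/v2/"
    proxy: Optional[str] = None


class Daily(Base):
    """Stand-in subclass; inherits the class-level proxy setting."""


# Set the proxy once on the base class...
Base.proxy = "http://proxy.example.com:3128"  # placeholder URL

# ...and every subclass sees it via attribute lookup.
print(Daily.proxy)
```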

meteostat/interface/daily.py

Lines changed: 44 additions & 31 deletions

```diff
@@ -8,7 +8,7 @@
 The code is licensed under the MIT license.
 """
 
-from datetime import datetime
+from datetime import datetime, timedelta
 from typing import Union
 import pandas as pd
 from meteostat.enumerations.granularity import Granularity
@@ -18,61 +18,68 @@
 
 
 class Daily(TimeSeries):
-
     """
     Retrieve daily weather observations for one or multiple weather stations or
     a single geographical point
     """
 
     # The cache subdirectory
-    cache_subdir: str = "daily"
+    cache_subdir = "daily"
 
     # Granularity
     granularity = Granularity.DAILY
 
+    # Download data as annual chunks
+    # This cannot be changed and is only kept for backward compatibility
+    chunked = True
+
     # Default frequency
-    _freq: str = "1D"
+    _freq = "1D"
+
+    # Source mappings
+    _source_mappings = {
+        "dwd_daily": "A",
+        "eccc_daily": "A",
+        "ghcnd": "B",
+        "dwd_hourly": "C",
+        "eccc_hourly": "C",
+        "isd_lite": "D",
+        "synop": "E",
+        "dwd_poi": "E",
+        "metar": "F",
+        "model": "G",
+        "dwd_mosmix": "G",
+        "metno_forecast": "G",
+    }
 
     # Flag which represents model data
     _model_flag = "G"
 
     # Columns
-    _columns: list = [
-        "date",
-        "tavg",
+    _columns = [
+        "year",
+        "month",
+        "day",
+        {"tavg": "temp"},
         "tmin",
         "tmax",
         "prcp",
-        "snow",
-        "wdir",
+        {"snow": "snwd"},
+        {"wdir": None},
         "wspd",
         "wpgt",
         "pres",
         "tsun",
     ]
 
     # Index of first meteorological column
-    _first_met_col = 1
-
-    # Data types
-    _types: dict = {
-        "tavg": "float64",
-        "tmin": "float64",
-        "tmax": "float64",
-        "prcp": "float64",
-        "snow": "float64",
-        "wdir": "float64",
-        "wspd": "float64",
-        "wpgt": "float64",
-        "pres": "float64",
-        "tsun": "float64",
-    }
+    _first_met_col = 3
 
     # Columns for date parsing
-    _parse_dates: dict = {"time": [0]}
+    _parse_dates = ["year", "month", "day"]
 
     # Default aggregation functions
-    aggregations: dict = {
+    aggregations = {
         "tavg": "mean",
         "tmin": "min",
         "tmax": "max",
@@ -88,12 +95,18 @@ class Daily(TimeSeries):
     def __init__(
         self,
         loc: Union[pd.DataFrame, Point, list, str],  # Station(s) or geo point
-        start: datetime = None,
-        end: datetime = None,
-        model: bool = True,  # Include model data?
-        flags: bool = False,  # Load source flags?
+        start=datetime(1781, 1, 1, 0, 0, 0),
+        end=datetime.combine(
+            datetime.today().date() + timedelta(days=10), datetime.max.time()
+        ),
+        model=True,  # Include model data?
+        flags=False,  # Load source flags?
     ) -> None:
-
+        # Extract relevant years
+        if self.chunked:
+            self._annual_steps = [
+                start.year + i for i in range(end.year - start.year + 1)
+            ]
         # Initialize time series
         self._init_time_series(loc, start, end, model, flags)
```
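The annual chunking added to `__init__` reduces to one list comprehension: one entry per calendar year covered by the requested range, inclusive of both endpoints. Extracted as a standalone sketch with example dates:

```python
from datetime import datetime

# One download chunk per calendar year between start and end (inclusive),
# mirroring the _annual_steps computation above.
start = datetime(2018, 3, 1)
end = datetime(2021, 10, 31)
annual_steps = [start.year + i for i in range(end.year - start.year + 1)]
print(annual_steps)  # [2018, 2019, 2020, 2021]
```

Note that partial years still produce a full chunk: a range starting in March 2018 pulls the whole 2018 file and filters afterwards.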
