Simplemma changelog

Simple multilingual lemmatizer for Python, especially useful for speed and efficiency


Changelog History

  • v0.5.0 Changes

    • faster, more efficient code
    • ⬇️ dropped support for Python 3.5
  • v0.4.0 Changes

    • 🆕 new languages: Armenian, Greek, Macedonian, Norwegian (Bokmål), and Polish
    • language data reviewed for: Dutch, Finnish, German, Hungarian, Latin, Russian, and Swedish
    • 🚚 Urdu removed from the language list due to issues with the data
    • ➕ add support for Python 3.10 and drop support for Python 3.4
    • 👌 improved decomposition and tokenization algorithms
  • v0.3.0 Changes

    • 👌 improved models and disambiguation
    • 👌 improved tokenization
    • extended rules for German
  • v0.2.2 Changes

    • Work on decomposition rules
    • Reviewed language data
    • Cleaner code
  • v0.2.1 Changes

    • πŸ‘ Better decomposition into subwords by greedy algorithm
    • First benchmarks and data-based corrections: German, French, English, Spanish
  • v0.2.0 Changes

    • Languages added: Danish, Dutch, Finnish, Georgian, Indonesian, Latin, Latvian, Lithuanian, Luxembourgish, Turkish, Urdu
    • 👌 Improved word pair coverage
    • Tokenization functions added
    • Limit greediness and range of potential candidates
  • v0.1.0 Changes

    • 🚀 First release on PyPI
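
The v0.2.1 entry mentions decomposition into subwords by a greedy algorithm. The changelog does not describe how Simplemma implements this, so the following is only a minimal sketch of the general idea: repeatedly take the longest known prefix of the remaining string. The function name `greedy_decompose`, the `lexicon` set, and the `min_len` parameter are hypothetical and are not part of Simplemma's API.

```python
def greedy_decompose(word, lexicon, min_len=3):
    """Split `word` into known parts, longest prefix first.

    Illustrative only: `lexicon` is a hypothetical set of known word forms,
    and this is not Simplemma's actual implementation.
    Returns a list of parts, or None if the word cannot be fully covered.
    """
    word = word.lower()
    parts = []
    start = 0
    while start < len(word):
        # Try the longest remaining candidate first, then shrink it.
        for end in range(len(word), start + min_len - 1, -1):
            if word[start:end] in lexicon:
                parts.append(word[start:end])
                start = end
                break
        else:
            # No known part of at least min_len characters starts here.
            return None
    return parts


# Hypothetical toy lexicon for a German compound.
lexicon = {"haus", "tür", "schlüssel"}
parts = greedy_decompose("Haustürschlüssel", lexicon)
unknown = greedy_decompose("xyz", lexicon)
```

Because the sketch never backtracks, an early long match can block a split that a different segmentation would have found. Restricting candidates, as the v0.2.0 entry "Limit greediness and range of potential candidates" suggests, is one way to trade coverage against false splits.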