Smith Utils is a central hub for data cleaning and parsing scripts. This package consolidates distributed utility functions to improve code reuse and maintenance efficiency across all yeiichi projects.
Robust date parsing and formatting.
ensure_date: Flexible conversion of strings,datetime.dateobjects, orNone(returns today) into adateobject.parse_strict_date: Strict parsing forYYYYMMDDorYYYY-MM-DDformats, rejecting ambiguous inputs.format_ordinal: Converts integers to ordinal strings (e.g.,1→"1st",22→"22nd").
Clean and parse messy numeric data.
parse_numeric_value: Handles custom separators, decimals, and negative formats like(1,234.56).parse_currency_value: Alias for numeric parsing, specifically for currency strings.
Standardize text and compare string similarity.
normalize_text: Unicode NFKC normalization, case folding, and whitespace handling.StringDistance: Implementation of Damerau-Levenshtein and Jaro-Winkler algorithms for fuzzy matching.
Install via pip:
pip install smith-utilsfrom smith_utils import ensure_date, parse_numeric_value, normalize_text
# Datetime
date = ensure_date("20231225") # datetime.date(2023, 12, 25)
# Numeric
value = parse_numeric_value("(1,250.50)") # -1250.5
# Text
clean_text = normalize_text(" Smith Utils ") # "smith utils"src/smith_utils/: Main package source.legacy/: Legacy scripts and templates (not included in distribution).tests/: Comprehensive test suite.
This project is licensed under the MIT License - see the LICENSE file for details.