Code for cleaning up ISBNs.
This module contains two families of functions:
- The simple character-cleaning functions clean_isbn_chars and clean_asin_chars.
- The full multi-ISBN parser parse_isbn_string.
When a string is a relatively well-formed ISBN (or ASIN), the character-cleaning functions are sufficient. Some sources, however, such as the Library of Congress, have messy ISBN fields that may contain multiple ISBNs in one string, descriptive tags, and other noise. The multi-ISBN parser exposed through parse_isbn_string cleans these strings using a PEG-based parser.
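The difference between the two families can be illustrated with a minimal sketch. The function names, signatures, and behavior below are hypothetical and are not this module's API: sketch_clean_isbn_chars assumes that crude cleanup means keeping only digits and the check character X, and sketch_find_isbns is a naive token scan standing in for the PEG-based parser, which it does not reproduce.

```rust
/// Hypothetical crude cleaner (not the module's clean_isbn_chars):
/// keep ASCII digits and the check character X (uppercased), drop the rest.
fn sketch_clean_isbn_chars(raw: &str) -> String {
    raw.chars()
        .filter_map(|c| match c {
            '0'..='9' => Some(c),
            'x' | 'X' => Some('X'),
            _ => None,
        })
        .collect()
}

/// Hypothetical naive multi-ISBN scan (NOT a PEG parser): split on
/// whitespace and a few common separators, clean each token, and keep
/// only tokens whose cleaned length is 10 or 13.
fn sketch_find_isbns(raw: &str) -> Vec<String> {
    raw.split(|c: char| c.is_whitespace() || matches!(c, ';' | ',' | '(' | ')' | ':'))
        .map(sketch_clean_isbn_chars)
        .filter(|t| t.len() == 10 || t.len() == 13)
        .collect()
}
```

Under these assumptions, a well-formed input such as "0-306-40615-2" only needs the character cleaner, while a string such as "0306406152 (pbk.) ; 9780306406157 (hardcover)" needs the scan to separate the two ISBNs from their descriptive tags. The sketch does no checksum validation and would miss many messy cases that a grammar-based parser can handle.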
Modules§
Structs§
Enums§
- ParseResult - Result of parsing an ISBN string.
Statics§
- IGNORES 🔒 - Regular expressions for unparsable ISBN strings to ignore. This cleans up warning displays.
Functions§
- clean_asin_chars - Crude ASIN cleanup.
- clean_isbn_chars - Crude ISBN cleanup.
- parse_isbn_string - Parse an ISBN string.