main-2025-01-29-10-35-28: fix repetitive pattern extraction (#109)
* fix repetitive pattern extraction #108
* add --ignore_extraction_boundary #109
* Update README.md
* adding tests
* fix subdomain-hostname so that it must not have a tld
* fix items that share a boundary with the file (at its very start or end) not being extracted, by adding \n at the start and end of the file before splitting
* docs updates

---------

Co-authored-by: David G <[email protected]>
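
The file-boundary fix can be illustrated with a minimal sketch. This is not the project's actual code: the pattern `BOUNDED_IPV4` and the function `extract_bounded` are hypothetical names, and the sketch assumes that extraction requires a whitespace character on both sides of a token. Under that assumption, a token at the very start or end of the file has no boundary character on one side and is missed unless the text is padded with `\n` first.

```python
import re

# Hypothetical boundary-aware pattern (an assumption for illustration):
# an IPv4-like token must have whitespace immediately before and after it.
BOUNDED_IPV4 = re.compile(r"(?<=\s)(\d{1,3}(?:\.\d{1,3}){3})(?=\s)")


def extract_bounded(text: str, pad: bool = True) -> list[str]:
    """Return IPv4-like tokens delimited by whitespace on both sides."""
    if pad:
        # Pad with newlines so the first and last tokens in the file
        # also have a boundary character next to them.
        text = "\n" + text + "\n"
    return BOUNDED_IPV4.findall(text)


sample = "1.1.1.1 was seen talking to 8.8.8.8"
unpadded = extract_bounded(sample, pad=False)  # tokens at the file edges are missed
padded = extract_bounded(sample, pad=True)     # both edge tokens are found
```

Padding the input keeps the boundary rule itself unchanged, so tokens in the middle of the file are matched exactly as before.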