Changelog
v01.0.3 - modified item mapping dict regex to be more robust. - modified Monitor().monitor_submissions() to have a start_date parameter.
v01.0.2 2025-02-06 - modified .parse() to return dictionary in previous package format.
v01.0.0 2025-02-04 - reworked parsing into doc2dict. - reworked sgml parsing into secsgml - reworked datasetbuilder into txt2dataset - fixed some downloader issues - made all xmls parseable - generalized documents iterablity - may break some work flows. Sorry - modified Downloader() to be more specific. downloading 10-K would also download 10-K/A.
v.433 (2025-01-21) - added deduplication to Premium Downloader
v.428 (2025-01-03) - added option to set download source via Config class - added option to search by document type and by containing text to Portfolio class - added download option to Portfolio class. Now you don’t need to import Downloader class to download submissions
v0.418 (2024-12-17) - improved imports
v0.417 (2024-12-17) - Fixed bug preventing installation on linux/macOS.
v0.400 (2024-12-16)
Major update. Reworked/removed some functions to improve performance and usability.
If functionality removed was critical to your workflow, please post an issue on GitHub. Apologies.
Reworked downloader to use .sgml submission files. This grabs the full submission, including attachments and metadata.
Added premium downloader for faster downloads. This is a paid feature.
Changed to Submission/Document terminology to be more inline with SEC taxonomy.
specifying document type is no longer needed for parsing, as it is now inferred from the submission.
reworked downloader.watch to monitor.monitor, and made callback functions better.
reworked updating package data to packageupdater class
added fast sgml parsing using cython
this is an early release, so please report any issues on GitHub.
v0.382 (2024-11-19)
Fixed issue where SEC recorded primary doc URL as 0001.extension, while the true location was acc no - 0001. Example: https://www.sec.gov/Archives/edgar/data/1102262/000110226201000003/0001102262-01-000003-0001.txt.
v0.379 (2024-11-18)
Added metadata.
Moved return URLs to metadata.
v0.377 (2024-11-01)
Fixed SC 13G Item 10 being detected as Item 1.
v0.376 (2024-11-01)
Added parsing and iterable objects for forms: 3, 13F-HR, NPORT-P, SC 13D, SC 13G, 10-Q, 10-K, 8-K, D.
v0.374 (2024-10-29)
Added filing.write_json and filing.write_csv.
v0.373 (2024-10-29)
Made parsed filing structure for 8-K, 10-K, 10-Q more intuitive.
v0.372 (2024-10-29)
Reduced package import time to ~400 milliseconds. Further optimization planned for dependency cleanup.
v0.368 (2024-10-28)
Improved parsing robustness for 10-K, 10-Q, and 8-K by centralizing helper scripts for loading file content and cleaning titles.
v0.364 (2024-10-28)
Added parsing support for 10-K and 10-Q.
v0.363 (2024-10-26)
Added dataset_builder.
Enhanced 8-K parsing robustness.
v0.357 (2024-10-24)
Improved access to package data.
v0.356 (2024-10-23)
Added parsing support for 13F-HR information table and 8-K.
v0.355 (2024-10-23)
Added setuptools to package for handling edge cases.
v0.352 (2024-10-21)
Switched 10-K dataset download source to Dropbox from Zenodo.
v0.351 (2024-10-18)
Added download options for attachments by file type and item type.
Introduced up-to-date dataset for 13F-HR information tables.
v0.350 (2024-10-17)
Added bulk download functionality for 10-K.
v0.343 (2024-10-16)
Added bulk download functionality for 10-Q.
v0.342 (2024-10-16)
Introduced callback function option for downloader.watch().
v0.341 (2024-10-15)
Added company metadata datasets, including SIC codes, former names, and more.
v0.337 (2024-10-14)
Added filtering options by SICs and items to downloader.
Included FTD dataset in download_dataset.
v0.335 (2024-10-13)
Added prefill option for MuleBot server.
v0.334 (2024-10-13)
Added links to GitHub and website for chatbot.
v0.333 (2024-10-13)
Simplified MuleBot server UI.
Refactored MuleBot server into multiple modules.
v0.332 (2024-10-05)
Modified table parser to output parsed tables in list format.
v0.330 (2024-10-03)
Improved downloader robustness.
Introduced set_limiter for precise control.
Added dataset of all 10-Ks since 2001.
v0.323 (2024-09-27)
Added MuleBot.
Reworked Filing Viewer.
v0.314 (2024-09-26)
Added TableParser.
v0.312 (2024-09-20)
Introduced download_company_concepts.
v0.311 (2024-09-19)
Added basic MuleBot tool calling and interface.
v0.302 (2024-09-18)
Re-added output directory option to download functionality.
v0.301 (2024-09-18)
Fixed Jupyter Notebook package data issue.
v0.29 (2024-09-18)
Major overhaul: - Removed need to download or construct indices. - Expanded scope to all SEC filings since 2001, including companies without tickers and individuals. - Moved Indexer().watch() to downloader. - Temporarily removed filtering by company name due to exact matching issues.
v0.26 (2024-09-16)
Introduced indexer.watch(interval, cik, form) for EDGAR monitoring.
v0.25 (2024-09-16)
Added human_readable option to download and download_using_api.
Earlier Updates
2024-09-15: Fixed issue where filings download would overwrite each other.
2024-09-14: Added parser API support.
2024-09-13: Introduced download_datasets and index download options.
2024-09-09: Added download_using_api (no indices required).
2024-09-08: Integrated with datamule’s SEC Router API.
2024-09-07: Simplified indices approach, switched to Polar for faster index loading.