Welcome to doc2dict¶ doc2dict is a package to quickly parse documents in pdf, html, xml, and txt formats. It supports the datamule project. Package is in early development