agatha.construct.document_parsers.parse_pubmed_xml module

agatha.construct.document_parsers.parse_pubmed_xml.parse_zipped_pubmed_xml(xml_path)

Copies the given zipped XML file to local scratch, then parses it and returns its articles as a list of dicts.

Return type

List[Dict[str, Any]]
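
The copy-then-parse flow described above can be sketched roughly as follows. This is a minimal illustration, not the agatha implementation: it assumes the input is gzip-compressed (as PubMed bulk files usually are), and the function name `sketch_parse_zipped_xml`, the `scratch_dir` parameter, and the single `pmid` field are all hypothetical.

```python
import gzip
import shutil
import tempfile
import xml.etree.ElementTree as ET
from pathlib import Path
from typing import Any, Dict, List

# Tiny stand-in for a PubMed bulk file (assumed layout for illustration).
SAMPLE_XML = (
    "<PubmedArticleSet>"
    "<PubmedArticle><MedlineCitation><PMID>1</PMID></MedlineCitation></PubmedArticle>"
    "<PubmedArticle><MedlineCitation><PMID>2</PMID></MedlineCitation></PubmedArticle>"
    "</PubmedArticleSet>"
)


def sketch_parse_zipped_xml(xml_path: Path, scratch_dir: Path) -> List[Dict[str, Any]]:
    """Hypothetical stand-in: copy to scratch, then parse each article."""
    scratch_dir.mkdir(parents=True, exist_ok=True)
    local_path = scratch_dir / xml_path.name
    shutil.copy(xml_path, local_path)  # work on the local copy only
    with gzip.open(local_path, "rb") as xml_file:
        root = ET.parse(xml_file).getroot()
    # One dict per PubmedArticle element; only the PMID is extracted here.
    return [
        {"pmid": article.findtext("MedlineCitation/PMID")}
        for article in root.iter("PubmedArticle")
    ]


# Usage: write a gzipped sample, then parse it through the scratch copy.
with tempfile.TemporaryDirectory() as tmp:
    tmp_path = Path(tmp)
    source = tmp_path / "sample.xml.gz"
    with gzip.open(source, "wb") as out:
        out.write(SAMPLE_XML.encode("utf-8"))
    records = sketch_parse_zipped_xml(source, tmp_path / "scratch")
```

Copying first keeps repeated reads off a shared filesystem, which is presumably why the documented function uses local scratch.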

agatha.construct.document_parsers.parse_pubmed_xml.pubmed_xml_to_record(pubmed_elem)

Given a PubmedArticle element, parses out the fields of interest and returns them as a dictionary.

Return type

Dict[str, Any]
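
As an illustration of turning one PubmedArticle element into a dictionary, here is a minimal sketch. The element paths follow the public PubMed XML layout, but the function name `sketch_pubmed_record` and the choice of output keys (`pmid`, `title`, `abstract`) are assumptions; the fields agatha actually extracts are not listed in this reference.

```python
import xml.etree.ElementTree as ET
from typing import Any, Dict

SAMPLE_ARTICLE = """<PubmedArticle>
  <MedlineCitation>
    <PMID>12345</PMID>
    <Article>
      <ArticleTitle>A sample title.</ArticleTitle>
      <Abstract>
        <AbstractText Label="BACKGROUND">First part.</AbstractText>
        <AbstractText Label="RESULTS">Second <i>part</i>.</AbstractText>
      </Abstract>
    </Article>
  </MedlineCitation>
</PubmedArticle>"""


def sketch_pubmed_record(pubmed_elem: ET.Element) -> Dict[str, Any]:
    """Hypothetical stand-in: pull a few fields out of a PubmedArticle."""
    # AbstractText may contain inline markup, so join all nested text.
    abstract_parts = [
        "".join(node.itertext()).strip()
        for node in pubmed_elem.iterfind(
            "MedlineCitation/Article/Abstract/AbstractText"
        )
    ]
    return {
        # findtext returns None when the element is missing.
        "pmid": pubmed_elem.findtext("MedlineCitation/PMID"),
        "title": pubmed_elem.findtext("MedlineCitation/Article/ArticleTitle"),
        "abstract": " ".join(abstract_parts),
    }


record = sketch_pubmed_record(ET.fromstring(SAMPLE_ARTICLE))
```

Missing elements yield `None` (or an empty abstract) rather than raising, which suits bulk data where not every article carries every field.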

agatha.construct.document_parsers.parse_pubmed_xml.xml_obj_to_date(elem)

Return type

str
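
This reference gives no description for xml_obj_to_date, so the following is only a guess at the kind of conversion involved. It assumes the element has Year, Month, and Day children, as PubMed date elements do, and that the result is an ISO-style `YYYY-MM-DD` string. The function name `sketch_date_string`, the output format, and the defaults for missing parts are all hypothetical.

```python
import xml.etree.ElementTree as ET

# PubMed months appear either as numbers ("03") or abbreviations ("Mar").
_MONTHS = {
    name: index
    for index, name in enumerate(
        ["jan", "feb", "mar", "apr", "may", "jun",
         "jul", "aug", "sep", "oct", "nov", "dec"],
        start=1,
    )
}


def sketch_date_string(elem: ET.Element) -> str:
    """Hypothetical stand-in: format Year/Month/Day children as YYYY-MM-DD."""
    year = elem.findtext("Year")
    if not year:
        return ""  # assumption: no year means no usable date
    # Assumption: a missing month or day defaults to 1.
    month_text = (elem.findtext("Month") or "1").strip()
    day_text = (elem.findtext("Day") or "1").strip()
    month = _MONTHS.get(month_text[:3].lower()) or int(month_text)
    return f"{int(year):04d}-{month:02d}-{int(day_text):02d}"


full_date = sketch_date_string(
    ET.fromstring("<PubDate><Year>2019</Year><Month>Mar</Month><Day>7</Day></PubDate>")
)
year_only = sketch_date_string(ET.fromstring("<PubDate><Year>2019</Year></PubDate>"))
```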