llmtuner.infogetter.arxiv
=========================

.. py:module:: llmtuner.infogetter.arxiv

.. autoapi-nested-parse::

   Get a list of papers from arXiv and parse them into the local database.

   .. !! processed by numpydoc !!


Classes
-------

.. autoapisummary::

   llmtuner.infogetter.arxiv.ArxivTalker


Module Contents
---------------

.. py:class:: ArxivTalker(config='')

   Bases: :py:obj:`llmtuner.infogetter.infobase.InfoBase`

   Class to retrieve entry information from arXiv.

   .. !! processed by numpydoc !!

   .. py:method:: _get_entry(content)
      :staticmethod:


   .. py:method:: list_entries(query, nentries=100, write_to_index=False)

      Get a list of arXiv entries matching a query term, e.g. ``'ANTARES'``.
      The number of entries returned can be limited with ``nentries``.
      If an ``LLMTunerConfig`` object is provided to the talker, the results
      are written to the database.

      .. !! processed by numpydoc !!


   .. py:method:: write_to_index(entry)

      Match an entry from arXiv against the database's list of entries.

      .. !! processed by numpydoc !!


   .. py:method:: download(docid)

      Download a single entry so that it can be embedded in the vector
      database.

      .. !! processed by numpydoc !!
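
The query-then-parse flow described for ``list_entries`` can be illustrated
with a minimal, standalone sketch. It is not the implementation of
``ArxivTalker``: it assumes the public arXiv Atom API at
``http://export.arxiv.org/api/query``, and the helper names
``build_query_url`` and ``parse_entries`` as well as the sample feed are
hypothetical placeholders chosen for illustration. The sketch does no network
access and only parses an inline feed.

.. code-block:: python

   import urllib.parse
   import xml.etree.ElementTree as ET

   # Assumption: the public arXiv API endpoint and the standard Atom namespace.
   API_URL = "http://export.arxiv.org/api/query"
   ATOM = "{http://www.w3.org/2005/Atom}"

   # Placeholder feed for offline illustration; not real arXiv data.
   SAMPLE_FEED = """<feed xmlns="http://www.w3.org/2005/Atom">
     <entry>
       <id>http://arxiv.org/abs/0000.00000v1</id>
       <title>Placeholder title
         for illustration</title>
       <author><name>A. Author</name></author>
     </entry>
   </feed>"""


   def build_query_url(query, nentries=100):
       """Build a query URL for a free-text term, capped at nentries results."""
       params = {
           "search_query": f"all:{query}",
           "start": 0,
           "max_results": nentries,
       }
       return API_URL + "?" + urllib.parse.urlencode(params)


   def parse_entries(atom_xml):
       """Turn an Atom feed string into a list of dicts, one per entry."""
       root = ET.fromstring(atom_xml)
       entries = []
       for node in root.iter(ATOM + "entry"):
           title = node.findtext(ATOM + "title", default="")
           entries.append({
               "id": node.findtext(ATOM + "id", default="").strip(),
               # Collapse line breaks and repeated spaces inside the title.
               "title": " ".join(title.split()),
               "authors": [
                   author.findtext(ATOM + "name", default="").strip()
                   for author in node.findall(ATOM + "author")
               ],
           })
       return entries


   url = build_query_url("ANTARES", nentries=5)
   entries = parse_entries(SAMPLE_FEED)

With the package installed, the corresponding call would presumably be
``ArxivTalker().list_entries('ANTARES', nentries=5)``. Its return type and the
exact effect of ``write_to_index=True`` are not documented on this page.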