llmtuner.infogetter.arxiv

Get list of papers from arxiv and parse to local db

Classes

ArxivTalker

Class to retrieve entry information from ArXiv.

Module Contents

class llmtuner.infogetter.arxiv.ArxivTalker(config='')[source]

Bases: llmtuner.infogetter.infobase.InfoBase

Class to retrieve entry information from ArXiv.

static _get_entry(content)[source]
list_entries(query, nentries=100, write_to_index=False)[source]

Get a list of arxiv entries according to a query term, e.g. ‘ANTARES’. Can limit number of entries. If you provide a LLMTunerConfig object to the talker, the results will be written to the database.

write_to_index(entry)[source]

Function to match entries from arxiv to database list of entries

download(docid)[source]

Function for download of a single entry to embed in the vector database