Configure Danswer to use HuggingFace APIs
Pro Account
to get an API Key
Pro Account
, go to your user settings:User Access Token
Llama-2-70B
via next-token generation promptingGEN_AI_MODEL_VERSION
(not GEN_AI_API_ENDPOINT
),
this is a quirk from the LiteLLM library.
Update (November 2023): HuggingFace has stopped supporting very large models (>10GB) via the Pro Plan.
The latest options are to rent dedicated hardware from them via Inference Endpoint or get an Enterprise Plan.
The Pro Plan still supports smaller models but these produce worse results for Danswer.