EduX Documentation home page
Search...
⌘K
danswer-ai/danswer
danswer-ai/danswer
Search...
Navigation
Connectors
Web Connector
Documentation
Slack
Discord
Welcome to EduX
Introduction
Quickstart
Resourcing
Slack Bot Setup
Gen AI Configs
Configuring Danswer
System Overview
Contact Us
Security
Deploy with AWS
Deploy on GCP
Deploy on Azure
Deploy on Digital Ocean
Auth
Basic Auth Setup
Google OAuth Setup
OIDC/SAML Setup
Connectors
Connector Overview
Web Connector
File Connector
Slack Connector
GitHub Connector
GitLab Connector
Confluence Connector
Jira Connector
Google Drive Connector
Gmail Connector
Notion Connector
Zendesk Connector
Microsoft Sharepoint Connector
Salesforce Connector
Teams Connector
Gong Connector
Linear Connector
BookStack Connector
Document360 Connector
Request Tracker Connector
Slab Connector
Guru Connector
Productboard Connector
HubSpot Connector
Zulip Connector
Google Sites Connector
Dropbox Connector (Beta)
Discourse Connector
ClickUp Connector
Backend APIs
Ingestion API
Cloud APIs
POST
Answer with Quote
POST
Answer with Citations
More
Telemetry
Additional Options
On this page
How it works
Setting up
Authorization
Indexing
Connectors
Web Connector
Access knowledge from Web Pages
How it works
The Web Connector scrapes sites based on a base URL.
It only indexes files from the same domain and containing the same base path.
It will index pages reachable via hyperlinks from the base URL.
The text contents are cleaned up via some heuristics and some metadata such as the page Title is extracted.
Setting up
Authorization
As long as the page is reachable, no additional authorization is necessary.
Indexing
Navigate to the Admin Dashboard and select the
Web
Connector.
Input the base URL to index and click on Index.
To see the status of the indexing, visit the Connectors Status page (top left).
Connector Overview
File Connector
Assistant
Responses are generated using AI and may contain mistakes.