URLs - GIDR.ai

The URLs data source allows you to ingest content from web pages or knowledge bases. Unlike traditional crawlers, the system operates via real-time search:

Scoped Search & Scrape: It searches for relevant pages scoped to the provided URL (and its sub-paths). This creates a controlled boundary, ensuring content is only ingested from the specified enterprise source.
Caching: Scraped content is saved to a local cache with a configurable expiry.
Retrieval: When a query is made, the system first checks the local cache. If the content is missing or expired, it searches the internet (within the defined scope) and scrapes fresh content in parallel.

Connect Data source

Select Data Sources > + Add.
Choose URL’s as the source type.
Enter the starting URL (e.g., https://dashboard.intelligrated.com/knowledgebase).
(Optional) Set an Expire cache in value if you want the system to periodically refresh the content.