Web URL Crawling


Summary: The Web URL Crawling feature in Antimanual lets your AI learn from public web pages. It fetches each page you submit and filters out clutter such as menus and ads, so your assistant works from the main content and can give more accurate answers and better customer support.

Web URL Crawling reads public web pages, such as guides, news, and other helpful resources, and adds their content to your Knowledge Base so your AI can provide more accurate answers.

Feature Overview

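This Pro version feature lets your AI draw on content from public web pages. Instead of copying text by hand, you submit a URL and the system fetches and processes the page for you. Each page is saved as it appears when you submit it, so add the URL again whenever the source changes to keep your AI assistant up to date.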

This is a great tool if you store guides on multiple platforms or want your AI to learn from industry experts. By adding URLs, your AI can:

Check facts quickly.
Provide better customer support.
Show a deeper understanding of your specific industry.

Adding a Web URL

Adding a new link is simple:

Go to the Knowledge Base: Open your Antimanual dashboard and click the “Website URL” tab.
Enter the URL: Paste the public web link you want to add. (Note: The page must not require a login.)
Submit: Click the “Submit” button. The system fetches the page and processes its content for you.

How We Extract Data

Web pages often contain “noise” like menus, sidebars, and ads. Antimanual uses smart filters to ignore this clutter and save only the main content.

Smart Detection: The system finds the main article or content area.
Noise Removal: It automatically ignores headers, cookie banners, and social buttons.
Content Cleaning: It strips leftover markup and scripts so the AI receives clear, readable text.

This cleaning process ensures your AI focuses only on the helpful information, not on menu links like “About Us.”
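To illustrate the idea, here is a minimal Python sketch of noise removal. It is not Antimanual's actual implementation: the `NOISE_TAGS` list, the `MainTextExtractor` class, and the `extract_main_text` function are hypothetical names chosen for this example, and a real filter is likely more sophisticated. The sketch simply skips text inside tags that usually hold menus, footers, and scripts, and keeps everything else.

```python
from html.parser import HTMLParser

# Assumption: tags treated as "noise" in this sketch. The filter list that
# Antimanual really uses is not documented here.
NOISE_TAGS = {"nav", "header", "footer", "aside", "script", "style", "form", "noscript"}


class MainTextExtractor(HTMLParser):
    """Collects visible text while skipping anything inside a noise tag."""

    def __init__(self):
        super().__init__()
        self._skip_depth = 0  # how many noise tags we are currently inside
        self._chunks = []

    def handle_starttag(self, tag, attrs):
        if tag in NOISE_TAGS:
            self._skip_depth += 1

    def handle_endtag(self, tag):
        if tag in NOISE_TAGS and self._skip_depth > 0:
            self._skip_depth -= 1

    def handle_data(self, data):
        text = data.strip()
        # Keep non-empty text only when we are outside every noise tag.
        if text and self._skip_depth == 0:
            self._chunks.append(text)

    def text(self):
        return "\n".join(self._chunks)


def extract_main_text(html):
    parser = MainTextExtractor()
    parser.feed(html)
    parser.close()
    return parser.text()


sample = """
<html><body>
<nav><a href="/about">About Us</a></nav>
<article><h1>Install Guide</h1><p>Run the installer.</p></article>
<footer>Cookie settings</footer>
</body></html>
"""
print(extract_main_text(sample))
```

For the sample page, the menu link and the footer text are dropped, and only the article heading and paragraph remain.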

Managing Your Links

You can view all your saved URLs in a table to track what information your AI uses. For each link, you will see:

URL Source: The original link.
AI Model: The engine used to process the data.
Actions: A simple delete button to remove old URLs from your Knowledge Base.

Frequently Asked Questions

Can I crawl pages that require a login?

No. The crawler only works with public web pages. It cannot access pages behind login screens or paywalls.
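If you want to check a link before submitting it, the following Python sketch shows one possible pre-check. It is an illustration only and says nothing about how the Antimanual crawler works internally: the function names (`is_http_url`, `classify_status`, `check_url`) and the result labels are assumptions made for this example. Note that a page can return a successful status and still show a login form or paywall, so treat a "public" result as a hint, not a guarantee.

```python
import urllib.request
import urllib.error
from urllib.parse import urlparse


def is_http_url(url):
    """Return True for well-formed http(s) URLs with a host name."""
    parts = urlparse(url)
    return parts.scheme in ("http", "https") and bool(parts.netloc)


def classify_status(status):
    """Map an HTTP status code to a rough label (labels are hypothetical)."""
    if status in (401, 403):
        return "restricted"  # the server asks for credentials or denies access
    if 200 <= status < 300:
        return "public"
    return "unavailable"


def check_url(url, timeout=10):
    """Send a HEAD request and report whether the page looks publicly reachable."""
    if not is_http_url(url):
        return "invalid"
    request = urllib.request.Request(url, method="HEAD")
    try:
        with urllib.request.urlopen(request, timeout=timeout) as response:
            return classify_status(response.status)
    except urllib.error.HTTPError as err:
        # urllib raises HTTPError for 4xx and 5xx responses.
        return classify_status(err.code)
    except OSError:
        # Covers DNS failures, refused connections, and timeouts.
        return "unavailable"
```

Calling `check_url` with a link of your own returns one of "public", "restricted", "unavailable", or "invalid". Some servers reject HEAD requests, so an "unavailable" result may warrant a manual check in the browser.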

How often is the content updated?

Content is not refreshed automatically. The system saves the page content as it appears at the moment you submit the URL. If the original page changes, you must delete the old entry and add the URL again to refresh the data.
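One way to decide when a saved URL needs to be added again is to keep a fingerprint of the page text yourself and compare it later. The Python sketch below is a suggestion, not an Antimanual feature: `content_fingerprint` and `needs_refresh` are hypothetical helpers, and how you obtain the current page text is up to you.

```python
import hashlib


def content_fingerprint(text):
    """Hash the text after collapsing whitespace, so layout-only changes are ignored."""
    normalized = " ".join(text.split())
    return hashlib.sha256(normalized.encode("utf-8")).hexdigest()


def needs_refresh(saved_fingerprint, current_text):
    """Return True when the current text differs from what was fingerprinted earlier."""
    return content_fingerprint(current_text) != saved_fingerprint


# Record a fingerprint when you submit the URL...
saved = content_fingerprint("Install  Guide\nVersion 1")

# ...and compare against the page text at a later date.
print(needs_refresh(saved, "Install Guide Version 1"))
print(needs_refresh(saved, "Install Guide Version 2"))
```

The first comparison differs only in whitespace and reports no change. The second has different wording and signals that the entry should be deleted and the URL added again.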

Is there a limit to how many URLs I can add?

Yes, limits depend on your Pro license plan and your AI provider’s storage guidelines.
