ansys.tools.meilisearch.scraper
Module for scraping web pages.
Classes
WebScraper | Provides for scraping web pages and checking if responses are successful.
Functions
get_temp_file_name | Get the name of the temporary file, which has a .txt extension.
Module Contents
- ansys.tools.meilisearch.scraper.get_temp_file_name(ext='.txt')
Get the name of the temporary file, which has a .txt extension.
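The page does not show how the temporary file name is produced. As a rough illustration only, the sketch below uses the standard-library tempfile module to return a file name that ends in the requested extension. The name temp_file_name_sketch is hypothetical and is not part of ansys.tools.meilisearch; the library's own helper may behave differently, for example by not creating the file.

```python
import os
import tempfile


def temp_file_name_sketch(ext: str = ".txt") -> str:
    """Return the path of a new, empty temporary file ending in ``ext``.

    Hypothetical stand-in for illustration; not the library's implementation.
    """
    # mkstemp creates the file and returns an open OS-level descriptor and its path.
    fd, name = tempfile.mkstemp(suffix=ext)
    # Close the descriptor so the caller can reopen the file by name.
    os.close(fd)
    return name
```

The caller is responsible for deleting the file when it is no longer needed, for example with os.remove(name).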
- class ansys.tools.meilisearch.scraper.WebScraper(meilisearch_host_url=None, meilisearch_api_key=None)
Bases: ansys.tools.meilisearch.client.BaseClient
Provides for scraping web pages and checking if responses are successful.
- Parameters:
- scrape_url(url, index_uid, template=None, stop_urls=None, verbose=False)
Scrape a URL for a web page using the active Meilisearch host.
This method generates a single unique name for a single URL.
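As a hedged usage sketch, the helper below calls scrape_url once for each page, following the documented argument order (url, then index_uid). The helper name scrape_pages and the pages mapping are assumptions made for illustration. Because this page does not document what scrape_url returns, the return values are stored unchanged.

```python
from typing import Any, Dict


def scrape_pages(scraper: Any, pages: Dict[str, str], verbose: bool = False) -> Dict[str, Any]:
    """Call ``scraper.scrape_url`` once per (index_uid, url) pair.

    ``scraper`` is assumed to expose the documented ``scrape_url`` method,
    for example a WebScraper instance. ``pages`` maps index UIDs to URLs.
    """
    results: Dict[str, Any] = {}
    for index_uid, url in pages.items():
        # Positional order follows the documented signature: url, then index_uid.
        results[index_uid] = scraper.scrape_url(url, index_uid, verbose=verbose)
    return results
```

With the real class, the scraper argument would be created as WebScraper(meilisearch_host_url=..., meilisearch_api_key=...), where both values depend on the deployment.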
- scrape_from_directory(path, template=None, verbose=False)
Scrape the URLs for all web pages in a directory using the active Meilisearch host.
This method generates a unique index identifier for each URL in the directory.
- Parameters:
- Returns:
dict
Dictionary where keys are unique IDs of indexes and values are the number of hits for each URL.
- Raises:
FileNotFoundError
If the specified path does not exist.
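The documented return value and exception can be handled as in the following minimal sketch. The helper name rank_directory_hits is hypothetical. It assumes only what is stated above: scrape_from_directory returns a dictionary mapping index IDs to hit counts and raises FileNotFoundError when the path does not exist.

```python
from typing import Any, List, Tuple


def rank_directory_hits(scraper: Any, path: str) -> List[Tuple[str, int]]:
    """Scrape ``path`` and return (index_uid, hits) pairs, highest hit count first.

    ``scraper`` is assumed to expose the documented ``scrape_from_directory``
    method, for example a WebScraper instance.
    """
    try:
        hits_by_index = scraper.scrape_from_directory(path)
    except FileNotFoundError:
        # Documented failure mode: the specified path does not exist.
        return []
    # Sort by hit count in descending order.
    return sorted(hits_by_index.items(), key=lambda item: item[1], reverse=True)
```

Returning an empty list for a missing directory is a design choice of this sketch; callers that need to tell a missing path apart from an empty result should let the exception propagate instead.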