Is there a way to improve the quality of search results by targeting a particular subject area?
For example, let us say you wanted to improve the results for the 2020 Tokyo Olympics, could you systematically start crawling particular web pages or something like that?
Are there any useful strategies to accomplish this that anybody can suggest?
Only idea is, after sitemap retrieval form other peers, to filter the urls where it reads what you need. Sitemap retrieval is my prop noted here [link].
Other way “would” be to filter the urls using the crawler’s settings. But - filtering out too many documents means ending up with nothing quite quickly.
Just start crawling from pages related to the desired topic?