C++ Web Crawler

A simple web crawler written in C++ using WinINet for HTTP requests.
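The download step can be done entirely with WinINet. Below is a minimal sketch of a fetch helper, not the repository's actual code: the function name FetchPage and the user-agent string are illustrative assumptions.

```cpp
#include <windows.h>
#include <wininet.h>
#include <string>
#pragma comment(lib, "wininet.lib")

// Hypothetical helper: fetch a page body over HTTP with WinINet.
std::string FetchPage(const std::string& url) {
    std::string body;
    // Open a WinINet session using the system's proxy configuration.
    HINTERNET hInet = InternetOpenA("webcrawlercpp", INTERNET_OPEN_TYPE_PRECONFIG,
                                    nullptr, nullptr, 0);
    if (!hInet) return body;
    // Open the URL, forcing a fresh download instead of a cached copy.
    HINTERNET hUrl = InternetOpenUrlA(hInet, url.c_str(), nullptr, 0,
                                      INTERNET_FLAG_RELOAD, 0);
    if (hUrl) {
        char buf[4096];
        DWORD read = 0;
        // Read the response in chunks until the stream is exhausted.
        while (InternetReadFile(hUrl, buf, sizeof(buf), &read) && read > 0)
            body.append(buf, read);
        InternetCloseHandle(hUrl);
    }
    InternetCloseHandle(hInet);
    return body;
}
```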

How It Works

  1. Start: Begins crawling from a given URL
  2. Download: Fetches the HTML content of the page (see the WinINet sketch above)
  3. Parse: Extracts all links from <a href="..."> tags
  4. Normalize: Converts relative URLs (e.g., /about) to absolute URLs
  5. Filter: Follows only links on the same domain (steps 3-5 are illustrated in the sketch after this list)
  6. Recurse: Repeats the process for each discovered link, up to the max depth
  7. Track: Keeps a set of visited URLs to avoid crawling the same page twice (see the recursion sketch under Max Depth)
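Steps 3-5 can be compressed into one helper. The sketch below is a deliberate simplification: it scans for every href=" occurrence rather than properly parsing <a> tags, and the names ExtractLinks and baseOrigin are assumptions for illustration, not the repository's API.

```cpp
#include <string>
#include <vector>

// Hypothetical sketch of parse + normalize + filter.
// baseOrigin is the scheme and host of the start URL, e.g. "https://example.com".
std::vector<std::string> ExtractLinks(const std::string& html,
                                      const std::string& baseOrigin) {
    std::vector<std::string> links;
    const std::string needle = "href=\"";
    size_t pos = html.find(needle);
    while (pos != std::string::npos) {
        size_t start = pos + needle.size();
        size_t end = html.find('"', start);
        if (end == std::string::npos) break;
        std::string link = html.substr(start, end - start);
        // Normalize: turn a relative path like /about into an absolute URL.
        if (!link.empty() && link[0] == '/')
            link = baseOrigin + link;
        // Filter: keep only links that stay on the starting domain.
        if (link.rfind(baseOrigin, 0) == 0)
            links.push_back(link);
        pos = html.find(needle, end);
    }
    return links;
}
```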

Max Depth

The max depth parameter controls how deep the crawler goes (a recursion sketch follows the list):

  • Depth 0: Only the starting URL
  • Depth 1: Starting URL + all links found on it
  • Depth 2: Starting URL + its links + links found on those pages
  • And so on...
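Putting the depth limit together with steps 6-7, the recursion might look like the sketch below. It assumes the hypothetical FetchPage and ExtractLinks helpers sketched earlier; the visited set is shared across the whole recursion so each page is fetched at most once.

```cpp
#include <set>
#include <string>

// Hypothetical sketch of the recursion: depth 0 fetches only the start
// URL, depth 1 also fetches its links, and so on.
void Crawl(const std::string& url, const std::string& baseOrigin,
           int depth, int maxDepth, std::set<std::string>& visited) {
    // Stop if we are past the depth limit or the URL was already crawled.
    if (depth > maxDepth || !visited.insert(url).second)
        return;
    std::string html = FetchPage(url);                             // step 2
    for (const std::string& link : ExtractLinks(html, baseOrigin)) // steps 3-5
        Crawl(link, baseOrigin, depth + 1, maxDepth, visited);     // step 6
}

// Usage: crawl https://example.com two levels deep.
// std::set<std::string> visited;
// Crawl("https://example.com", "https://example.com", 0, 2, visited);
```

std::set keeps the visited URLs duplicate-free; std::unordered_set would serve the same purpose and is typically faster, since the crawler never needs the ordering.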
