This guide will help you set up SmartCrawler on macOS systems.
- macOS 10.15 (Catalina) or later
- Administrator access for installation
- Internet connection for downloads
- Go to the SmartCrawler releases page
- Download the appropriate binary for your Mac:
smart-crawler-macos-x64.tar.gzfor Intel Macssmart-crawler-macos-arm64.tar.gzfor Apple Silicon Macs (M1/M2/M3)
- Extract and install:
# Extract the downloaded file tar -xzf smart-crawler-macos-*.tar.gz # Move to a directory in your PATH sudo mv smart-crawler /usr/local/bin/ # Make it executable chmod +x /usr/local/bin/smart-crawler
- Download
smart-crawler-[version].dmgfrom the releases page - Open the DMG file
- Copy
smart-crawlerto/usr/local/bin/or your preferred location - Make it executable:
chmod +x /usr/local/bin/smart-crawler
If you have Rust installed:
git clone https://github.com/pixlie/SmartCrawler.git
cd SmartCrawler
cargo build --release
# Binary will be in target/release/smart-crawlerSmartCrawler requires a WebDriver server to control a browser. Choose one:
-
Install Firefox (if not already installed):
# Using Homebrew (recommended) brew install firefox # Or download from firefox.com
-
Install GeckoDriver:
# Using Homebrew (recommended) brew install geckodriver # Or download manually from GitHub releases wget https://github.com/mozilla/geckodriver/releases/latest/download/geckodriver-v0.33.0-macos.tar.gz tar -xzf geckodriver-v0.33.0-macos.tar.gz sudo mv geckodriver /usr/local/bin/
-
Install Chrome (if not already installed):
# Using Homebrew brew install google-chrome # Or download from chrome.com
-
Install ChromeDriver:
# Using Homebrew brew install chromedriver # Or download manually - check your Chrome version first google-chrome --version # Then download matching version from Chrome for Testing
macOS may block unsigned executables. If you get a security warning:
-
Allow the binary to run:
# Remove quarantine attribute xattr -d com.apple.quarantine /usr/local/bin/smart-crawler # Or allow in System Preferences # System Preferences > Security & Privacy > General > Allow anyway
-
For WebDriver binaries:
# If you downloaded manually xattr -d com.apple.quarantine /usr/local/bin/geckodriver xattr -d com.apple.quarantine /usr/local/bin/chromedriver
- Open Terminal
- Start WebDriver (choose one):
# For Firefox (GeckoDriver) geckodriver --port 4444 # For Chrome (ChromeDriver) chromedriver --port=4444
- Open a new Terminal window
- Test SmartCrawler:
smart-crawler --link "https://example.com"
# Basic crawl
smart-crawler --link "https://example.com"
# Crawl with verbose output
smart-crawler --link "https://example.com" --verbose
# Crawl with template detection
smart-crawler --link "https://example.com" --template --verbose
# Crawl multiple sites
smart-crawler --link "https://example.com" --link "https://another.com"- Ensure WebDriver is running on port 4444
- Check that the browser is installed
- Try restarting the WebDriver
- Verify no other application is using port 4444
- Make sure the binary is executable:
chmod +x /usr/local/bin/smart-crawler
- Check that
/usr/local/binis in your PATH:echo $PATH
- If you didn't install to
/usr/local/bin, add the location to your PATH:export PATH=$PATH:/path/to/smart-crawler
- Or run with the full path:
/path/to/smart-crawler --link "https://example.com"
- Remove quarantine attributes:
xattr -d com.apple.quarantine /usr/local/bin/smart-crawler
- Or go to System Preferences > Security & Privacy > General and click "Allow anyway"
- Kill any existing WebDriver processes:
pkill geckodriver pkill chromedriver
If you use Homebrew, you can install everything in one go:
# Install browsers and WebDriver
brew install firefox geckodriver
# Or for Chrome
brew install google-chrome chromedriver
# Download SmartCrawler binary and install
# (Follow Option A above for binary installation)- Read the CLI Options documentation for advanced usage
- Learn more about template detection for content pattern analysis
- Explore verbose mode for detailed HTML tree analysis
If you encounter issues:
- Check the troubleshooting section above
- Visit the GitHub Issues page
- Search for existing solutions or create a new issue
- Include your macOS version, browser version, and error messages
- Homebrew - Package manager for macOS
- Firefox Download
- Chrome Download
- GeckoDriver Releases
- ChromeDriver Downloads
- Rust Installation (if building from source)