A Streamlit-based web scraping application designed to simplify the process of extracting data from web pages. it streamlines data collection and visualization, making it ideal for showcasing extracted web data in a modern and professional manner.
- Easy-to-use web interface.
- Custom field specification for data extraction.
- Pagination
- Dynamic data processing with Python and Streamlit.
- Direct download capabilities for extracted data in various formats.
- Attended mode
Before you begin, ensure you have the following installed:
- Python 3.6 or higher
- Pip for managing Python packages
Follow these steps to get your development environment running:
# Clone the repository
git clone https://github.com/reda-marzouk608/scrape-master
cd scrape-master
# It's recommended to create a virtual environment
python -m venv venv
# Activate the virtual environment
# On Windows
venv\Scripts\activate
# On MacOS/Linux
source venv/bin/activate
# Install the required packages
pip install -r requirements.txtTo run ScrapeMaster, navigate to the project directory and run the following command:
streamlit run streamlit_app.pyAfter launching the application, open your web browser to the indicated address (typically http://localhost:8501). Use the sidebar to input the URL and fields you wish to scrape, then click the "Scrape" button to see results.
