Stock-Market-Sentiment-Analysis-using-hugging-face

This project helps visualize the relationship between news articles and the stock market.

Stock Market Sentiment Analysis

This repository contains code for analyzing market sentiment using stock prices and news articles. It is organized around three main files, two Python scripts (ticker_collection.py and main.py) and one data file (companies.csv):

Ticker Collection Script (ticker_collection.py)

This script is designed to collect ticker symbols and corresponding company names for companies listed in the S&P 500 index.

Usage

Dependencies

yfinance: Used to fetch financial data, including stock prices, from Yahoo Finance.
pandas: Utilized for data manipulation and analysis.

Functionality

get_company_name Function:
- Fetches the company name for a given ticker symbol using the Yahoo Finance API.
- Returns the full name of the company if successful; otherwise, prints an error message and returns None.
Fetching S&P 500 Tickers:
- Fetches the list of S&P 500 tickers from Wikipedia.
- Extracts ticker symbols from the HTML table and converts them to a list, limiting it to the first 500 tickers.
Creating an Empty DataFrame:
- Initializes an empty DataFrame to store the results, with columns for ticker symbols and company names.
Collecting Tickers and Company Names:
- Iterates through each ticker symbol in the S&P 500 tickers list.
- Calls the get_company_name function to fetch the corresponding company name.
- Appends the ticker and company name to the DataFrame if the company name is retrieved successfully.
- Prints an error message if the company name cannot be retrieved for a ticker.
Saving Data to CSV:
- Saves the DataFrame containing ticker symbols and company names to a CSV file named companies.csv, excluding the index.
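
The steps above can be sketched roughly as follows. This is a minimal illustration, not the repository's actual code: it uses plain dictionaries and the standard csv module in place of a pandas DataFrame so that it runs without extra packages, it assumes the company name is read from the `longName` key of yfinance's `Ticker.info`, and the `lookup` parameter and the helper names `collect_companies` and `save_companies` are hypothetical.

```python
import csv


def get_company_name(ticker):
    """Return the company's full name for a ticker, or None on failure."""
    try:
        import yfinance as yf  # imported lazily so the rest runs without it

        # Assumption: the name is taken from the "longName" field.
        return yf.Ticker(ticker).info.get("longName")
    except Exception as exc:  # network errors, unknown tickers, missing package
        print(f"Error fetching company name for {ticker}: {exc}")
        return None


def collect_companies(tickers, lookup=get_company_name):
    """Build one row per ticker whose company name could be resolved."""
    rows = []
    for ticker in tickers:
        name = lookup(ticker)
        if name:
            rows.append({"Ticker": ticker, "Company Name": name})
        else:
            print(f"Could not retrieve company name for {ticker}")
    return rows


def save_companies(rows, path="companies.csv"):
    """Write the rows to CSV with a header row and no index column."""
    with open(path, "w", newline="", encoding="utf-8") as handle:
        writer = csv.DictWriter(handle, fieldnames=["Ticker", "Company Name"])
        writer.writeheader()
        writer.writerows(rows)
```

Passing the lookup function as a parameter makes it possible to try the collection loop with a stub (for example a dictionary's `get` method) before making any network calls.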

Files

ticker_collection.py: Python script containing the ticker collection functionality.
companies.csv: CSV file containing the collected ticker symbols and company names.

Companies CSV File

The companies.csv file contains the collected ticker symbols and corresponding company names retrieved using the ticker collection script.

Description

This CSV file serves as a dataset that stores information about companies listed in the S&P 500 index. It includes two columns:

Ticker: Represents the unique ticker symbol associated with each company.
Company Name: Represents the full name of the company corresponding to the ticker symbol.

Usage

Data Retrieval: The main.py script accesses the companies.csv file to retrieve the ticker symbols and company names of companies listed in the S&P 500 index.
User Selection: In the Streamlit web application generated by main.py, users can select one or more companies from a dropdown menu populated with the company names listed in the companies.csv file.
Data Processing: After the user selects companies, the main.py script retrieves historical stock data and sentiment analysis scores for the selected companies.
Integration: The ticker symbols obtained from the companies.csv file are used to fetch historical stock data from the Yahoo Finance API. Additionally, the company names are used for display purposes in the Streamlit application.
Error Handling: If a company's ticker symbol or name is not found, the Streamlit application displays an error message to the user.

Example

Suppose a user selects the company "Apple Inc." from the dropdown menu in the Streamlit application. The main.py script retrieves the corresponding ticker symbol "AAPL" from the companies.csv file. Subsequently, it uses this ticker symbol to fetch historical stock data and sentiment analysis scores for Apple Inc.
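
The name-to-ticker lookup in this example might look like the sketch below. It assumes the two column headers described above (Ticker and Company Name); the in-memory sample rows stand in for the real companies.csv, and the helper names `load_name_to_ticker` and `ticker_for` are hypothetical.

```python
import csv
import io

# Two sample rows standing in for the real companies.csv.
SAMPLE_CSV = """Ticker,Company Name
AAPL,Apple Inc.
MSFT,Microsoft Corporation
"""


def load_name_to_ticker(handle):
    """Map each company name to its ticker symbol."""
    return {row["Company Name"]: row["Ticker"] for row in csv.DictReader(handle)}


def ticker_for(name, name_to_ticker):
    """Return the ticker for a company name, or None if it is not listed."""
    return name_to_ticker.get(name)


name_to_ticker = load_name_to_ticker(io.StringIO(SAMPLE_CSV))
print(ticker_for("Apple Inc.", name_to_ticker))  # prints AAPL
```

To read the real file instead, pass `open("companies.csv", newline="", encoding="utf-8")` to `load_name_to_ticker`.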

File Format

File Name: companies.csv
Format: Comma-separated values (CSV)
Encoding: UTF-8

Main Script: Market Sentiment Analysis

The main.py script is the primary component of the Market Sentiment Analysis project. It is responsible for generating a Streamlit web application that allows users to select companies from the S&P 500 index, retrieve their historical stock data, and analyze their sentiment scores based on news articles.

File Structure

File Name: main.py
Dependencies:
- yfinance: For retrieving historical stock data.
- pandas: For data manipulation and processing.
- plotly.graph_objects: For generating interactive charts.
- transformers: For sentiment analysis using pre-trained models.
- tqdm: For displaying progress bars during sentiment analysis.
- requests: For fetching news articles from the News API.
- time: For adding delays during sentiment analysis.
- plotly.subplots: For creating subplot grids in the charts.
- streamlit: For building the web application user interface.
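
As an illustration of how requests might be used against the News API, the sketch below builds a query for the `everything` endpoint. How main.py actually queries the API is not documented here, so the chosen parameters, the 30-day window, and the helper names `build_news_params` and `fetch_articles` are assumptions; a valid API key is also required.

```python
from datetime import date, timedelta

NEWS_API_URL = "https://newsapi.org/v2/everything"


def build_news_params(company_name, api_key, days=30, today=None):
    """Build query parameters for articles published in the last `days` days."""
    today = today or date.today()
    return {
        "q": company_name,
        "from": (today - timedelta(days=days)).isoformat(),
        "language": "en",
        "sortBy": "publishedAt",
        "apiKey": api_key,
    }


def fetch_articles(company_name, api_key):
    """Return the list of article dictionaries for a company (needs network)."""
    import requests  # imported lazily so build_news_params works without it

    response = requests.get(
        NEWS_API_URL,
        params=build_news_params(company_name, api_key),
        timeout=10,
    )
    response.raise_for_status()
    return response.json().get("articles", [])
```

Keeping parameter construction separate from the network call makes the query easy to inspect and adjust without spending API quota.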

Usage

Initialization: Upon execution, the main.py script loads the Twitter-RoBERTa sentiment analysis model and its tokenizer.
Company Selection: Users are presented with a dropdown menu populated with company names fetched from the companies.csv file. They can select one or more companies they are interested in.
Data Retrieval: After selecting companies, the script retrieves historical stock data for the last month and news articles related to the selected companies.
Sentiment Analysis: The script performs sentiment analysis on the retrieved news articles using the initialized sentiment analysis model. It calculates sentiment scores for each article and aggregates them to obtain an average sentiment score for each company.
Data Visualization: Using Plotly, the script generates interactive charts that display the historical stock prices and average sentiment scores for the selected companies over the past month.
User Interaction: Users can interact with the generated charts to analyze the relationship between stock prices and sentiment scores. Additionally, error messages are displayed for any issues encountered during data retrieval or analysis.
Integration: The main.py script integrates various libraries and APIs, including yfinance for stock data retrieval, Transformers for sentiment analysis, Plotly for data visualization, and Streamlit for web application development.
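
The sentiment aggregation step can be sketched as below. The exact scoring used by main.py is not described, so this is one plausible scheme rather than the project's method: each prediction's confidence is signed by its label and the signed values are averaged. The model identifier `cardiffnlp/twitter-roberta-base-sentiment-latest` is an assumed choice of Twitter-RoBERTa checkpoint, and the helper names are hypothetical.

```python
# Signed weight per label. Both label naming schemes used by Twitter-RoBERTa
# sentiment checkpoints are covered (keys are compared in lowercase).
LABEL_WEIGHTS = {
    "negative": -1.0, "neutral": 0.0, "positive": 1.0,
    "label_0": -1.0, "label_1": 0.0, "label_2": 1.0,
}


def signed_score(prediction):
    """Turn a {'label': ..., 'score': ...} prediction into a value in [-1, 1]."""
    return LABEL_WEIGHTS[prediction["label"].lower()] * prediction["score"]


def average_sentiment(predictions):
    """Average the signed scores; return None when there are no articles."""
    if not predictions:
        return None
    return sum(signed_score(p) for p in predictions) / len(predictions)


def load_classifier(model_name="cardiffnlp/twitter-roberta-base-sentiment-latest"):
    """Create a Hugging Face pipeline (downloads the model on first use)."""
    from transformers import pipeline  # imported lazily; heavy dependency

    return pipeline("sentiment-analysis", model=model_name)
```

With the model available, `average_sentiment(load_classifier()(headlines))` would give one score per company, where `headlines` is a list of article titles.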

Example

Suppose a user selects the companies "Apple Inc." and "Microsoft Corporation" from the dropdown menu in the Streamlit application. The main.py script retrieves their historical stock data and sentiment analysis scores, generates interactive charts displaying the data, and presents them to the user for analysis.
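
A chart like the one in this example could be built along the following lines. This is a sketch under assumptions: prices and sentiment are taken to be dictionaries keyed by ISO date strings, sentiment is assumed to be available per day, and the helper names `align_by_date` and `build_figure` are hypothetical. The Plotly calls use a single subplot with a secondary y-axis so that price and sentiment can share one time axis.

```python
def align_by_date(prices, sentiments):
    """Keep only dates present in both series, in chronological order."""
    dates = sorted(set(prices) & set(sentiments))
    return dates, [prices[d] for d in dates], [sentiments[d] for d in dates]


def build_figure(company, prices, sentiments):
    """Plot closing price (left axis) against average sentiment (right axis)."""
    import plotly.graph_objects as go  # imported lazily
    from plotly.subplots import make_subplots

    dates, closes, scores = align_by_date(prices, sentiments)
    fig = make_subplots(specs=[[{"secondary_y": True}]])
    fig.add_trace(
        go.Scatter(x=dates, y=closes, name=f"{company} close"),
        secondary_y=False,
    )
    fig.add_trace(
        go.Scatter(x=dates, y=scores, name="Average sentiment"),
        secondary_y=True,
    )
    fig.update_layout(title_text=f"{company}: price vs. sentiment")
    fig.update_yaxes(title_text="Close price", secondary_y=False)
    fig.update_yaxes(title_text="Sentiment", secondary_y=True)
    return fig
```

In a Streamlit app, the returned figure can be displayed with `st.plotly_chart(fig)`, once per selected company.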

Getting Started

Clone the Repository:

git clone https://github.com/Ojas-Rohatgi/Stock-Market-Sentiment-Analysis-using-hugging-face.git

Install Dependencies:

cd <repository_directory>
python -m venv venv
source venv/bin/activate  # (Linux/Mac)
pip install -r requirements.txt

For Windows, use venv\Scripts\activate.

Collect Ticker Symbols and Company Names:
```
python ticker_collection.py
```
Run the Main Application:
- Execute the main.py script using Streamlit.
- This will start the Streamlit application, allowing you to select companies and view stock price and sentiment analysis.
```
streamlit run main.py
```

Files

ticker_collection.py:
- Script for collecting ticker symbols and company names.
companies.csv:
- CSV file containing company names and ticker symbols.
main.py:
- Main application script for visualizing stock price and sentiment analysis.
requirements.txt:
- yfinance
- pandas
- plotly
- transformers
- tqdm
- requests
- streamlit

Contributing

Contributions are welcome! Please feel free to submit issues or pull requests.

License

MIT License
