Meow Team:

Angelica Noriega Stoudenikina
Zaid Minhas
Héna Ricucci
Beaudelaire Tsoungui Nzodoumkouo

AIMS Hackathon - Visual Extracton and Analysis Program

Chosen challenge: #1 Data Mining, Processing & Enrichment

1. Problem Statement

There is currently a loss of information due to ignoring visual elements in reports and missing support for french reports from Canada.

2. Objective

Efficiently extract, classify and analyse the visual elements of the modern slavery reports provided by companies.

3. Solution/Data use case description

Python script that scrapes Public Safety Canada's Supply Chains Act library to get the pdf URLs in both french and english
Python script that extracts the visual elements from these statements
AI classification model that categorizes the images into one of the following categories: Signature, Logo, Scanned Page, Diagram, or Other
Model that extracts information from diagrams and text from scanned pages

4. Pitch

Watch our solution pitch here!

5. Datasets

Location: Online

• Public Safety Canada's Supply Chain Act Library

6. Project Code

Location: /project

It includes the following:

• Data transformations, merging & quality assurance

• Model related code (projection, prediction, correlation etc.)

• User Interface code

7. How to run the code

🚀 Quick Start (Recommended)

Prerequisites

Python 3.9+ installed on your system
Node.js 16+ and npm installed
Git (to clone the repository)
tesseract
- macOS → brew install tesseract
- Ubuntu/Debian: sudo apt-get install tesseract-ocr
- Windows: Download from https://github.com/UB-Mannheim/tesseract/wiki

One-Command Setup

# Clone the repository
git clone <repository-url>
cd AIMS-repo

# Run the automated setup script
chmod +x setup.sh
./setup.sh

The setup script will automatically:

✅ Install all Python dependencies (backend + ML/AI libraries)
✅ Install all Node.js dependencies (React frontend)
✅ Verify all prerequisites are met
✅ Provide clear next steps

Starting the Application

After setup completes, start both services:

Terminal 1 - Backend:

cd backend
python3 app.py

Terminal 2 - Frontend:

cd frontend
npm run dev

Then open your browser to: http://localhost:5173

🛠️ Manual Setup (Alternative)

If you prefer manual setup or encounter issues with the automated script:

Backend Setup

cd backend
pip3 install -r requirements.txt

# Install ML/AI dependencies
cd ../project_code
pip3 install -r requirements.txt

Frontend Setup

cd frontend
npm install

Running Manually

# Terminal 1: Backend
cd backend && python3 app.py

# Terminal 2: Frontend  
cd frontend && npm run dev

🎯 Using the Application

1. Automatic Extraction

Click "Start Extraction" on the main dashboard
The system will automatically:
- Crawl Canadian Supply Chain Act statements (if needed)
- Extract visual elements from PDF documents
- Classify images using AI models
- Display results in an interactive table

2. View Results

Overview Tab: Statistics and summary of extracted data
Data Table Tab: Detailed view of all extracted images with sorting
Image Preview: Click any row to see the actual extracted image

3. Additional Tools

Image Classifier: Upload and classify individual images
PDF Extractor: Extract visuals from your own PDF files

📁 Project Structure

AIMS-repo/
├── backend/           # Flask API server
├── frontend/          # React web interface  
├── project_code/      # Core extraction & AI modules
│   ├── classification/    # Image classification models
│   └── data_extraction/   # PDF processing & web scraping
├── setup.sh          # Automated setup script
└── README.md         # This file

🔧 Troubleshooting

Common Issues

"command not found: python" → Use python3 instead
Module import errors → Ensure you've installed project_code dependencies
Port conflicts → Backend runs on :5001, Frontend on :5173
Image loading issues → Check that extraction has completed successfully

Getting Help

Check the browser console for frontend errors
Check terminal output for backend errors
Ensure all dependencies are installed correctly
Verify Python and Node.js versions meet requirements

8. Additional docs (Optional)

Location: /docs

• PowerPoint presentation

• Flayers

• Additional videos/demo

• Protocols

• Guides

9. Declaration of Intellectual Property

This project builds on the open research of Project AIMS (AI against Modern Slavery) by Mila and QUT.
GitHub repository: ai4h_aims-au.

Disclaimers

Computational Resources & Comparative Results

Describe here the resources used in developing your solution (e.g. GPUs, etc).

No Claims About Companies

This repository and its accompanying models, datasets, metrics, dashboards, and comparative analyses are provided strictly for research and demonstration purposes.

Any comparisons, rankings, or assessments of companies or organizations are exploratory in nature. They may be affected by incomplete data, modeling limitations, or methodological choices. These results must not be used to make factual, legal, or reputational claims about any entity without independent expert review and validation.

Do not use this repository’s contents to make public statements or claims about specific companies, organizations, or individuals.

Terms and Conditions

By submitting this solution to the AIMS Hackathon, our team acknowledges and agrees to abide by the Event’s Terms and Conditions.

Name		Name	Last commit message	Last commit date
Latest commit History 25 Commits
backend		backend
frontend		frontend
project_code		project_code
Datasets		Datasets
Docs		Docs
README.md		README.md
setup.sh		setup.sh

Folders and files

Latest commit

History

Repository files navigation

Meow Team:

AIMS Hackathon - Visual Extracton and Analysis Program

1. Problem Statement

2. Objective

3. Solution/Data use case description

4. Pitch

5. Datasets

6. Project Code

7. How to run the code

🚀 Quick Start (Recommended)

Prerequisites

One-Command Setup

Starting the Application

🛠️ Manual Setup (Alternative)

Backend Setup

Frontend Setup

Running Manually

🎯 Using the Application

1. Automatic Extraction

2. View Results

3. Additional Tools

📁 Project Structure

🔧 Troubleshooting

Common Issues

Getting Help

8. Additional docs (Optional)

9. Declaration of Intellectual Property

Disclaimers

Computational Resources & Comparative Results

No Claims About Companies

Terms and Conditions

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages