Skip to content

fpasoslibres/AIMS-Meow

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

25 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Meow Team:

  • Angelica Noriega Stoudenikina
  • Zaid Minhas
  • Héna Ricucci
  • Beaudelaire Tsoungui Nzodoumkouo

AIMS Hackathon - Visual Extracton and Analysis Program

Chosen challenge: #1 Data Mining, Processing & Enrichment

1. Problem Statement

There is currently a loss of information due to ignoring visual elements in reports and missing support for french reports from Canada.

2. Objective

Efficiently extract, classify and analyse the visual elements of the modern slavery reports provided by companies.

3. Solution/Data use case description

  • Python script that scrapes Public Safety Canada's Supply Chains Act library to get the pdf URLs in both french and english
  • Python script that extracts the visual elements from these statements
  • AI classification model that categorizes the images into one of the following categories: Signature, Logo, Scanned Page, Diagram, or Other
  • Model that extracts information from diagrams and text from scanned pages

4. Pitch

Watch our solution pitch here!

5. Datasets

Location: Online

Public Safety Canada's Supply Chain Act Library

6. Project Code

Location: /project

It includes the following:

• Data transformations, merging & quality assurance

• Model related code (projection, prediction, correlation etc.)

User Interface code

7. How to run the code

🚀 Quick Start (Recommended)

Prerequisites

  • Python 3.9+ installed on your system
  • Node.js 16+ and npm installed
  • Git (to clone the repository)
  • tesseract

One-Command Setup

# Clone the repository
git clone <repository-url>
cd AIMS-repo

# Run the automated setup script
chmod +x setup.sh
./setup.sh

The setup script will automatically:

  • ✅ Install all Python dependencies (backend + ML/AI libraries)
  • ✅ Install all Node.js dependencies (React frontend)
  • ✅ Verify all prerequisites are met
  • ✅ Provide clear next steps

Starting the Application

After setup completes, start both services:

Terminal 1 - Backend:

cd backend
python3 app.py

Terminal 2 - Frontend:

cd frontend
npm run dev

Then open your browser to: http://localhost:5173

🛠️ Manual Setup (Alternative)

If you prefer manual setup or encounter issues with the automated script:

Backend Setup

cd backend
pip3 install -r requirements.txt

# Install ML/AI dependencies
cd ../project_code
pip3 install -r requirements.txt

Frontend Setup

cd frontend
npm install

Running Manually

# Terminal 1: Backend
cd backend && python3 app.py

# Terminal 2: Frontend  
cd frontend && npm run dev

🎯 Using the Application

1. Automatic Extraction

  • Click "Start Extraction" on the main dashboard
  • The system will automatically:
    • Crawl Canadian Supply Chain Act statements (if needed)
    • Extract visual elements from PDF documents
    • Classify images using AI models
    • Display results in an interactive table

2. View Results

  • Overview Tab: Statistics and summary of extracted data
  • Data Table Tab: Detailed view of all extracted images with sorting
  • Image Preview: Click any row to see the actual extracted image

3. Additional Tools

  • Image Classifier: Upload and classify individual images
  • PDF Extractor: Extract visuals from your own PDF files

📁 Project Structure

AIMS-repo/
├── backend/           # Flask API server
├── frontend/          # React web interface  
├── project_code/      # Core extraction & AI modules
│   ├── classification/    # Image classification models
│   └── data_extraction/   # PDF processing & web scraping
├── setup.sh          # Automated setup script
└── README.md         # This file

🔧 Troubleshooting

Common Issues

  • "command not found: python" → Use python3 instead
  • Module import errors → Ensure you've installed project_code dependencies
  • Port conflicts → Backend runs on :5001, Frontend on :5173
  • Image loading issues → Check that extraction has completed successfully

Getting Help

  1. Check the browser console for frontend errors
  2. Check terminal output for backend errors
  3. Ensure all dependencies are installed correctly
  4. Verify Python and Node.js versions meet requirements

8. Additional docs (Optional)

Location: /docs

• PowerPoint presentation

• Flayers

• Additional videos/demo

• Protocols

• Guides

9. Declaration of Intellectual Property

This project builds on the open research of Project AIMS (AI against Modern Slavery) by Mila and QUT.
GitHub repository: ai4h_aims-au.

Disclaimers

Computational Resources & Comparative Results

  • Describe here the resources used in developing your solution (e.g. GPUs, etc).

No Claims About Companies

This repository and its accompanying models, datasets, metrics, dashboards, and comparative analyses are provided strictly for research and demonstration purposes.

Any comparisons, rankings, or assessments of companies or organizations are exploratory in nature. They may be affected by incomplete data, modeling limitations, or methodological choices. These results must not be used to make factual, legal, or reputational claims about any entity without independent expert review and validation.

Do not use this repository’s contents to make public statements or claims about specific companies, organizations, or individuals.

Terms and Conditions

By submitting this solution to the AIMS Hackathon, our team acknowledges and agrees to abide by the Event’s Terms and Conditions.

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors