Adversarial-Prompt-Stress-Testing-for-AI-Security

Inspiration

With AI copilots and assistants being widely adopted, we saw a major risk: adversarial prompts can trick models into revealing secrets or bypassing safeguards. We wanted a proactive way to test and strengthen AI security before deployment.

What It Does

The system generates adversarial prompts using an LLM, runs them against a target AI model (cloud or local), captures the responses, and evaluates vulnerabilities. It helps identify weaknesses like prompt injection, data leakage, or unsafe outputs.

How We Built It

• Used LangChain + ChatOpenAI wrapper to connect with the GenAI Lab API for generating adversarial prompts. • Created a vulnerable test AI model to simulate attacks. • Developed scripts to generate prompts, run tests, and save outputs. • Designed a workflow that evaluates model responses and reports findings.

Challenges We Ran Into

• Dealing with version mismatches in LangChain imports. • Converting LLM responses into consistent strings for saving. • Designing a vulnerable AI model that is simple but realistic enough for testing.

Accomplishments That We’re Proud Of

• Built a working adversarial testing pipeline end-to-end. • Successfully generated and executed adversarial prompts. • Created a clear workflow diagram and hackathon submission framework.

What We Learned

• Hands-on experience with LLM prompt injection attacks and mitigations. • How to integrate LangChain with custom AI endpoints. • The importance of version control and defensive coding when dealing with rapidly evolving AI frameworks.

The Future Is Bright

• Extend evaluation with automated scoring metrics (e.g., risk level, severity). • Integrate into CI/CD pipelines for AI model deployment. • Build a dashboard for real-time monitoring and security reporting.

**Built With ** • LangChain • OpenAI / GenAI Lab API • Python • httpx • Custom test AI model

Name		Name	Last commit message	Last commit date
Latest commit History 3 Commits
config		config
scripts		scripts
README.md		README.md
main.py		main.py
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Adversarial-Prompt-Stress-Testing-for-AI-Security

Inspiration

What It Does

How We Built It

Challenges We Ran Into

Accomplishments That We’re Proud Of

What We Learned

The Future Is Bright

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Adversarial-Prompt-Stress-Testing-for-AI-Security

Inspiration

What It Does

How We Built It

Challenges We Ran Into

Accomplishments That We’re Proud Of

What We Learned

The Future Is Bright

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages