Skip to content

Commit 25e460e

Browse files
authored
Merge pull request anthropics#54 from anthropics/dworken/prompt-injection-docs
Add security considerations for prompt injection
2 parents 68982a6 + ffc925f commit 25e460e

3 files changed

Lines changed: 8 additions & 2 deletions

File tree

README.md

Lines changed: 4 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -40,6 +40,10 @@ jobs:
4040
claude-api-key: ${{ secrets.CLAUDE_API_KEY }}
4141
```
4242
43+
## Security Considerations
44+
45+
This action is not hardened against prompt injection attacks and should only be used to review trusted PRs. We recommend [configuring your repository](https://docs.github.com/en/repositories/managing-your-repositorys-settings-and-features/enabling-features-for-your-repository/managing-github-actions-settings-for-a-repository#controlling-changes-from-forks-to-workflows-in-public-repositories) to use the "Require approval for all external contributors" option to ensure workflows only run after a maintainer has reviewed the PR.
46+
4347
## Configuration Options
4448
4549
### Action Inputs

claudecode/github_action_audit.py

Lines changed: 2 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -224,7 +224,8 @@ def run_security_audit(self, repo_dir: Path, prompt: str) -> Tuple[bool, str, Di
224224
cmd = [
225225
'claude',
226226
'--output-format', 'json',
227-
'--model', DEFAULT_CLAUDE_MODEL
227+
'--model', DEFAULT_CLAUDE_MODEL,
228+
'--disallowed-tools', 'Bash(ps:*)'
228229
]
229230

230231
# Run Claude Code with retry logic

claudecode/test_claude_runner.py

Lines changed: 2 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -166,7 +166,8 @@ def test_run_security_audit_success(self, mock_run):
166166
assert call_args[0][0] == [
167167
'claude',
168168
'--output-format', 'json',
169-
'--model', DEFAULT_CLAUDE_MODEL
169+
'--model', DEFAULT_CLAUDE_MODEL,
170+
'--disallowed-tools', 'Bash(ps:*)'
170171
]
171172
assert call_args[1]['input'] == 'test prompt'
172173
assert call_args[1]['cwd'] == Path('/tmp/test')

0 commit comments

Comments
 (0)