Flash-X Docker Test Suite

This document describes the comprehensive test suite for the flashx_docker project, including unit tests, integration tests, and continuous integration workflows.

Overview
Test Structure
Prerequisites
Running Tests Locally
Test Categories
GitHub Actions CI/CD
Writing New Tests
Troubleshooting

Overview

The test suite ensures that:

Shell scripts are syntactically correct and follow best practices
Dockerfiles are properly formatted and secure
Docker images build successfully
The complete workflow functions correctly across platforms
Security vulnerabilities are detected early

Test Structure

tests/
├── run_tests.sh                   # Local test runner script
├── test_run_flashx.bats          # Unit tests for run_flashx.sh
├── test_docker_build.bats        # Unit tests for flashx_dockerfile
└── test_integration.bats         # Integration tests for complete workflow

.github/workflows/
└── ci.yml                        # GitHub Actions CI/CD pipeline

README_testing.md                  # This file

Prerequisites

Required Tools

BATS (Bash Automated Testing System)

# macOS
brew install bats-core

# Ubuntu/Debian
sudo apt-get install bats

# Manual installation
git clone https://github.com/bats-core/bats-core.git
cd bats-core
sudo ./install.sh /usr/local

Docker
- Install from docker.com
- Ensure Docker daemon is running

Optional Tools (Recommended)

ShellCheck (Shell script linter)

# macOS
brew install shellcheck

# Ubuntu/Debian
sudo apt-get install shellcheck

# Or download from: https://github.com/koalaman/shellcheck

Hadolint (Dockerfile linter)

# macOS
brew install hadolint

# Linux
wget -O /usr/local/bin/hadolint https://github.com/hadolint/hadolint/releases/latest/download/hadolint-Linux-x86_64
chmod +x /usr/local/bin/hadolint

# Or use Docker
docker pull hadolint/hadolint

Running Tests Locally

Quick Start

Run all tests with the automated test runner:

cd tests
./run_tests.sh

This script will:

Check for required dependencies
Run linters (ShellCheck, Hadolint)
Execute all BATS test suites
Provide a summary of results

Running Individual Test Suites

Run specific test files:

# Test the shell script
bats tests/test_run_flashx.bats

# Test the Dockerfile
bats tests/test_docker_build.bats

# Run integration tests
bats tests/test_integration.bats

Running Linters

# Check shell script
shellcheck run_flashx.sh

# Check Dockerfile
hadolint flashx_dockerfile

Test Categories

1. Shell Script Tests (`test_run_flashx.bats`)

Tests for run_flashx.sh:

File existence and permissions
Syntax validation
OS detection logic
Docker availability checks
UID/GID handling
Volume mounting configuration
WSL path conversion
Error handling
Security (no hardcoded credentials)

Example tests:

Verifies script is executable
Checks for proper Docker commands
Validates environment variable usage
Ensures cross-platform compatibility

2. Dockerfile Tests (`test_docker_build.bats`)

Tests for flashx_dockerfile:

Base image specification
Architecture support (x86_64, aarch64)
Required package installation:
- Build tools (gcc, gfortran, make, cmake)
- MPI (OpenMPI)
- HDF5 libraries
- Python/Conda environment
- Scientific packages (yt, h5py)
Flash-X repository cloning
User creation and permissions
Build process validation
Security checks

Example tests:

Confirms Ubuntu base image
Verifies all required packages are installed
Checks for non-root user creation
Validates MANIFEST generation

3. Integration Tests (`test_integration.bats`)

End-to-end workflow tests:

Docker installation and daemon status
Complete image build process
Container startup and execution
File system operations
Volume mounting functionality
Cross-platform compatibility
User permission handling

Note: Many integration tests are skipped by default because they require:

A fully built Docker image (time-consuming)
Significant computational resources
Platform-specific configurations

Running Full Integration Tests

What's Required:

Time: 15-30+ minutes for initial Docker image build
Disk Space: ~5-10 GB for Docker image and dependencies
Network: Active internet connection for downloading packages
Resources:
- 4+ GB RAM recommended
- Multi-core CPU for faster compilation
- Docker daemon running with sufficient resources allocated

What Gets Built: The full integration tests build a complete Flash-X Docker image including:

Ubuntu 20.04 base image
Build tools (gcc, gfortran, make, cmake)
OpenMPI for parallel computing
HDF5 libraries for scientific data storage
Miniconda Python 3.10 environment
Scientific packages (yt toolkit, h5py)
FFmpeg for visualization
Flash-X astrophysical simulation code
Compiled Sedov test problem

How to Enable Full Integration Tests:

Build the Docker image first (one-time setup):

# Option 1: Use the run script
./run_flashx.sh

# Option 2: Build directly
docker build -f flashx_dockerfile \
  --build-arg USER_ID=$(id -u) \
  --build-arg GROUP_ID=$(id -g) \
  -t flashx-integration-test .

Edit the test file to enable specific tests:

# Open the integration test file
vim tests/test_integration.bats

# Find tests with 'skip' and either:
# - Remove the 'skip' line entirely
# - Comment it out with '#'
# - Replace 'skip' with 'run' (in some test frameworks)

Run the full integration test suite:
```
bats tests/test_integration.bats
```

Which Tests to Enable:

After building the image, you can safely enable these tests:

"Built image contains Flash-X directory"
"Container can execute basic commands"
"Container runs with non-root user"
"Container has Conda environment activated"
"Container has yt toolkit installed"
"Container has h5py installed"
"Container has OpenMPI installed"
"Container has gcc installed"
"Container has gfortran installed"
"Container has Flash-X repository cloned"
"Container has Sedov test problem built"
"Container has MANIFEST file"
"Container can mount volumes"
"Container has FFmpeg installed"
"Container has HDF5 tools"
"Container has git installed"
"Container has Python 3.10"

Time-Intensive Tests (keep skipped unless needed):

"Docker image builds successfully" - Full build (~15-30 min)
"Can execute Flash-X simulation" - Runs actual simulation (~varies)

Example: Enabling a Single Test

Before:

@test "Container has yt toolkit installed" {
    skip "Requires built image"
    docker run --rm "$TEST_IMAGE" python -c "import yt"
}

After:

@test "Container has yt toolkit installed" {
    # skip "Requires built image"  # Commented out
    docker run --rm "$TEST_IMAGE" python -c "import yt"
}

GitHub Actions CI/CD

The project includes a comprehensive CI/CD pipeline defined in .github/workflows/ci.yml.

Workflow Jobs

lint-shell: Runs ShellCheck on run_flashx.sh
lint-dockerfile: Runs Hadolint on flashx_dockerfile
test-shell-script: Executes BATS tests on Ubuntu and macOS
test-dockerfile: Validates Dockerfile structure
test-integration: Runs integration test suite
docker-build-test: Tests Docker image build (lightweight)
security-scan: Scans for vulnerabilities with Trivy
validation-summary: Aggregates all test results

Triggering CI

The workflow runs automatically on:

Push to main or develop branches
Pull requests to main or develop branches
Manual trigger via GitHub Actions UI

Viewing Results

Go to the "Actions" tab in your GitHub repository
Click on the latest workflow run
View individual job results
Check logs for detailed error messages

CI Configuration

The workflow uses:

Ubuntu runners for most tests
macOS runners for cross-platform validation
Docker Buildx for efficient builds
GitHub Actions cache for faster subsequent runs

Writing New Tests

BATS Test Structure

@test "description of test" {
    # Test commands
    run some_command
    [ "$status" -eq 0 ]
    [ "$output" = "expected output" ]
}

Common BATS Assertions

# Check exit status
[ "$status" -eq 0 ]

# Check output
[ "$output" = "expected" ]

# Pattern matching
[[ "$output" =~ pattern ]]

# File existence
[ -f "/path/to/file" ]

# Command availability
command -v docker

Adding Tests

Choose the appropriate test file:
- Shell script behavior → test_run_flashx.bats
- Dockerfile content → test_docker_build.bats
- End-to-end workflow → test_integration.bats
Add a new @test block with a descriptive name
Implement the test logic
Run locally to verify:
```
bats tests/test_your_file.bats
```
Commit and push to trigger CI

Example: Adding a New Test

@test "Script handles network errors gracefully" {
    # Mock a network failure scenario
    # Test that the script provides appropriate error message
    # This is a placeholder for actual implementation
    skip "Network error handling test - implement as needed"
}

Troubleshooting

Common Issues

BATS Not Found

Error: bats: command not found

Solution: Install BATS using instructions in Prerequisites

Docker Not Running

Error: Cannot connect to the Docker daemon

Solution: Start Docker Desktop or Docker daemon

Permission Denied

Error: Permission denied when accessing run_tests.sh

Solution:

chmod +x tests/run_tests.sh

Tests Fail on macOS but Pass on Linux

Check platform-specific logic in the script
Verify path differences (macOS vs Linux)
Test Docker volume mounting behavior

Integration Tests All Skipped

This is normal for fresh installations
Integration tests require a built image
Build the image first: ./run_flashx.sh
Then manually enable desired integration tests

Debug Mode

Run BATS in verbose mode:

bats -t tests/test_run_flashx.bats

Print all commands:

bats -x tests/test_run_flashx.bats

Getting Help

Review test output carefully
Check individual test descriptions
Examine the actual scripts being tested
Consult BATS documentation: https://bats-core.readthedocs.io/
Review GitHub Actions logs for CI failures

Best Practices

Keep tests independent: Each test should run in isolation
Use descriptive names: Test names should clearly indicate what is being tested
Clean up after tests: Use teardown() to remove temporary files
Skip expensive tests: Use skip for time-consuming integration tests in CI
Test both success and failure cases: Verify error handling
Document complex tests: Add comments explaining non-obvious test logic
Run tests before committing: Catch issues early in development

Contributing

When adding new features to flashx_docker:

Write tests for new functionality
Ensure all existing tests pass
Update test documentation as needed
Verify CI pipeline succeeds
Submit pull request with tests included

Additional Resources

Quick Reference

Run All Tests

./tests/run_tests.sh

Run Specific Test Suite

bats tests/test_run_flashx.bats      # Shell script tests
bats tests/test_docker_build.bats    # Dockerfile tests
bats tests/test_integration.bats     # Integration tests

Lint Code

shellcheck run_flashx.sh             # Lint shell script
hadolint flashx_dockerfile           # Lint Dockerfile

Manual CI Trigger

Go to GitHub repository → Actions tab
Select "CI" workflow
Click "Run workflow"
Choose branch and click "Run workflow"

FilesExpand file tree

README_testing.md

Latest commit

History