ruby
diff --git a/‎doc/Index.md‎
Lines changed: 51 additions & 32 deletions b/‎doc/Index.md‎
Lines changed: 51 additions & 32 deletions
diff --git a/‎doc/appendices/a-directive-reference.md‎
Lines changed: 76 additions & 0 deletions b/‎doc/appendices/a-directive-reference.md‎
Lines changed: 76 additions & 0 deletions
diff --git a/‎doc/appendices/b-command-line-option-reference.md‎
Lines changed: 83 additions & 0 deletions b/‎doc/appendices/b-command-line-option-reference.md‎
Lines changed: 83 additions & 0 deletions
diff --git a/‎doc/appendices/c-bison-compatibility.md‎
Lines changed: 48 additions & 0 deletions b/‎doc/appendices/c-bison-compatibility.md‎
Lines changed: 48 additions & 0 deletions
@@ -1,58 +1,77 @@
 # Lrama
 
-[![Gem Version](https://badge.fury.io/rb/lrama.svg)](https://badge.fury.io/rb/lrama)
-[![build](https://github.com/ruby/lrama/actions/workflows/test.yaml/badge.svg)](https://github.com/ruby/lrama/actions/workflows/test.yaml)
+Lrama is a Ruby implementation of a LALR(1) parser generator. It reads a Bison-style grammar file, builds parser tables, and emits a C parser. Its primary goal is to support CRuby parser development, including Bison-compatible grammar files, error-tolerant parsing work, parameterized grammar rules, inlining, syntax diagrams, and a toolchain that can run as part of Ruby's build.
 
+## Quick Start
 
-## Overview
-
-Lrama is LALR (1) parser generator written by Ruby. The first goal of this project is providing error tolerant parser for CRuby with minimal changes on CRuby parse.y file.
-
-## Installation
-
-Lrama's installation is simple. You can install it via RubyGems.
+Install the released gem:
 
 ```shell
 $ gem install lrama
+$ lrama --version
 ```
 
-From source codes, you can install it as follows:
+From a checkout of this repository:
 
 ```shell
-$ cd "$(lrama root)"
 $ bundle install
-$ bundle exec rake install
-$ bundle exec lrama --version
-lrama 0.7.0
+$ bundle exec ruby exe/lrama --version
 ```
-## Usage
 
-Lrama is a command line tool. You can generate a parser from a grammar file by running `lrama` command.
+Generate and run the calculator sample:
 
 ```shell
-# "y.tab.c" and "y.tab.h" are generated
-$ lrama -d sample/parse.y
+$ bundle exec ruby exe/lrama -d sample/calc.y -o /tmp/calc.c
+$ gcc -Wall /tmp/calc.c -o /tmp/calc
+$ /tmp/calc
 ```
-Specify the output file with `-o` option. The following example generates "calc.c" and "calc.h".
+
+Generate a state report and a syntax diagram while developing a grammar:
 
 ```shell
-# "calc", "calc.c", and "calc.h" are generated
-$ lrama -d sample/calc.y -o calc.c && gcc -Wall calc.c -o calc && ./calc
-Enter the formula:
-1
-=> 1
-1+2*3
-=> 7
-(1+2)*3
-=> 9
+$ bundle exec ruby exe/lrama -v --report-file=/tmp/calc.output sample/calc.y
+$ bundle exec ruby exe/lrama --diagram=/tmp/calc.html sample/calc.y
 ```
 
-## Supported Ruby version
+## Manual
+
+Start with the introduction and examples if you are new to Lrama. Use the grammar, invocation, directive, and option references when you are maintaining an existing grammar.
+
+1. [Introduction](chapters/00-introduction.md)
+2. [Installation and Conditions](chapters/00-installation-and-conditions.md)
+3. [Concepts](chapters/01-concepts.md)
+4. [Examples](chapters/02-examples.md)
+5. [Grammar Files](chapters/03-grammar-files.md)
+6. [Parser C Interface](chapters/04-parser-interface.md)
+7. [Parser Algorithm](chapters/05-parser-algorithm.md)
+8. [Error Recovery and Error Tolerance](chapters/06-error-recovery.md)
+9. [Context Dependencies](chapters/07-context-dependencies.md)
+10. [Debugging Your Parser](chapters/08-debugging.md)
+11. [Invoking Lrama](chapters/09-invoking-lrama.md)
+12. [Generated Parser and Integration](chapters/10-generated-parser-and-integration.md)
+13. [History](chapters/11-history.md)
+14. [Version Compatibility](chapters/12-version-compatibility.md)
+15. [FAQ](chapters/13-faq.md)
+
+## Appendices
+
+- [Directive Reference](appendices/a-directive-reference.md)
+- [Command Line Option Reference](appendices/b-command-line-option-reference.md)
+- [Bison Compatibility](appendices/c-bison-compatibility.md)
+- [Standard Library](appendices/d-standard-library.md)
+- [Glossary](appendices/e-glossary.md)
+- [Troubleshooting](appendices/f-troubleshooting.md)
+- [License and Legal Notes](appendices/g-license-and-legal-notes.md)
+
+## Development Documents
+
+- [Profiling](development/profiling.md)
+- [Compressed state table](development/compressed_state_table/main.md)
 
-Lrama is executed with BASERUBY when building ruby from source code. Therefore Lrama needs to support BASERUBY, currently 3.1, or later version.
+## Supported Ruby Version
 
-This also requires Lrama to be able to run with only default gems because BASERUBY runs with `--disable=gems` option.
+Lrama is executed with BASERUBY when building Ruby from source. For that reason, Lrama must run on the BASERUBY version used by Ruby and must work with default gems only, because BASERUBY is executed with `--disable=gems`.
 
 ## License
 
-See [LEGAL.md](https://github.com/ruby/lrama/blob/master/LEGAL.md) file.
+See [LEGAL.md](../LEGAL.md) for the authoritative legal notice for this repository.
@@ -0,0 +1,76 @@
+# Directive Reference
+
+This table is based on the current grammar accepted by `parser.y`. "Recognized" means the parser accepts the syntax, but the current implementation has limited or no generated-code effect.
+
+| Directive | Syntax | Status | Notes |
+| --- | --- | --- | --- |
+| Prologue | `%{ ... %}` | Supported | Copied into generated C output. |
+| `%require` | `%require "VERSION"` | Recognized | Accepted as grammar syntax. With warnings enabled, Lrama reports that it currently does nothing. |
+| `%expect` | `%expect INTEGER` | Supported | Sets the expected conflict count. |
+| `%define` | `%define name value`, `%define name { value }` | Supported | Stored in grammar definitions. `lr.type=ielr` enables IELR computation; `parse.trace` enables parser debug parsing of the grammar file. |
+| `%param` | `%param {type name}` | Recognized | Accepted by `parser.y`; prefer `%lex-param` and `%parse-param` for generated parser parameters. |
+| `%lex-param` | `%lex-param {type name}` | Supported | Adds a parameter passed to `yylex`. |
+| `%parse-param` | `%parse-param {type name}` | Supported | Adds a parameter passed to `yyparse` and `yyerror`. |
+| `%code` | `%code ID { ... }` | Supported | Stores named code blocks for skeleton output. Common examples use `%code provides`. |
+| `%initial-action` | `%initial-action { ... }` | Supported | Stores initialization code for generated parser use. |
+| `%no-stdlib` | `%no-stdlib` | Supported | Prevents automatic loading of `lib/lrama/grammar/stdlib.y`. |
+| `%locations` | `%locations` | Supported | Enables location support. Location references also enable locations during grammar preparation. |
+| `%union` | `%union { ... }` | Supported | Defines `YYSTYPE` union members. |
+| `%destructor` | `%destructor { ... } symbols-or-tags` | Supported | Associates cleanup code with symbols or tags. |
+| `%printer` | `%printer { ... } symbols-or-tags` | Supported | Associates debug printing code with symbols or tags. |
+| `%error-token` | `%error-token { ... } symbols-or-tags` | Supported | Associates error-token code with symbols or tags. |
+| `%after-shift` | `%after-shift identifier` | Supported | Lrama-specific generated parser hook. |
+| `%before-reduce` | `%before-reduce identifier` | Supported | Lrama-specific generated parser hook. |
+| `%after-reduce` | `%after-reduce identifier` | Supported | Lrama-specific generated parser hook. |
+| `%after-shift-error-token` | `%after-shift-error-token identifier` | Supported | Lrama-specific generated parser hook. |
+| `%after-pop-stack` | `%after-pop-stack identifier` | Supported | Lrama-specific generated parser hook. |
+| `%token` | `%token [<tag>] NAME [NUMBER] ["alias"]` | Supported | Declares terminals. Multiple tag groups are accepted. |
+| `%type` | `%type <tag> symbol...` | Supported | Assigns semantic value tags. |
+| `%nterm` | `%nterm <tag> symbol...` | Supported | Assigns tags to nonterminals and errors if a terminal is redeclared as a nonterminal. |
+| `%left` | `%left [<tag>] token...` | Supported | Declares left associativity and precedence. |
+| `%right` | `%right [<tag>] token...` | Supported | Declares right associativity and precedence. |
+| `%nonassoc` | `%nonassoc [<tag>] token...` | Supported | Declares nonassociativity and precedence. |
+| `%precedence` | `%precedence [<tag>] token...` | Supported | Declares precedence without associativity. |
+| `%start` | `%start nonterminal` | Supported | Sets the start nonterminal. Multiple `%start` declarations are an error in Lrama. |
+| `%rule` | `%rule name(args): alternatives ;` | Supported | Defines a parameterized rule. |
+| `%rule %inline` | `%rule %inline name(args): alternatives ;` | Supported | Defines a parameterized rule expanded at use sites. |
+| Grammar separator | `%%` | Supported | Separates declarations, rules, and optional epilogue. |
+| `%empty` | `%empty` | Supported | Marks an empty alternative explicitly. |
+| `%prec` | `%prec symbol` | Supported | Overrides the rule precedence. Multiple `%prec` directives in one rule are an error. |
+
+## Rule References
+
+Semantic value references:
+
+| Form | Meaning |
+| --- | --- |
+| `$$` | Value of the left-hand side. |
+| `$1`, `$2` | Positional right-hand side values. |
+| `$name` | Named reference. |
+| `$[name.with-punctuation]` | Bracketed named reference. |
+| `$<tag>1`, `$<tag>$` | Explicit tag override. |
+
+Location references:
+
+| Form | Meaning |
+| --- | --- |
+| `@$` | Location of the left-hand side. |
+| `@1`, `@2` | Positional right-hand side locations. |
+| `@name` | Named location reference. |
+| `@[name.with-punctuation]` | Bracketed named location reference. |
+
+Index references:
+
+| Form | Meaning |
+| --- | --- |
+| `$:1`, `$:name` | Parser stack index reference used by Lrama-generated code. |
+| `$:$` | Parsed as a reference form but not supported by code generation. |
+
+## Parameterized Rule Forms
+
+| Form | Expansion target |
+| --- | --- |
+| `symbol?` | `option(symbol)` |
+| `symbol*` | `list(symbol)` |
+| `symbol+` | `nonempty_list(symbol)` |
+| `name(arg1, arg2)` | User-defined or standard-library parameterized rule. |
@@ -0,0 +1,83 @@
+# Command Line Option Reference
+
+This table is based on `lib/lrama/option_parser.rb`.
+
+| Option | Argument | Description | Example |
+| --- | --- | --- | --- |
+| `-S`, `--skeleton=FILE` | File | Use a skeleton other than the default `bison/yacc.c`. | `lrama -S template/bison/yacc.c grammar.y` |
+| `-t`, `--debug` | None | Display debugging output from Lrama's internal grammar parser. Equivalent to `-Dparse.trace`. | `lrama -t grammar.y` |
+| `--locations` | None | Enable location support. | `lrama --locations grammar.y` |
+| `-D`, `--define=NAME[=VALUE]` | Name or name/value | Similar to a `%define` declaration. | `lrama -Dlr.type=ielr grammar.y` |
+| `-H`, `--header=[FILE]` | Optional file | Generate a header, optionally at `FILE`. | `lrama -Hparser.h grammar.y` |
+| `-d` | None | Generate a header with a derived path. | `lrama -d -o parser.c grammar.y` |
+| `-r`, `--report=REPORTS` | Comma-separated words | Generate automaton details. | `lrama --report=states,itemsets grammar.y` |
+| `--report-file=FILE` | File | Write report output to `FILE`. | `lrama --report=all --report-file=parser.output grammar.y` |
+| `-o`, `--output=FILE` | File | Write generated C output to `FILE`. | `lrama -o parser.c grammar.y` |
+| `--trace=TRACES` | Comma-separated words | Write generation traces to standard error. | `lrama --trace=rules,actions grammar.y` |
+| `--diagram=[FILE]` | Optional file | Generate an HTML syntax diagram. Defaults to `diagram.html`. | `lrama --diagram=/tmp/grammar.html grammar.y` |
+| `--profile=PROFILES` | Comma-separated words | Profile parser generation. | `lrama --profile=call-stack grammar.y` |
+| `-v`, `--verbose` | None | Same as adding the `states` report. | `lrama -v grammar.y` |
+| `-W`, `--warnings` | None | Enable warnings. | `lrama -W grammar.y` |
+| `-e` | None | Enable error recovery support in generated output. | `lrama -e grammar.y` |
+| `-V`, `--version` | None | Print version and exit. | `lrama --version` |
+| `-h`, `--help` | None | Print help and exit. | `lrama --help` |
+
+## Report Keywords
+
+| Keyword | Meaning |
+| --- | --- |
+| `states` | Describe parser states. |
+| `itemsets` | Include complete item-set closures. |
+| `lookaheads` | Show lookahead tokens for reduce items. |
+| `solved` | Describe solved shift/reduce conflicts. |
+| `counterexamples`, `cex` | Generate conflict counterexamples. |
+| `rules` | List unused rules. |
+| `terms` | List unused terminals. |
+| `verbose` | Include detailed internal state and analysis information. |
+| `all` | Enable all report keywords above. |
+| `none` | Disable reports. |
+
+The default report option set contains the grammar report internally. A report file is written when `--report`, `-v`, or `--report-file` causes a report path to be present.
+
+## Trace Keywords
+
+| Keyword | Meaning |
+| --- | --- |
+| `automaton` | Trace states. |
+| `closure` | Trace item-set closure computation. |
+| `rules` | Trace grammar rules. |
+| `only-explicit-rules` | Trace only rules written explicitly in the grammar. |
+| `actions` | Trace grammar rules with actions. |
+| `time` | Trace generation time. |
+| `all` | Enable all supported trace keywords except `only-explicit-rules`. |
+| `none` | Disable traces. |
+
+The validator knows additional Bison trace names, but only the keywords listed above are supported by the current Lrama tracer.
+
+## Profile Keywords
+
+| Keyword | Meaning |
+| --- | --- |
+| `call-stack` | Use the sampling call-stack profiler. |
+| `memory` | Use the memory profiler. |
+
+## Defaults And Derived Paths
+
+| Setting | Default |
+| --- | --- |
+| Output file | `y.tab.c` |
+| Skeleton | `bison/yacc.c` |
+| Diagram file | `diagram.html` |
+| Header path with `-d -o parser.c` | `parser.h` |
+| Header path with `-d grammar.y` | `grammar.h` |
+| Report path with `--report=all grammar.y` | `grammar.output` |
+
+## STDIN Mode
+
+Use:
+
+```shell
+$ lrama [options] - FILE
+```
+
+Lrama reads grammar text from standard input and uses `FILE` for diagnostics and derived paths.
@@ -0,0 +1,48 @@
+# Bison Compatibility
+
+Lrama is Bison-style, not Bison-complete. This matrix records practical compatibility for the current repository.
+
+| Bison feature or area | Lrama status | Notes |
+| --- | --- | --- |
+| `.y` grammar layout | Supported | Prologue, declarations, rules, and epilogue are supported. |
+| C LALR(1) parser generation | Supported | This is Lrama's main output mode. |
+| `%token`, `%type`, `%nterm`, `%union` | Supported | See the directive reference for syntax details. |
+| Precedence and associativity | Supported | `%left`, `%right`, `%nonassoc`, `%precedence`, and `%prec` are accepted. |
+| `%start` | Supported with difference | Multiple `%start` declarations are an error in Lrama. |
+| Semantic actions | Supported | Includes positional, named, bracketed named, and explicit-tag references. |
+| Location references | Supported | `@` references enable location support during grammar preparation. |
+| `%destructor` and `%printer` | Supported | Used by generated C output and debug support. |
+| `%require` | Recognized only | Accepted, but warnings state that it currently has no effect. |
+| `%param` | Recognized with limited effect | Use `%lex-param` and `%parse-param` for generated parser parameters. |
+| `%lex-param`, `%parse-param` | Supported | Parameters are passed to generated parser functions. |
+| IELR | Supported when requested | Use `%define lr.type ielr`; covered by integration fixtures. |
+| GLR | Not currently provided | No GLR skeleton or parser mode is provided. |
+| Push parser | Not currently provided | The generated C skeleton is pull-parser oriented. |
+| LAC | Not currently provided | The README records the LAC branch as disabled in the compatibility assumptions. |
+| C++/D/Java backends | Not currently provided | Current repository templates target C output. |
+| Parameterized rules | Lrama extension | `%rule`, suffixes, and `stdlib.y` helpers are Lrama grammar features. |
+| `%inline` parameterized rules | Lrama extension | Rules are expanded before parser states are finalized. |
+| Syntax diagrams | Lrama extension | Generated with `--diagram`. |
+| Error-tolerant parser support | Lrama extension | Enabled with `-e` and related grammar support. |
+| Bison manual text | Not reused | Documentation should explain Lrama behavior in original wording. |
+
+## Compatibility Guidance
+
+When porting a Bison grammar:
+
+1. Generate reports with both tools if possible.
+2. Check unsupported directives before changing parser behavior.
+3. Compile and run generated parser tests.
+4. Confirm conflict counts with `%expect`.
+5. Avoid documenting future or unmerged behavior as supported.
+
+## README Compatibility Assumptions
+
+The README records several Bison template compatibility assumptions:
+
+- `b4_locations_if` is always true.
+- `b4_pure_if` is always true.
+- `b4_pull_if` is always false.
+- `b4_lac_if` is always false.
+
+These are implementation compatibility notes for Lrama's Bison-style template layer. They should not be read as a promise that every Bison command-line option or skeleton branch exists in Lrama.