PS-10995 [DOCS] -[feedback] PS 8.4 Innodb index creation

patrickbirch · patrickbirch · commit fe8f06df1677 · 2026-04-01T06:13:32.000-05:00
modified:   docs/innodb-expanded-fast-index-creation.md
diff --git a/docs/extended-mysqldump.md b/docs/extended-mysqldump.md
@@ -17,6 +17,16 @@ More information can be found in [Backup Locks](backup-locks.md).
 More information can be found in
 [Compressed columns with dictionaries](compressed-columns.md).
 
+## `InnoDB` secondary keys and `--innodb-optimize-keys`
+
+For *InnoDB* tables, `--innodb-optimize-keys` omits secondary keys (and related
+constraints) from the initial `CREATE TABLE` in the dump and adds them in a
+follow-up `ALTER TABLE` after the data is loaded. That pattern works well when
+the target server can build those indexes using [expanded fast index
+creation](innodb-expanded-fast-index-creation.md). See that page for
+limitations (foreign keys, partitioned tables, `AUTO_INCREMENT`, implicit
+primary keys, and others) and for the `expand_fast_index_creation` variable.
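+
+As a rough sketch of that dump shape (the table, column, and key names here are
+hypothetical, and real `mysqldump` output contains more statements and comments):
+
+```sql
+-- 1. The table is created with the primary key only
+CREATE TABLE `orders` (
+  `id` BIGINT NOT NULL,
+  `customer_id` BIGINT NOT NULL,
+  `created_at` DATETIME NOT NULL,
+  PRIMARY KEY (`id`)
+) ENGINE=InnoDB;
+
+-- 2. Data is loaded while only the clustered index exists
+INSERT INTO `orders` VALUES (1, 42, '2026-01-01 00:00:00');
+
+-- 3. The omitted secondary keys are added afterward in one statement
+ALTER TABLE `orders`
+  ADD KEY `idx_customer` (`customer_id`),
+  ADD KEY `idx_created` (`created_at`);
+```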
+
 ## Taking backup by descending primary key order
 
 –order-by-primary-desc tells `mysqldump` to take the backup by
diff --git a/docs/innodb-expanded-fast-index-creation.md b/docs/innodb-expanded-fast-index-creation.md
@@ -1,19 +1,144 @@
 # Expanded fast index creation
 
-Percona has implemented several changes related to *MySQL*’s fast index creation
-feature. Fast index creation was implemented in *MySQL* as a way to speed up the
-process of adding or dropping indexes on tables with many rows.
+## What fast index creation is
 
-This feature implements a session variable that enables extended fast index
-creation. Besides optimizing DDL directly,
-[expand_fast_index_creation](#expanded-fast-index-creation) may also optimize index access for
-subsequent DML statements because using it results in much less fragmented
-indexes.
+In *InnoDB*, secondary indexes are separate B-tree structures from the clustered
+index (the primary key). When the server creates a new secondary index on
+an existing table, it can do so in two conceptually different ways:
 
-## The **mysqldump** command
+1. Row-by-row maintenance — For each row, insert that row’s entries into the
+   new secondary index as you go. Those inserts arrive in primary-key order, not
+   in secondary-key order, so the growing B-tree suffers many random-looking
+   page splits and a large amount of write amplification. If the server is
+   also copying the table (rebuild `ALTER TABLE`), the same pattern applies:
+   every row copied into the new table must update every secondary index
+   immediately.
 
-A new option, `--innodb-optimize-keys`, was implemented in **mysqldump**. It
-changes the way *InnoDB* tables are dumped, so that secondary and foreign keys
+2. Fast index creation (sorted / bulk build) — The server scans the table’s
+   clustered index in primary-key order, generates secondary-key tuples, sorts
+   them (often using external merge sort), and builds the secondary index from
+   that ordered stream. Work is staged in temporary files under the configured
+   `tmpdir`, then merged into a compact B-tree. That path avoids the worst
+   random-insert behavior of the row-by-row approach and usually completes with
+   less I/O and a less fragmented index.
+
+In short, fast index creation builds the secondary index from a sorted
+stream after reading the table (or a copy) in clustered index order, instead of
+growing the index by arbitrary-order inserts during the same phase as the data
+copy.
+
+Dropping an index is already a cheap metadata change in many cases; the
+performance win is dominated by creating indexes on large tables.
+
+## How this differs from Oracle MySQL {{vers}}
+
+The figure below contrasts a typical Oracle MySQL workflow with Percona Server
+when expanded fast index creation is in use (including
+`mysqldump --innodb-optimize-keys` during restore).
+
+![Comparison of standard MySQL and Percona Server with expanded fast index creation for backup restore and copy-style ALTER TABLE or OPTIMIZE TABLE](_static/expand-fast-creation.png)
+
+The following compares Percona Server for MySQL with Oracle MySQL {{vers}} on
+code paths that still rebuild the table. InnoDB classifies each `ALTER TABLE`
+operation by algorithm (`INSTANT`, `INPLACE`, `COPY`, and so on); those
+classifications can change between releases, so treat the [InnoDB online DDL
+documentation :octicons-link-external-16:](https://dev.mysql.com/doc/refman/{{vers}}/en/innodb-online-ddl.html)
+as authoritative for whether a specific statement performs a copy in your
+version.
+
+Upstream *MySQL* (*InnoDB*) already uses a sorted, bulk-style path when it
+adds a secondary index in operations that are implemented as “add index
+only” (for example, some `CREATE INDEX` / `ALTER TABLE ... ADD INDEX` flows
+that do not rebuild the whole table).
+
+Where Oracle MySQL still does a full table rebuild (copy algorithm —
+for example many `ALTER TABLE` changes that force a new table), rows are
+inserted into the new copy while all secondary indexes are live. Each
+insert must update every non-primary index at once. Even if the server later
+uses efficient mechanics for individual index builds, interleaving those
+updates with the copy keeps more indexes “hot” for the whole copy and tends to
+produce heavier random I/O and more fragmented trees than deferring secondary
+index creation until the clustered data is complete.
+
+Percona Server for MySQL extends the sorted-build path to these rebuilds with
+expanded fast index creation (controlled by
+[`expand_fast_index_creation`](#expanded-fast-index-creation)): on rebuild-style
+`ALTER TABLE` / `OPTIMIZE TABLE`, eligible non-unique secondary indexes are
+dropped for the copy phase and recreated afterward using the fast sorted-build
+path on the finished table. The copy phase then maintains only what *InnoDB*
+requires for the clustered index (and any indexes that cannot be deferred),
+which is the main difference from Oracle MySQL on the same code paths.
+
+Oracle MySQL {{vers}} can apply `INSTANT` or in-place (`INPLACE`)
+DDL to many `ALTER TABLE` operations so the server avoids a full table copy or
+keeps work inside the existing *InnoDB* file. That path is separate from the
+rebuild logic `expand_fast_index_creation` augments; there is no interaction to
+“tune” for those statements.
+
+## When this optimization applies
+
+### `INSTANT`, `INPLACE`, and why this variable usually does not matter for them
+
+If an `ALTER TABLE` runs as `INSTANT` (for example, adding a nullable column at
+the end of the table when supported) or as an online in-place operation that
+does not rebuild the whole table, the server does not perform the full table
+copy that this feature optimizes. In those cases
+`expand_fast_index_creation` has no effect: the row-by-row secondary index
+maintenance it replaces never runs.
+
+### When `expand_fast_index_creation` helps
+
+`expand_fast_index_creation` is most beneficial when the operation requires a
+table copy—for example changing a column’s data type in a way that forces a
+rebuild, or other alters classified with the copy algorithm. On that path,
+Percona Server intercepts the copy so eligible non-unique secondary indexes are
+rebuilt with the sorted temporary-file workflow instead of being maintained on
+every inserted row during the copy.
+
+Expanded fast index creation only affects statements that rebuild the table and
+copy rows into a new *InnoDB* table. Typical cases include:
+
+* `OPTIMIZE TABLE` on an *InnoDB* table (internally `ALTER TABLE ... ENGINE=InnoDB`)
+* `ALTER TABLE` operations that the server implements with a table rebuild and
+  the copy algorithm, as listed in the [InnoDB online DDL operations
+  table :octicons-link-external-16:](https://dev.mysql.com/doc/refman/{{vers}}/en/innodb-online-ddl-operations.html)
+* An `ALTER TABLE` where you explicitly request `ALGORITHM=COPY` (when that
+  algorithm is permitted for the operation)
+
+Routine schema changes that stay on `INSTANT` or `INPLACE` never enter this path
+and are unaffected by `expand_fast_index_creation`.
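+
+A minimal sketch of opting one rebuild into the optimization, assuming a
+hypothetical table `mydb.orders` with an `INT` column `quantity`. Whether a
+given change really uses the copy algorithm depends on your version.
+
+```sql
+-- Enable the optimization for this session only
+SET SESSION expand_fast_index_creation = ON;
+
+-- Request the copy algorithm explicitly so the rebuild path is used
+ALTER TABLE mydb.orders
+  MODIFY quantity BIGINT NOT NULL,
+  ALGORITHM=COPY;
+
+-- Restore the default for the rest of the session
+SET SESSION expand_fast_index_creation = OFF;
+```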
+
+## Verify and monitor
+
+* Check whether the feature is enabled:
+
+    ```sql
+    SHOW VARIABLES LIKE 'expand_fast_index_creation';
+    ```
+
+    In Percona Server for MySQL {{vers}} the default is `OFF`. Enable it for a
+    session or globally before running DDL, for example
+    `SET SESSION expand_fast_index_creation = ON;`.
+
+* To see how *MySQL* classifies a specific `ALTER TABLE`, use the online DDL
+  documentation for your version (linked [above](#when-this-optimization-applies)).
+  There is no single `EXPLAIN` for DDL; classification is per operation and
+  version.
+
+* [`tmpdir`](https://dev.mysql.com/doc/refman/{{vers}}/en/server-system-variables.html#sysvar_tmpdir)
+  free space is the usual operational bottleneck; see
+  [Limitations](#limitations) for how large it must be and what happens when it
+  is exhausted.
+
+Besides shortening DDL directly,
+[`expand_fast_index_creation`](#expanded-fast-index-creation) may also help
+subsequent DML because indexes built in one sorted pass are often less
+fragmented than those maintained incrementally through a long copy.
+
+## The mysqldump command
+
+The `--innodb-optimize-keys` option changes the way *InnoDB* tables are dumped,
+so that secondary and foreign keys
 are created after loading the data, thus taking advantage of fast index
 creation. More specifically:
 
@@ -25,7 +150,7 @@ create the previously omitted keys.
 
 ## `ALTER TABLE`
 
-When `ALTER TABLE` requires a table copy, secondary keys are now dropped and
+When `ALTER TABLE` requires a table copy, secondary keys are dropped and
 recreated later, after copying the data. The following restrictions apply:
 
 * Only non-unique keys can be involved in this optimization.
@@ -39,16 +164,42 @@ keys.
 ## `OPTIMIZE TABLE`
 
 Internally, `OPTIMIZE TABLE` is mapped to `ALTER TABLE ... ENGINE=innodb`
-for *InnoDB* tables. As a consequence, it now also benefits from fast index
-creation, with the same restrictions as for `ALTER TABLE`.
+for *InnoDB* tables. As a consequence, it also benefits from fast index
+creation when `expand_fast_index_creation` is enabled and the optimization
+applies, with the same restrictions as for `ALTER TABLE`.
+
+## Limitations
+
+!!! warning "`tmpdir` free space — the most common failure"
+
+    In practice, the usual reason expanded fast index creation fails is
+    running out of disk space on the filesystem used for
+    [`tmpdir`](https://dev.mysql.com/doc/refman/{{vers}}/en/server-system-variables.html#sysvar_tmpdir)
+    (often the same mount as `/tmp`).
 
-## Caveats
+    With this optimization enabled, the server does more than make index
+    maintenance cheaper in memory: it materializes each secondary index in
+    temporary files (sorted runs and merge passes) and only then merges the
+    result into the final *InnoDB* index. That can consume far more transient
+    space than a rough “indexes fit in the tablespace” estimate suggests.
 
-*InnoDB* fast index creation uses temporary files in tmpdir for all indexes
-being created. So make sure you have enough tmpdir space when using
-[expand_fast_index_creation](#expanded-fast-index-creation). It is a session variable, so you can
-temporarily switch it off if you are short on tmpdir space and/or don’t want
-this optimization to be used for a specific table.
+    Size the filesystem using the secondary index footprint you are
+    rebuilding, not the primary table size alone. You typically need
+    well above the on-disk size of those secondary indexes as free
+    space under `tmpdir`, on top of anything else the same `ALTER TABLE` or
+    `OPTIMIZE TABLE` already needs. For example, a table with about 500 GB
+    of data and about 200 GB of secondary indexes may still require
+    significantly more than 200 GB of free `tmpdir` space while those
+    indexes are being built.
+
+    If `tmpdir` fills during the operation, the statement fails and rolls
+    back. You lose the work done up to that point and must free or enlarge
+    storage (or point `tmpdir` at a larger volume), or run with
+    `expand_fast_index_creation` disabled for that job, before retrying.
+
+[`expand_fast_index_creation`](#expanded-fast-index-creation) is a session or
+global variable: you can set it to `OFF` for a single session if `tmpdir` is
+too small for a specific table or maintenance window.
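+
+To get a starting figure for that sizing, one option is to read the persistent
+index statistics. This is a sketch: `mydb` and `orders` are hypothetical names,
+the values are estimates refreshed by `ANALYZE TABLE`, and the result includes
+unique indexes that this optimization does not defer.
+
+```sql
+-- Where the temporary sort files will be written
+SELECT @@tmpdir;
+
+-- Approximate on-disk size of each secondary index, in GB
+SELECT index_name,
+       ROUND(stat_value * @@innodb_page_size / POW(1024, 3), 2) AS size_gb
+FROM mysql.innodb_index_stats
+WHERE database_name = 'mydb'
+  AND table_name = 'orders'
+  AND stat_name = 'size'
+  AND index_name NOT IN ('PRIMARY', 'GEN_CLUST_INDEX');
+```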
 
 There’s also a number of cases when this optimization is not applicable:
 
@@ -62,16 +213,16 @@ dropping keys that are part of a FOREIGN KEY constraint;
 * `ALTER TABLE` and `OPTIMIZE TABLE` always process partitioned tables as if
 [expand_fast_index_creation](#expanded-fast-index-creation) is OFF;
 
-* **mysqldump --innodb-optimize-keys** ignores foreign keys because
+* `mysqldump --innodb-optimize-keys` ignores foreign keys because
 *InnoDB* requires a full table rebuild on foreign key changes. So adding them
 back with a separate `ALTER TABLE` after restoring the data from a dump
 would actually make the restore slower;
 
-* **mysqldump --innodb-optimize-keys** ignores indexes on
+* `mysqldump --innodb-optimize-keys` ignores indexes on
 `AUTO_INCREMENT` columns, because they must be indexed, so it is impossible
 to temporarily drop the corresponding index;
 
-* **mysqldump --innodb-optimize-keys** ignores the first UNIQUE index on
+* mysqldump --innodb-optimize-keys ignores the first UNIQUE index on
 non-nullable columns when the table has no `PRIMARY KEY` defined, because in
 this case *InnoDB* picks such an index as the clustered one.
 
@@ -86,12 +237,24 @@ this case *InnoDB* picks such an index as the clustered one.
 | Scope:         | Local/Global       |
 | Dynamic:       | Yes                |
 | Data type      | Boolean            |
-| Default value  | ON/OFF             |
+| Default value  | OFF                |
+
+When set to `ON`, *InnoDB* may drop eligible non-unique secondary indexes for the
+data-copy phase of rebuild-style `ALTER TABLE` and `OPTIMIZE TABLE`, then
+recreate them with the sorted bulk build described [above](#what-fast-index-creation-is).
+
+## Related documentation
 
-!!! admonition "See also"
+### In this manual
 
-    [Improved InnoDB fast index creation :octicons-link-external-16:](https://www.mysqlperformanceblog.com/2011/11/06/improved-innodb-fast-index-creation/)
+* [Percona Server for MySQL feature comparison](feature-comparison.md) — how this capability compares to MySQL {{vers}}
+* [Percona Server for MySQL variables](percona-server-system-variables.md) — full list of Percona-specific system variables, including `expand_fast_index_creation`
+* [Extended mysqldump](extended-mysqldump.md) — Percona `mysqldump` enhancements, including `--innodb-optimize-keys`
+* [InnoDB page fragmentation counters](innodb-fragmentation-count.md) — monitoring index fragmentation
 
-    [Thinking about running OPTIMIZE on your InnoDB Table? Stop! :octicons-link-external-16:](https://www.mysqlperformanceblog.com/2010/12/09/thinking-about-running-optimize-on-your-innodb-table-stop/) 
+### MySQL Reference Manual
 
+* [InnoDB and online DDL :octicons-link-external-16:](https://dev.mysql.com/doc/refman/{{vers}}/en/innodb-online-ddl.html)
+* [InnoDB online DDL operations :octicons-link-external-16:](https://dev.mysql.com/doc/refman/{{vers}}/en/innodb-online-ddl-operations.html)
+* [`tmpdir` system variable :octicons-link-external-16:](https://dev.mysql.com/doc/refman/{{vers}}/en/server-system-variables.html#sysvar_tmpdir)