BUG: Chinese characters are garbled in all import/export functionalities (UTF-8 encoding issue) #25764

@junshiwang

Affected module
UI / Backend
(All import and export functionalities)

Describe the bug
Chinese characters are corrupted (garbled text) in all import and export functionalities in OpenMetadata.
The issue is not limited to lineage export; it affects every feature that exports or imports metadata files (e.g. CSV).

Any metadata containing Chinese characters (table names, column names, descriptions, tags, glossary terms, etc.) becomes unreadable after export, or is incorrectly parsed after import.

To Reproduce

1. Prepare metadata containing Chinese characters (e.g. table name, column name, description, glossary term).
2. Perform any of the following operations:
   - Export lineage
   - Export metadata (tables / columns / glossary / tags, etc.)
   - Import metadata using CSV
3. Open the exported file or review the imported result.
4. Observe that Chinese characters appear as garbled text.

This issue is consistently reproducible across different import/export features.
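
The report does not say which encoding is being misapplied, so the following is only a minimal Python sketch of how this kind of corruption typically arises: UTF-8 bytes decoded with a single-byte codec (Latin-1 is assumed here purely for illustration), or text forced through an encoding that cannot represent Chinese characters. The sample value and function names are hypothetical and not taken from OpenMetadata.

```python
# Hypothetical sample value; not taken from any real OpenMetadata instance.
SAMPLE = "客户表"


def garble(text: str, wrong_codec: str = "latin-1") -> str:
    """Encode as UTF-8, then decode with the wrong codec (assumed failure mode)."""
    return text.encode("utf-8").decode(wrong_codec)


def ungarble(text: str, wrong_codec: str = "latin-1") -> str:
    """Reverse the mistake; this works because Latin-1 maps every byte value."""
    return text.encode(wrong_codec).decode("utf-8")


def lossy(text: str) -> str:
    """Force text through ASCII; characters it cannot encode become '?'."""
    return text.encode("ascii", errors="replace").decode("ascii")


if __name__ == "__main__":
    broken = garble(SAMPLE)
    print(repr(broken))                # several junk characters per Chinese character
    print(ungarble(broken) == SAMPLE)  # the original text is still recoverable
    print(lossy(SAMPLE))               # the original text is gone
```

If an exported file shows several junk characters per Chinese character, the first failure mode is the more likely one and the data is still recoverable. If it shows question marks, the characters were lost at encoding time and cannot be restored from the file.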

Expected behavior
All import and export functionalities in OpenMetadata should fully support UTF-8 encoding.
Chinese characters should remain intact and readable:

- After exporting metadata files
- After importing metadata files back into OpenMetadata

No manual encoding conversion should be required.

Version:

OpenMetadata version: 1.11.3

Additional context
This appears to be a global encoding issue across OpenMetadata import/export pipelines:

- Exported files may not be written explicitly as UTF-8
- CSV / Excel exports may be missing a UTF-8 BOM, causing Excel (Windows) to misinterpret the encoding
- HTTP response headers may not include charset=utf-8
- Import logic may rely on the system default encoding instead of enforcing UTF-8
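
To make the suspected causes concrete, here is a minimal Python sketch of what a centralized fix might look like. It is not OpenMetadata's actual code: the function names and the `CSV_CONTENT_TYPE` constant are hypothetical. It assumes that writing a UTF-8 BOM is acceptable for exported CSV files and that imports should accept files with or without one.

```python
import csv
import io

# Hypothetical constant: a Content-Type value that states the charset explicitly.
CSV_CONTENT_TYPE = "text/csv; charset=utf-8"


def export_csv_bytes(rows):
    """Serialize rows to CSV and encode as UTF-8 with a leading BOM."""
    buf = io.StringIO(newline="")  # newline="" lets the csv module control line endings
    csv.writer(buf).writerows(rows)
    # "utf-8-sig" prepends the BOM (EF BB BF) when encoding.
    return buf.getvalue().encode("utf-8-sig")


def import_csv_bytes(data):
    """Decode uploaded bytes as UTF-8, never the platform default encoding."""
    # "utf-8-sig" strips a leading BOM if present and accepts plain UTF-8 too.
    text = data.decode("utf-8-sig")
    return list(csv.reader(io.StringIO(text, newline="")))


if __name__ == "__main__":
    rows = [["name", "description"], ["客户表", "客户主数据"]]
    payload = export_csv_bytes(rows)
    print(payload[:3])                       # the BOM bytes
    print(import_csv_bytes(payload) == rows)  # round trip preserves the text
```

Keeping the encoding decision in one pair of helpers means every export and import path inherits the same behavior, which matches the report's suggestion of a framework-level fix rather than per-feature patches.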

Because the problem affects all import and export features, it likely requires a centralized fix in the export/import framework rather than a single feature-specific patch.
