Skip to content

chore: add changeset and update docs for configurable PII types#471

Open
yjouini wants to merge 3 commits into
dataiku:mainfrom
yjouini:pr/docs-changeset
Open

chore: add changeset and update docs for configurable PII types#471
yjouini wants to merge 3 commits into
dataiku:mainfrom
yjouini:pr/docs-changeset

Conversation

@yjouini
Copy link
Copy Markdown
Contributor

@yjouini yjouini commented May 27, 2026

Closes #458, Closes #459

Follow-up to #470 (main feature PR), adds the changeset and updates docs/06-chrome-extension.md to document the new PII entity types and custom patterns configuration options available in the extension settings.

@hanneshapke
Copy link
Copy Markdown
Collaborator

Tiny code example on how to read the list of labels and unify it.

import json
from urllib.request import urlopen

url = 'https://raw.githubusercontent.com/dataiku/kiji-proxy/main/model/quantized/label_mappings.json'
with urlopen(url) as r:
    m = json.load(r)
labels = m['pii']['id2label'].values()
entities = {l.split('-', 1)[1] for l in labels if l.startswith(('B-', 'I-'))}
print(sorted(e.title() for e in entities))

@yjouini yjouini force-pushed the pr/docs-changeset branch from f3ad2f3 to 7b8c5a3 Compare May 28, 2026 18:43
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

Allow user-defined custom regex patterns from the extension UI Make detected entity types configurable in the extension

2 participants