Commit 892ddfe
Fix template text extraction for Lang, Native name, and Nihongo templates (#828)
## Problem
Templates like `{{lang|nap|Abbrùzzu}}` and `{{Nihongo2|東京都}}` in Wikipedia infoboxes
were not being extracted, resulting in missing text content in DBpedia.
## Root Cause
The `Lang` template was configured to extract parameter 3, but `{{lang}}` only has 2 parameters.
Additionally, `Native name`, `Nihongo`, and `Nihongo2` templates were not configured.
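To illustrate why asking for parameter 3 yields nothing, here is a minimal Python sketch. It is not the extraction framework's parser: `positional_params` and `param` are hypothetical helpers that assume a flat template call with no nested templates or named parameters.

```python
from typing import List


def positional_params(template: str) -> List[str]:
    """Split a flat call such as '{{lang|nap|Abbrùzzu}}' into its positional
    parameters. The template name itself is not included."""
    body = template.strip()
    if not (body.startswith("{{") and body.endswith("}}")):
        raise ValueError("not a template call")
    parts = body[2:-2].split("|")
    return [p.strip() for p in parts[1:]]


def param(template: str, index: int) -> str:
    """Return the 1-based positional parameter, or '' when it does not exist."""
    params = positional_params(template)
    return params[index - 1] if 1 <= index <= len(params) else ""


call = "{{lang|nap|Abbrùzzu}}"
print(repr(param(call, 3)))  # '' : {{lang}} has only two parameters
print(repr(param(call, 2)))  # 'Abbrùzzu'
```

With only a language code and the text as parameters, index 3 is out of range, which matches the empty output described above.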
## Fix
Updated `core/src/main/resources/templatetransform.json`:
- **Lang**: Extract param 2 (was incorrectly param 3)
- **Native name|native_name**: Added - extracts param 2
- **Nihongo2**: Added - extracts param 1
- **Nihongo**: Added - extracts param 2
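The mapping above can be sketched in Python as a lookup from template name to the parameter that carries the display text. This is an illustration only: `PARAM_TO_EXTRACT` and `extract_text` are hypothetical names, the dictionary does not reproduce the actual `templatetransform.json` schema, and lowercasing the whole template name is a simplification.

```python
# Hypothetical mapping: template name (lowercased) -> 1-based parameter to keep.
# The indices mirror the list above; the structure is NOT the real JSON schema.
PARAM_TO_EXTRACT = {
    "lang": 2,         # {{lang|<code>|<text>}}
    "native name": 2,  # {{native name|<code>|<text>}}
    "native_name": 2,  # alias of the above
    "nihongo2": 1,     # {{Nihongo2|<text>}}
    "nihongo": 2,      # {{Nihongo|<english>|<text>|...}}
}


def extract_text(template: str) -> str:
    """Return the configured parameter of a flat template call, or ''."""
    body = template.strip()
    if not (body.startswith("{{") and body.endswith("}}")):
        return ""
    # First segment is the template name, the rest are positional parameters.
    name, *params = [part.strip() for part in body[2:-2].split("|")]
    index = PARAM_TO_EXTRACT.get(name.lower())
    if index is None or index > len(params):
        return ""
    return params[index - 1]


for call in ("{{lang|nap|Abbrùzzu}}", "{{Nihongo2|東京都}}"):
    print(call, "->", extract_text(call))
```

Unconfigured templates return an empty string in this sketch, which mirrors the pre-fix behaviour for `Native name`, `Nihongo`, and `Nihongo2`.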
## Examples
| Template | Before | After |
|----------|--------|-------|
| `{{lang\|nap\|Abbrùzzu}}` | *(empty)* | Abbrùzzu |
| `{{Nihongo2\|東京都}}` | *(empty)* | 東京都 |

1 parent 9e66903 commit 892ddfe
2 files changed
Lines changed: 60 additions & 1 deletion
File tree
- core/src
- main/resources
- test/scala/org/dbpedia/extraction/wikiparser
Diff body not captured. Line numbering shows original line 33 replaced by new lines 33-57 (25 additions, 1 deletion).
Lines changed: 35 additions & 0 deletions
Diff body not captured. Line numbering shows new lines 56-90 inserted after original line 55.