Process FHIR extensions

Generate parquet data for FHIR extensions

1. Prepare the customized schema templates

For each resource types, e.g. Patient, Observation, you need to prepare a liquid template and a related JSON schema file.

The liquid template will be used to convert the raw FHIR JSON data into target structure.
The JSON schema will be used to generate the Parquet schema and create table definitions.

Follow the links for more information about liquid template and JSON schema.

Below are the example liquid template and JSON schema for "Patient" resource, flatten the Patient birthPlace extension for analytics. You can refer to this sample template that handles more complex extension types in FHIR resources.

Liquid template:

{% validate "Schema/Patient.schema.json" -%}
{
"resourceType": "{{ msg.resourceType }}",
"id": "{{ msg.id }}",

{% assign birth_place_extension = msg["extension"] | where: "url", "http://hl7.org/fhir/StructureDefinition/patient-birthPlace" | first -%}
"birthPlaceExtension.valueAddress.city": "{{ birth_place_extension.valueAddress.city }}",
"birthPlaceExtension.valueAddress.state": "{{ birth_place_extension.valueAddress.state }}",
"birthPlaceExtension.valueAddress.country": "{{ birth_place_extension.valueAddress.country }}",
"birthPlaceExtension.valueAddress.postalCode": "{{ birth_place_extension.valueAddress.postalCode }}",
}
{% endvalidate -%}

JSON schema file:

{
    "title": "Patient customized schema",
    "type": "object",
    "properties": {
        "resourceType": { "type": "string" },
        "id": { "type": "string" },
	"birthPlaceExtension.valueAddress.city": { "type": "string" },
	"birthPlaceExtension.valueAddress.state": { "type": "string" },
	"birthPlaceExtension.valueAddress.country": { "type": "string" },
	"birthPlaceExtension.valueAddress.postalCode": { "type": "string" }
    },
    "required": [ "id" ]
}

Note:

The JSON schema files must be saved at Schema directory in the image.
The JSON schema files need to have suffix ".schema.json".
The liquid templates and JSON schema files name should be aligned with resource type name.

E.g. "Patient.schema.json" and "Patient.liquid" be used to process the "Patient" resource.
The "validate" tag in template is optional and we recommend using it in your liquid template.
We internally leverage the FHIR-Converter to convert the raw FHIR data. You can test your templates and schema files with it before deploying the analytics pipeline.

2. Push the customized schema to Azure Container Registry

Refer here to push the prepared schema to Azure Container Registry, later we will use the schema image reference from the Container Registry to deploy the analytics pipeline.

3. Deploy the Analytics pipeline with customized schema enabled

Use the button below to deploy the pipeline with ARM template through the Azure Portal.

Set the parameter “Customized Schema” as “true”, and “Customized Schema Image Reference” as the reference of image where your schema is stored.

After deploying the pipeline, the Container App agent will try to pull the customized schema from the given image reference.

4. Provide access of the Container Registry to the Azure Container App

Go to the Azure Container Registry instance, assign “AcrPull” role to principal account of your created Azure Container App.

5. Verify customized data on the Storage

Customized data will be generated to “{resource type}_Customized” folder on the Storage.

Example customized patient Parquet data on the Storage

Query customized data in Synapse Serverless SQL pool

1. Run the Powershell script to create the tables and views with customized schema image

Browse to the scripts folder under this path (..\FhirToDataLake\scripts).

Run the following PowerShell script.

./Set-SynapseEnvironment.ps1 -SynapseWorkspaceName "Synapse name" -StorageName "Storage name" -CustomizedSchemaImage "Schema image reference"

Example:

./Set-SynapseEnvironment.ps1 -SynapseWorkspaceName examplesynapse -StorageName examplestorage -CustomizedSchemaImage exampleacr.azurecr.io/customizedtemplate:extensiontemplates

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Process FHIR extensions

Generate parquet data for FHIR extensions

1. Prepare the customized schema templates

2. Push the customized schema to Azure Container Registry

3. Deploy the Analytics pipeline with customized schema enabled

4. Provide access of the Container Registry to the Azure Container App

5. Verify customized data on the Storage

Query customized data in Synapse Serverless SQL pool

1. Run the Powershell script to create the tables and views with customized schema image

2. Query customized data in Synapse SQL pool

FilesExpand file tree

Process FHIR extensions.md

Latest commit

History

Process FHIR extensions.md

File metadata and controls

Process FHIR extensions

Generate parquet data for FHIR extensions

1. Prepare the customized schema templates

2. Push the customized schema to Azure Container Registry

3. Deploy the Analytics pipeline with customized schema enabled

4. Provide access of the Container Registry to the Azure Container App

5. Verify customized data on the Storage

Query customized data in Synapse Serverless SQL pool

1. Run the Powershell script to create the tables and views with customized schema image

2. Query customized data in Synapse SQL pool