Announcing Bunsen: FHIR Data with Apache Spark
November 27, 2017
We’re excited to open source Bunsen, a library to make analyzing FHIR data with Apache Spark simple and scalable. Bunsen encodes FHIR resources directly into Apache Spark’s native data structures. This lets users leverage well-defined FHIR data models directly within Spark SQL.
Here’s a simple query against a table of FHIR observations that produces a table of heart rate values:
spark.sql("""
  select subject.reference person_id,
         effectiveDateTime date_time,
         valueQuantity.value value
  from observations
  where in_valueset(code, 'heart_rate')
""")
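To make the shape of that query concrete, here is a minimal sketch in plain Python of what it computes. It uses neither Spark nor Bunsen. The in_valueset function below is a hypothetical stand-in for the function used in the query, not Bunsen's implementation, and the VALUE_SETS table and sample observations are assumptions made up for illustration. The nested field names (subject.reference, effectiveDateTime, valueQuantity.value, code.coding) follow the FHIR Observation resource.

```python
# Illustrative only: plain Python, no Spark or Bunsen required.
# VALUE_SETS, in_valueset, and the sample data are hypothetical stand-ins.

LOINC = "http://loinc.org"

# Assumed value set: (system, code) pairs treated as "heart rate".
VALUE_SETS = {"heart_rate": {(LOINC, "8867-4")}}


def in_valueset(code, name):
    """Return True if any coding in a FHIR CodeableConcept belongs to the named value set."""
    members = VALUE_SETS[name]
    return any(
        (coding.get("system"), coding.get("code")) in members
        for coding in code.get("coding", [])
    )


# Two made-up FHIR-style observations, kept as nested dicts.
observations = [
    {
        "subject": {"reference": "Patient/123"},
        "effectiveDateTime": "2017-11-01T08:30:00Z",
        "valueQuantity": {"value": 72, "unit": "beats/minute"},
        "code": {"coding": [{"system": LOINC, "code": "8867-4"}]},
    },
    {
        "subject": {"reference": "Patient/123"},
        "effectiveDateTime": "2017-11-01T08:31:00Z",
        "valueQuantity": {"value": 81.5, "unit": "kg"},
        "code": {"coding": [{"system": LOINC, "code": "29463-7"}]},
    },
]


def heart_rate_rows(observations):
    """Mirror the query: filter by value set, then project three nested fields."""
    return [
        {
            "person_id": obs["subject"]["reference"],
            "date_time": obs["effectiveDateTime"],
            "value": obs["valueQuantity"]["value"],
        }
        for obs in observations
        if in_valueset(obs["code"], "heart_rate")
    ]


rows = heart_rate_rows(observations)
for row in rows:
    print(row)
```

Only the first observation passes the filter, because its code carries the LOINC heart rate code. The second, a body weight measurement, is dropped. In the Spark version the same filter and projection are expressed declaratively and run across the whole observations table.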