Use-case quality measurement #61

EvanMachusak · 2023-03-21T13:09:49Z

EvanMachusak
Mar 21, 2023

NCQA's primary interest is burden reduction in the quality measurement space.

We're best known for authoring the HEDIS® measures which every health plan in the United States is required to compute & report annually. This is a significant market and there is no single HEDIS® reference implementation that everyone uses. Many companies' sole business is computing & submitting quality measures on behalf of payers. At least $1bn is spent by the US health care system annually.

NCQA is currently digitizing HEDIS® into FHIR CQL measures.

CQL is translated to an intermediary logical model called ELM

There are a couple of open source CQL execution engines in the space:

JavaScript - written by MITRE on behalf of CMS.
- All digital measures authored under contract with CMS must run using this engine
- It has several major, known bugs that, due to MITRE's contracting arrangement, can only be fixed if someone pays them to fix it
- It has significant performance issues (e.g., if you use large value sets, measures can take 5+ minutes per patient)
Java - written by Bryn Rhodes & co
- Implemented as a visitor pattern over the ELM graph
- Is the "de facto" engine most people use
- Performance is reasonable - for well written CQL it can do > 1k patients per second
- Has a number of bugs & feature gaps
DotNet - originally written by NCQA for HEDIS and now jointly developed with Firely
Clojure - an in-dev server designed for analytics.
- Not complete yet; no CQL benchmarks published

With all of these engines, the flaw is that they require bringing the data to the execution.

Imagine if SQL didn't exist, and if you wanted to query a database, you had to download the entire database to a processing node to run the query.

That's the state of all current CQL engines.

We can pump around 1m patients through our engine in about an hour. The majority of the time is spent downloading the data, writing results to disk cache, and uploading the results back to the store.

Bringing the execution to the data is a much smarter strategy for population health. SQL is the best tool for that job.

Here's a very simple CQL library:

library SqlOnFhir version '1.0.0'

using FHIR version '4.0.1'

include FHIRHelpers version '4.0.1' called FHIRHelpers

define "Born after 1980":
	from [Patient] p where p.birthDate > @1980-01-01

Here's what the (very abbreviated) ELM looks like:

{
  "type" : "Library$Statements",
  "def" : [ {
    "type" : "ExpressionDef",
    "name" : "Born after 1980",
    "context" : "Patient",
    "accessLevel" : "Public",
    "expression" : {
      "type" : "Query",
      "source" : [ {
        "type" : "AliasedQuerySource",
        "expression" : {
          "type" : "Retrieve",
          "dataType" : "{http://hl7.org/fhir}Patient"
        },
        "alias" : "p"
      } ],
      "relationship" : [ ],
      "where" : {
        "type" : "Greater",
        "operand" : [ {
          "type" : "Property",
          "path" : "birthDate",
          "scope" : "p"
        }, {
          "type" : "Date",
          "year" : {
            "type" : "Literal",
            "value" : "1980"
          },
          "month" : {
            "type" : "Literal",
            "value" : "1"
          },
          "day" : {
            "type" : "Literal",
            "value" : "1"
          }
        } ]
      }
    }
  } ]
}

Now it should be fairly clear how one could translate ELM into a SQL query. By visiting the Query expression we can take its pieces (e.g. the where property) and translate them into corresponding SQL queries or meta queries as we discussed.

Taking the example Parquet files Nikolai gave us, I loaded them into a DuckDB instance. Since I am a C# guy I used this NuGet package to use ADO.NET to interact with an in-memory DB.

Ironically, Struct types are not supported so if I want to use C# I need to unroll our structures using views. This doesn't affect this simple example but would if we were doing e.g. Observations which have STRUCT columns for codeable concepts.

The Parquet files treat patient.birthDate as a VARCHAR, so I created a simple view to cast it as a Date:

CREATE TABLE patients AS SELECT * FROM 'Input/Parquet/Patient.parquet/resources.parquet'
CREATE SCHEMA IF NOT EXISTS qm
CREATE VIEW qm.patients AS SELECT CAST(birthDate AS DATE) AS birthDate FROM patients

Then using an ELM visitor pattern I implemented a basic query translator which takes the above ELM and creates:

SELECT p.* FROM qm.patients p WHERE (p.birthDate > '1980-1-1'::DATE)

But this is too simple.

Real CQL libraries are not as simple as above. CQL has dozens of keywords that will almost certainly not map 1:1 to SQL functions. Even in the above example, if I change the CQL to this:

define "Born after 1980":
	from [Patient] p where p.birthDate > @1980

This would not work in DuckDB at least. One could say that anyone whose birthdate is after 1980 would be anyone whose birthdate is 1981-01-01 or later. Certainly the query translator can do this, but it wouldn't be correct in all cases. For example, in CQL:

@1980 > @1980-04-04

Will evaluate to null because the answer is uncertain. 1980 is not 1980. It's actually [1980-01-01, 1980-12-31]. Comparing an interval to a value inside that interval using greater than is undefined.

Therefore I think what is needed is to be able to create scalar & table-valued functions that implement these rules.

This is a simple example but there are many more. When we started converting CQL to C# we started by trying to use .NET native syntax (like greater than), but in the end we just turned everything into a function. Every single operator in CQL had some subtlety that made it not work with standard C# syntax - mostly because in CQL everything can be null, and C# doesn't allow you to compare nullable values using standard operators and instead requires that you coalesce the values first.

I think we would need to write functions in "meta SQL" so we can translate them for various RDBMS platforms. I am not concerned
with 100% coverage out of the box, but they should be at least translatable to ANSI SQL.

If we use a lot of functions in queries they will be slower, but they would have to be so much dramatically slower as to overcome the I/O cost of pulling all the data out of the tables for an off-platform computation engine as all listed above to make it worse.

If we achieved this, we would enable all CQL digital measures to execute against any schema-compliant platform (like Aidbox), and also any platform whose schema can be mapped to our schema using any mechanism.

Virtually every payer in the world runs on an RDBMS. So this would be huge for them.

ewoutkramer · 2023-03-27T15:35:42Z

ewoutkramer
Mar 27, 2023
Maintainer

Would it be possible that we use SQL to do a preselection - so you don't need to implement all of CQL on the RDBMS side, but just make sure that the amount of data is reduced as much as possible - and then do the final selection on the "client" side of things?

1 reply

EvanMachusak Mar 31, 2023
Author

That's the direction Bryn is going with his work on $data-requirements. Angelo at Ballista Group is doing the same kind of hybrid approach where you try to do most of the work up front . He indexes all resources by the value sets they're in, and then tries to infer which value sets are used by a measure so he can minimize the number of resources the CQL has to execute against on the client side. That's quite hard to do though, e.g.:
define "Gotcha": from [Conditions] c where c.code in "value set" and c.code not in "value set"

The test data we use for HEDIS is already pre-minified because our certification test decks only generate the minimum amount of data necessary for our test cases, so our patient bundles have around 50 resources in them - I think @niquola said at the last DevDays that the average patient has about 1000 resources. Even with small patient bundles the I/O burden is fairly large, so with 20 times that for realistic patients, it scales badly.

I am interested in Microsoft's pipeline (and Google has one too). They claim to be able to stream data from FHIR servers to analytics stores "in near real time". Well written CQL on a decent engine could be injected as a step in one of those pipelines at very low cost. In our internal tests, most of our CQL executes faster than the JSON deserialization takes, so it's essentially free.

I think in absence of running population queries "at the source" (e.g. via SQL), analytics pipelines are the right architecture.

JPercival · 2023-08-01T16:09:14Z

JPercival
Aug 1, 2023

With all of these engines, the flaw is that they require bringing the data to the execution.

This is not an inherent flaw in the open-source Java engine. It was intentionally designed to support architectures such as the one Evan is suggesting. Apache Spark and other JVM big data platforms are able to distribute the open-source CQL engine libraries to nodes where the data lives and run calculations in situ. In fact, an engineer at Google published an example of such a use case here for Beam:

https://github.com/google/cql-on-beam

Alphora also published an example for Spark a couple years ago:

https://github.com/DBCG/spark-cql-fhir

Other deeper integrations are also possible and are being developed commercially.

0 replies

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Use-case quality measurement #61

{{title}}

{{editor}}'s edit

{{editor}}'s edit

Replies: 2 comments 1 reply

{{title}}

{{title}}

{{editor}}'s edit

{{editor}}'s edit

{{title}}

{{editor}}'s edit

{{editor}}'s edit

Select a reply

Use-case quality measurement #61

EvanMachusak Mar 21, 2023

Replies: 2 comments · 1 reply

ewoutkramer Mar 27, 2023 Maintainer

EvanMachusak Mar 31, 2023 Author

JPercival Aug 1, 2023

EvanMachusak
Mar 21, 2023

Replies: 2 comments 1 reply

ewoutkramer
Mar 27, 2023
Maintainer

EvanMachusak Mar 31, 2023
Author

JPercival
Aug 1, 2023