Mule TokenAIzer Custom Policy

This custom policy enables tokenization and obfuscation of sensitive data in API requests and responses. By applying this policy, you can protect sensitive information according to various regulations (such as PCI DSS, GDPR, and HIPAA), ensuring the original data structure is preserved while sensitive values are masked.
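
As an illustration (the field names and mask formats below are hypothetical; the actual output depends on the selected model and the enabled regulations), a request payload such as:

```json
{
  "customerName": "Jane Doe",
  "cardNumber": "4111111111111111",
  "ssn": "123-45-6789"
}
```

could be forwarded with the same structure but masked values:

```json
{
  "customerName": "J*** D**",
  "cardNumber": "**********1111",
  "ssn": "***-**-6789"
}
```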

Why?

A tokenizer policy strengthens data security and compliance when handling sensitive information. Key reasons why it matters:

  1. Data Protection: Tokenization replaces sensitive data (e.g., credit card numbers, Social Security Numbers) with non-sensitive tokens. These tokens are meaningless outside a secure tokenization system, reducing the risk of exposure during transmission.
  2. Compliance with Regulations: Many regulations, such as PCI DSS, HIPAA, and GDPR, require mechanisms to protect sensitive data. A tokenizer policy ensures compliance by obfuscating or protecting data, helping to meet these regulatory standards.
  3. Risk Reduction: By replacing sensitive data with tokens, the original data is not exposed if there is a breach. This minimizes the impact and liability associated with data leaks.
  4. Consistency in Data Security: Applying a policy ensures uniform protection across all systems and APIs, preventing gaps in the security of sensitive data.
  5. Flexibility: A tokenizer policy allows customization for specific data protection needs. For example: masking credit card numbers as "**********1234", or tokenizing names and addresses according to the applicable regulation (e.g., GDPR anonymization or PCI DSS requirements).
  6. Improved Development Practices: A tokenizer policy abstracts the complexity of data protection from developers, making it easier to enforce security without requiring them to implement custom solutions.

How?

The policy integrates with the Einstein Trust Layer to tokenize sensitive payloads. The process works as follows:

  1. Context Enrichment: The policy identifies which regulations apply to the payload (based on policy config) and enriches the context for tokenization.
  2. Tokenization: The payload is sent to Einstein AI, which generates obfuscated values based on the specified regulations. To do this, the policy builds a dynamic prompt optimized for the task (sketched below under Detailed Flow Explanation). NOTE: Please take into account that the Models API (used by the MAC Einstein Connector) applies rate limits that vary depending on the type of org you are using.
  3. TO-DO: Detokenization (optional): If detokenization is enabled, the policy will flag the token and store it to allow future detokenization (not performed by this policy).

Detailed Flow Explanation

  1. Context Enrichment Flow: This flow enriches the tokenization context based on the selected regulations. It generates the list of regulations to consider when processing the payload (an example of the resulting context follows the snippet).

```xml
<flow name="enrich-context-flow">
    <set-variable variableName="promptContext" value='#[%dw 2.0
        output application/java
        ---
        (
            [
                if ({{{pciEnabled}}}) "PCI DSS: Focuses on protecting payment card and transaction data..." else "",
                if ({{{hipaaEnabled}}}) "HIPAA: Protects health information..." else "",
                if ({{{gdprEnabled}}}) "GDPR: Protects personal data..." else "",
                ...
            ]
            filter ((item) -> item != "")
            joinBy ". \n"
        )]'/>
</flow>
```
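
For instance (illustrative only; the full regulation descriptions are truncated with "..." in the snippet above), with pciEnabled and gdprEnabled checked and all other regulations unchecked, the non-empty entries are joined by ". \n", so vars.promptContext becomes:

```text
PCI DSS: Focuses on protecting payment card and transaction data....
GDPR: Protects personal data...
```
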
  2. Tokenization Flow: This flow communicates with Einstein AI to tokenize sensitive information based on the enriched context (an example of the assembled prompt and the response handling follows the snippet). The tokenized response is then parsed and returned.

```xml
<flow name="ask-flow">
    <mac-einstein:chat-answer-prompt config-ref="Einstein_AI_Config"
        prompt="#[vars.prompt ++ '\nThe regulations to consider are ' ++ vars.promptContext ++ '\nThis is the payload to replace: \n' ++ write(payload, 'application/json')]"
        modelName="{{{model}}}"/>
    <set-payload value="#[output application/json --- read(payload.generation.generatedText, 'application/json')]"/>
</flow>
```
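
The prompt expression concatenates vars.prompt (the policy's internal instruction text, not shown in this README), the regulation context, and the serialized payload, producing something along these lines (values illustrative):

```text
<instructions from vars.prompt>
The regulations to consider are PCI DSS: Focuses on protecting payment card and transaction data....
This is the payload to replace:
{"cardNumber":"4111111111111111"}
```

The connector returns the generated text under payload.generation.generatedText, so, assuming a response shaped like the following (shape inferred from the set-payload expression above; values illustrative), the read(...) call parses the generated string back into the JSON payload returned to the client:

```json
{
  "generation": {
    "generatedText": "{ \"cardNumber\": \"**********1111\" }"
  }
}
```
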
  3. TO-DO: Detokenization Flow: If detokenization is allowed, this flow prepares the payload for the reversal of tokenization.

```xml
<flow name="detokenizer-prepare-flow">
    <logger level="DEBUG" message="#['Preparing detokenization for payload']" category="com.mule.policies.tokenAIzer" />
    <!-- TO-DO -->
</flow>
```

Usage

Before publishing this policy to Exchange, you will need to:

  1. Add anypoint-exchange as a valid server, with credentials, to the settings.xml file in your ~/.m2 folder (a sketch follows).
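
A minimal settings.xml sketch (the server id must match the repository id that the policy's pom.xml deploys to; anypoint-exchange is assumed here, and the credential type depends on how your Anypoint account is set up):

```xml
<settings>
  <servers>
    <server>
      <id>anypoint-exchange</id>
      <username>your-anypoint-username</username>
      <password>your-anypoint-password</password>
    </server>
  </servers>
</settings>
```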

After publishing the policy to Exchange (see Publish to Exchange under the Development section), follow these steps to apply it to an existing managed API:

  1. Log into Anypoint Platform.
  2. Enter API Manager.
  3. Click on the API version for the application you want to apply the policy to.
  4. Click on Policies (located on the left).
  5. Click on Apply New Policy.
  6. Filter by the Custom category and select tokenAIzer. Click on the Configure Policy button.
  7. Configure the policy parameters:
| Parameter | Label | Description |
| --- | --- | --- |
| requestProtected | Tokenize Request Payload? | Check if you want to protect the request payload by tokenizing sensitive data. |
| responseProtected | Tokenize Response Payload? | Check if you want to protect the response payload by tokenizing sensitive data. |
| detokenizationAllowed | Do you want to support detokenization? | Check if you want to support detokenization. Enabling this may impact performance, as detokenization will need to reverse the tokenization process. |
| pciEnabled | PCI DSS | Check if you want to protect request/response payloads following PCI DSS guidance. If enabled, it protects payment card and transaction data. |
| hipaaEnabled | HIPAA | Check if you want to protect request/response payloads following HIPAA guidelines. It ensures protection of health-related information. |
| gdprEnabled | GDPR | Check if you want to protect request/response payloads following GDPR regulations for personal data protection. |
| ccpaEnabled | CCPA | Check if you want to protect request/response payloads following CCPA regulations to protect personal data of California residents. |
| ferpaEnabled | FERPA | Check if you want to protect request/response payloads following FERPA guidelines to protect educational records in the US. |
| glbaEnabled | GLBA | Check if you want to protect request/response payloads following GLBA regulations to protect financial data in the US. |
| model | Model | The Einstein AI model to use for tokenization and obfuscation tasks. Options include: Anthropic Claude 3 Haiku on Amazon, Azure OpenAI Ada 002, Azure OpenAI GPT 3.5 Turbo, Azure OpenAI GPT 3.5 Turbo 16k, Azure OpenAI GPT 4 Turbo, OpenAI Ada 002, OpenAI GPT 3.5 Turbo, OpenAI GPT 3.5 Turbo 16k, OpenAI GPT 4, OpenAI GPT 4 32k, OpenAI GPT 4o (Omni), OpenAI GPT 4 Turbo. |
| einsteinClientId | Client ID | The client ID to authenticate with the external client app required by Einstein AI. This value is mandatory for integrating with Einstein. |
| einsteinClientSecret | Client Secret | The client secret to authenticate with the external client app required by Einstein AI. This value is mandatory and should be kept secure. |
| einsteinSfOrg | Einstein Salesforce Org | The Salesforce Organization (Org) where Einstein AI is running. Required for the policy to interact with Einstein services. |

Example Configuration

Here's an example of how you might configure the policy parameters:

```yaml
requestProtected: true
responseProtected: true
detokenizationAllowed: false
pciEnabled: true
hipaaEnabled: false
gdprEnabled: false
ccpaEnabled: false
ferpaEnabled: false
glbaEnabled: false
model: "Azure OpenAI GPT 4 Turbo"
einsteinClientId: "your-client-id"
einsteinClientSecret: "your-client-secret"
einsteinSfOrg: "your-salesforce-org"
```
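
With these values, both request and response payloads are tokenized, only PCI DSS guidance is included in the prompt context, detokenization support is disabled, and tokenization requests are sent to the Azure OpenAI GPT 4 Turbo model through Einstein.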

Development

The following commands are required during the development phase:

| Task | Command |
| --- | --- |
| Package policy | `mvn clean install` |
| Publish to Exchange | `mvn deploy` |

Dependencies

  • Mule Runtime 4.x: Required to run Mule 4 policies.
  • Maven: Used for building and deploying the policy.
  • MAC Einstein Connector: Provides the Einstein AI operations (such as mac-einstein:chat-answer-prompt) used by the tokenization flow.
  • Einstein AI: Required for tokenization and AI-based obfuscation.

Original Developer

GitHub Repository

Contribution

Want to contribute? Great!

Just fork the repo, make your updates, and open a pull request!

To-do

  • Implement unit tests
  • Implement detokenization flow
  • Add support for additional data formats
  • Improve error handling for invalid input
  • Improve AI Features: RAG, Chat with Memory
