Skip to content

Automate dataset creation from minimal list of diseases along with their corresponding diseases (Feature Engineer & Medical Encode).

License

Notifications You must be signed in to change notification settings

sxaxmz/Feature_Engineer_and_Medical_Encode_Disease_Symptoms

Repository files navigation

Feature Engineer & Medical Encode (Diseases and Symptoms)

Python tool to automate dataset creation from minimal list of diseases along with their corresponding diseases (Feature Engineer & Medical Encode).

Automate Dataset Process

Input File Extension XLSX

Output Files:

  • Feature Engineered file.
  • Medical encoded file.

Steps:

  1. Have your list of diseases and their corresponding symptoms in a table prepared as the following format; first column is the disease name and add corresponding symptoms upto column N (One symptoms per column cell, unlimited number of columns as per your need), as follows:

Input Table Format

  1. Update The file name and sheet name accordingly.

file_name = 'File Name'

name_sheet = 'Sheet1'

  1. Run the script, sip coffee, and wait for the output :)

Caution:

If an incorrect term was used for a disease name or a symptom that has no matching code in the medical dictionary it shall have the value of 0. Check the console output to ensure that you are using the correct term. Below highlighted in red, a case of using a symptom that has no matching term in the medical dictionary.

Note: Incase of having non-matching terms, SNOMED CT Official Website can be referenced to acquire correct terms.

Medical Encode Output

If a used term have multiple matching codes, each matching code can be found in corresponding output segment (i.e. Melena in the above sample).

Console Output:

This tool outputs multiple useful information such as:

  • List of Symptoms:

Symptoms List

  • List of Column Names int eh feature engineered file:

Column Names

  • Values of each added row to the final dataset (row index, count of row values, row values):

Sample of added rows

  • Each diseases name and symptoms along with their matching medical code (if any).

Hope it saved someone's time.

Usage:

When using or modifying this tool do always give appropriate credit.

References:

About

Automate dataset creation from minimal list of diseases along with their corresponding diseases (Feature Engineer & Medical Encode).

Topics

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages