-
Notifications
You must be signed in to change notification settings - Fork 80
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Add support for 'Normalizer' tranformer #56
Comments
There's a conceptual mismatch between the PMML representation and Apache Spark/Scikit-Learn representations:
The |
To elaborate some more:
Also, consider the one-hot-encoding of categorical features for model training. Objectively, this is a stupid thing to do, but is very much needed in the current state of Apache Spark/Scikit-Learn, because they can't handle categorical features directly. PMML can, and doesn't need one-hot-encoding. |
Many thanks for your explanation. |
[More of an inquiry than a proper issue but I searched for prior issues/comments about this and did not find one, so I raise the issue so the reply is available for others.]
From the documentation, transformer 'org.apache.spark.ml.feature.Normalizer' is not currently supported and the API complains when trying to convert pipelines that contains said transformation.
We'd like to know a bit more about whether there is any particular reason for this transformation not being supported, and if there are plans to support it in the future.
Keep up the great work! Yours is an invaluable contribution to the industry.
The text was updated successfully, but these errors were encountered: