User Import - small import improvement (Mail / gender recognition) #3823

MSoeb · 2024-07-08T09:05:10Z

Description: When importing accounts or participants, there is always the problem that some attributes (e-mail and gender) can have spaces BEFORE and AFTER the value. This leads to problems during import. The conflict must be resolved manually.

Example of the csv file with additional spaces. First mail column has spaces before and after the mail. Second mail has no spaces:

What should happen:
It would be good if spaces BEFORE and AFTER certain attributes were automatically ignored when uploading the csv file, as these are not logically possible.

Affected attributes:

E-mail
Gender

Info: Issue is part of META issue #3809

vkrasnovyd · 2024-08-22T07:26:05Z

@MSoeb Hi Marcus! I've done some research before writing a solution and found 2 options how to resolve this issue. I need your help to choose which approach would be more suitable.

First, some context. While recreating the problem I found out, that actually all attributes may be affected by it: removing spaces at the beginning and end of a string during parsing is not currently configured.
But only some of such fields with extra whitespaces trigger errors: email, gender (mentioned in this issue) and username.

There are 2 ways to deal with this:

Remove extra spaces only for the attributes that cause errors (email, gender, username) - as described in this issue.
Remove spaces at the beginning and end of each line for all attributes.
In both cases, the spaces in the middle of the lines will not be changed or removed.

Option 1 (only fields that cause errors) is a better choice if whitespaces in the begginning or end of some other fields (for example name) are important. But I can't think of a case where this would be relevant for this project.

Option 2 (cleanup all data) is as easy to implement as option 1, but it will produce cleaner parsed data.

Please let me know which option you think is more appropriate.

MSoeb · 2024-08-30T10:33:13Z

Option 2 sounds good. You could solve it this way.

MSoeb added enhancement General enhancement which is neither bug nor feature Schrödinger projectname labels Jul 8, 2024

MSoeb added this to the 4.2 milestone Jul 8, 2024

MSoeb changed the title ~~User Import - small import improvement (Mail recognition)~~ User Import - small import improvement (Mail / gender recognition) Jul 8, 2024

MSoeb mentioned this issue Jul 8, 2024

[META] Import Pages - Participants and Accounts #3809

Open

4 tasks

bastianjoel added the good first issue label Jul 23, 2024

vkrasnovyd self-assigned this Aug 21, 2024

vkrasnovyd mentioned this issue Sep 2, 2024

Add parsed data cleanup rule #4073

Merged

rrenkert closed this as completed in #4073 Sep 2, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

User Import - small import improvement (Mail / gender recognition) #3823

User Import - small import improvement (Mail / gender recognition) #3823

MSoeb commented Jul 8, 2024

vkrasnovyd commented Aug 22, 2024

MSoeb commented Aug 30, 2024

User Import - small import improvement (Mail / gender recognition) #3823

User Import - small import improvement (Mail / gender recognition) #3823

Comments

MSoeb commented Jul 8, 2024

vkrasnovyd commented Aug 22, 2024

MSoeb commented Aug 30, 2024