Fix parsing of search response #294

consideRatio · 2024-10-31T13:44:40Z

Fixes #292, and a bug that may never have surfaced: before 2.0.0, things could work even when the conn.response list had multiple elements, because the elements happened to be ordered so that the first one was the relevant one, even though the LDAP specification says any ordering is allowed. Maybe the ldap3 client orders them, but who knows.

Also fixes #295

Investigation

Should we use conn.entries instead of conn.response? How is conn.entries populated? Is it the type=searchResEntry responses?
Answer: yes we should use conn.entries, it is only including type=searchResEntry responses, see the code here.
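
To illustrate the distinction in isolation, here is a minimal sketch that does not import ldap3. It assumes conn.response is a list of dicts carrying a "type" key, and the sample data, DNs and the helper name only_search_entries are hypothetical. It shows why indexing the first element of the raw response can pick up a referral instead of the user's entry.

```python
# Hypothetical stand-in for the list-of-dicts shape assumed for conn.response.
SEARCH_ENTRY = "searchResEntry"


def only_search_entries(response):
    """Keep search result entries and drop other messages, e.g. referrals."""
    return [item for item in response if item.get("type") == SEARCH_ENTRY]


# Made-up response where a search reference happens to come first.
response = [
    {"type": "searchResRef", "uri": ["ldap://other.example.org/dc=example,dc=org"]},
    {
        "type": "searchResEntry",
        "dn": "uid=alice,ou=people,dc=example,dc=org",
        "attributes": {"uid": ["alice"]},
    },
]

entries = only_search_entries(response)
```

With this data, response[0] is the referral, while entries[0] is the entry for the user, which is roughly the filtering that relying on conn.entries is expected to provide.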

References

discussion: Incorrect handling for LDAP responses with Search Reference Results. #292
ldap3 docs: https://ldap3.readthedocs.io/en/latest/searches.html#entries
LDAP docs: https://datatracker.ietf.org/doc/html/rfc4511#section-4.5.2

manics · 2024-10-31T13:58:32Z

Assuming this works can you add a unit test for _filter_search_response using the responses in #292 (comment) as test data?

consideRatio · 2024-10-31T14:00:29Z

@manics I'll make sure to include that if the function is retained, but I think use of conn.entries may make it irrelevant. The documentation isn't very clear, so I'll investigate further.

We were using conn.response but should have used conn.entries as we only cared for search results and not other kinds of messages that could be part of conn.response.

consideRatio · 2024-11-01T11:10:52Z

@manics @franciscaestecker this is ready for review now. I think the bug resided in this project's use of conn.response while we expected the data from conn.entries, which includes nothing but the searchResEntry type of responses.

ldapauthenticator/ldapauthenticator.py

manics · 2024-11-01T12:08:46Z

ldapauthenticator/ldapauthenticator.py

@@ -547,6 +550,9 @@ def get_user_attributes(self, conn, userdn):
                search_filter="(objectClass=*)",
                attributes=self.auth_state_attributes,
            )
+            # FIXME: Handle situations with multiple entries below or comment
+            #        why its not important to do.
+            #


Maybe we should throw an error, same as in resolve_username? If there's a possibility of the entries corresponding to different identities, this implies a change in the LDAP server could lead to a different ordering of responses, resulting in a user gaining access to another user's account.

If it's two entries for the same user we still need to understand what the difference is, in case some attributes are different which could lead to inconsistent configuration of the singleuser server.

I don't think users will get access to another JupyterHub account, as it's just impacting the auth state (see the code below, from the end of the authenticate function) - but they could get access to another user's LDAP data through their JupyterHub account.

I'll open an issue for this to be tracked separately from this PR - do you think it's important to get fixed before release?

    user_attributes = self.get_user_attributes(conn, userdn)
    self.log.debug("username:%s attributes:%s", login_username, user_attributes)
    username = resolved_username if self.use_lookup_dn_username else login_username
    auth_state = {
        "ldap_groups": ldap_groups,
        "user_attributes": user_attributes,
    }
    return {"name": username, "auth_state": auth_state}

@manics btw off topic, but I just wanted to say I greatly appreciate the effort you put into JupyterHub - you regularly make me very thankful!!

My preference is to include this in the next release.

I think we should try to avoid any ambiguities in authenticators given their importance. I'd rather we were too strict and then relax things later. Ideally we'd get more input from people with expertise in LDAP, but it's clear we don't have that, and unfortunately the only way we can get real-world input is to release something and wait for bug reports.

Yeah I agree, I figure it makes sense to be in a separate PR though - I'll work on it next!
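
As a rough sketch of the "too strict first" idea, the helper below refuses anything other than exactly one entry. The names AmbiguousSearchResult and single_entry are hypothetical and not taken from this PR, and the actual change may log and return None rather than raise.

```python
class AmbiguousSearchResult(Exception):
    """Raised when a search does not yield exactly one entry (hypothetical)."""


def single_entry(entries, what):
    """Return the only entry, or raise if there are zero or several.

    Refusing to pick a "first" entry avoids depending on the order in
    which a server happens to return its results.
    """
    if len(entries) != 1:
        raise AmbiguousSearchResult(
            f"expected exactly one entry for {what}, got {len(entries)}"
        )
    return entries[0]
```

A caller could catch the exception, log the reason, and deny the login instead of silently using whichever entry came first.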

I've come to disagree with my former self, this is in scope of "Fix parsing of search response".

I pushed the commit to this PR!

It seems like a error in how things were setup if this happens. Co-authored-by: Simon Li <[email protected]>

consideRatio · 2024-11-01T15:39:42Z

I iterated a bit on the log messages as well to make errors have some pointer on what may be configured wrong or similar. I'm hands off now!

manics

The more detailed logging is good! I'm worried about including login ... denied in resolve_username though, since the decision is made in authenticate(). If there's a future refactor this could easily be missed, resulting in incorrect logs. Can we split the error logs, so that the resolve_username error contains the detail of why a username wasn't resolved, and put the definite denied error log in

ldapauthenticator/ldapauthenticator/ldapauthenticator.py

Lines 598 to 600 in 70ea3cd

    
    resolved_username, resolved_dn = self.resolve_username(login_username)
    if not resolved_dn:
        return None

instead?

The two log messages will still occur together so there's no loss of context.
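
The split suggested here could look roughly like the sketch below. It is a simplified stand-in rather than the authenticator's code: the functions are free-standing, the directory argument is a hypothetical dict mapping login names to lists of DNs, and the log wording is made up.

```python
import logging

log = logging.getLogger("ldap_sketch")


def resolve_username(login_username, directory):
    """Resolve a login name to a DN, logging only why resolution failed."""
    matches = directory.get(login_username, [])
    if len(matches) != 1:
        log.warning(
            "Failed to resolve %r to a single DN, found %d matches",
            login_username,
            len(matches),
        )
        return None, None
    return login_username, matches[0]


def authenticate(login_username, directory):
    """Make the allow/deny decision and log the denial here, not in the helper."""
    resolved_username, resolved_dn = resolve_username(login_username, directory)
    if not resolved_dn:
        log.warning("Login of %r denied", login_username)
        return None
    return {"name": resolved_username}
```

Both messages are still emitted back to back on a failed login, but each function only reports what it actually knows.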

consideRatio · 2024-11-03T11:56:39Z

Good point @manics, I've made the resolve_username function not draw conclusions about what's happening outside the function, so that it only logs what is in scope for it to log given its purpose.

manics

Thanks!
@franciscaestecker please could you try this PR? Assuming it fixes your bug reported in #292 we can release this in the next few days as 2.0.2

consideRatio · 2024-11-05T10:19:34Z

@manics I'll proceed with a release

consideRatio marked this pull request as draft October 31, 2024 13:48

consideRatio added the bug label Oct 31, 2024

consideRatio mentioned this pull request Oct 31, 2024

Incorrect handling for LDAP responses with Search Reference Results. #292

Closed

consideRatio force-pushed the pr/conn-stuff branch from 70ac79e to 2caa725 Compare October 31, 2024 13:53

consideRatio force-pushed the pr/conn-stuff branch from 2caa725 to 3ffc6ba Compare November 1, 2024 10:05

consideRatio changed the title ~~Preliminary work on fixing search result handling~~ Fix parsing of search response Nov 1, 2024

Fix parsing of search response

2123f36

We were using conn.response but should have used conn.entries as we only cared for search results and not other kinds of messages that could be part of conn.response.

consideRatio force-pushed the pr/conn-stuff branch from 3ffc6ba to 2123f36 Compare November 1, 2024 11:04

consideRatio marked this pull request as ready for review November 1, 2024 11:05

manics reviewed Nov 1, 2024

Fix log level

148eeb8

It seems like a error in how things were setup if this happens. Co-authored-by: Simon Li <[email protected]>

consideRatio mentioned this pull request Nov 1, 2024

Review user attributes fetched for auth_state - could end up not being the users without an error being raised? #295

Closed

Ensure unique user in search result for get_user_attributes

ccd244f

consideRatio force-pushed the pr/conn-stuff branch from dcb65fd to cf8aa04 Compare November 1, 2024 14:45

consideRatio added 2 commits November 1, 2024 15:46

Coalesce similar log events to reduce code complexity

f45d87e

refactor: a bit less code

00f8af1

consideRatio force-pushed the pr/conn-stuff branch from cf8aa04 to 00f8af1 Compare November 1, 2024 14:46

consideRatio added 2 commits November 1, 2024 16:11

Tweak log messages

0732444

Tweak log messages further, make users actionable

70ea3cd

consideRatio requested a review from manics November 1, 2024 15:39

manics reviewed Nov 2, 2024

Update logging in resolve_username to only consider its purpose

769f4fb

consideRatio force-pushed the pr/conn-stuff branch from c858a76 to 769f4fb Compare November 3, 2024 11:55

manics approved these changes Nov 3, 2024

manics mentioned this pull request Nov 4, 2024

v4 upgrade doc, changelog for 4.0 jupyterhub/zero-to-jupyterhub-k8s#3557

Merged

consideRatio merged commit f338ca3 into jupyterhub:main Nov 5, 2024
7 checks passed
