Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
__wip: Make the geocoding much more accurate
Before this pass, we had very few reliable high scores (above 0.8) and even then, many of those were more than questionable. For instance: - we could have a 0.97 score for an address (without city info) - we could have a 0.95 score for just a city - only 411 (bad) results found if city or zipcode not present - there were 90741 scores above 0.8 among 136246 (66%) but mostly unreliable Now, we have: - 38637 rows found with score >0.8 if at least city/zipcode/insee is present instead of 411 - there is no single row resolving to a municipality - accuracy of the scores above 0.8 has been manually checked - we still have 66% of the results above the 0.8 score, but "lost" 20_000 records that could be considered garbage. For instance, some adresses containing a number now resolve to a street. After check, it seems that those are adresses (DOM/TOM mainly) where street numbers are unknown for the BAN and usually Google Maps as well.
- Loading branch information