Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Stripping/removing URL parameters #26

Open
alexeiramone opened this issue Aug 6, 2021 · 1 comment
Open

Stripping/removing URL parameters #26

alexeiramone opened this issue Aug 6, 2021 · 1 comment

Comments

@alexeiramone
Copy link

alexeiramone commented Aug 6, 2021

It's stripping url parameters

url = 'https://www.example.com/xx/path/slug-whatever?atag=1234de&utm_medium=affiliates&utm_source=whatever_5443de' print(url_normalize(url,sort_query_params=True))
https://www.example.com/xx/path/slug-whatever?atag=1234de&utm_medium=affiliates
print(url_normalize(url,sort_query_params=False))
https://www.example.com/xx/path/slug-whatever?atag=1234de&utm_medium=affiliates

@alexeiramone
Copy link
Author

url_normalize.py, line 58

url = re.sub(r"utm_source=[^&]+&?", "", url)

Why utm_source is stripped as 'unecessary data'?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant