URLs
rigour.urls
build_url(url, params=None)
Compose a URL with the given query parameters.
Source code in rigour/urls/cleaning.py
clean_url(text)
Perform intensive care on URLs to make sure they have a scheme and a host name. If no scheme is given HTTP is assumed.
Source code in rigour/urls/cleaning.py
clean_url_compare(text)
Destructively clean a URL for comparison.
Source code in rigour/urls/cleaning.py
compare_urls(left, right)
Compare two URLs and return a float between 0 and 1 representing the similarity between them. Before comparison, clean both URLs in a destructive way.