Comments (3)
Furl is doing the correct thing here, though it can be slightly confusing at
first glance.
The confusion boils down to how -- in the absence of URL delimiters (e.g. ://
,
/
, ?
, etc) -- solitary strings like www.example.com
are parsed.
While www.example.com
looks like a host to you and me, how does furl know it's
a host? What about
>>> furl('localhost')
Is localhost
a host, too? Or is it a path? Remember host strings can be almost
anything -- they needn't contain periods (e.g. localhost
). That, and every
valid host string is also a valid path segment.
http://foo.com/url/www.example.com/
is a perfectly valid URL, too.
So, to be consistent, furl parses all single-component strings (strings without
multiple URL components separated by URL delimiters) as path strings. You can
see this behavior with www.example.com
above.
>>> f = furl('www.example.com')
>>> f.host
None
>>> f.path.segments
['www.example.com']
This is also the exact behavior of Python's built-in urlparse module, too.
>>> from urlparse import urlparse
>>> urlparse('www.example.com')
ParseResult(scheme='', netloc='', path='www.example.com', params='', query='', fragment='')
Back to your original example. There are three slashes (///
) in the final URL
because www.example.com
is parsed as a path segment and not a host.
>>> f = furl('www.example.com').set(scheme='http').add(path='test/path')
>>> f.path.segments
['www.example.com', 'test', 'path']
>>> f.host
None
If you want furl to adopt www.example.com
as the host, explicitly set it as
the host with set()
. Like
>>> f = furl().set(host='www.example.com', scheme='http').add(path='test/path')
>>> f.path.segments
['test', 'path']
>>> f.host
'www.example.com'
I've thought about adding the ability for furl to 'guess' whether a solitary
string is the host string or a path segment based on whether or not that string
contains periods, but that would lead to inconsistent behavior. Consistent
behavior, even if sometimes initially surprising, is better behavior.
Does that answer your original question?
from furl.
Yes, absolutely. Thanks a lot for your very comprehensive answer.
from furl.
Wonderful.
Thank you for bringing this question to my attention, Max. Don't hesitate to let
me know if there's anything I can do for you.
from furl.
Related Issues (20)
- furl doesn't type-hint itself as Text HOT 3
- url parsing still accepts semicolon as separator HOT 5
- Add typing support HOT 6
- Hi, if the URL is like '127.0.0.1:8080/a/b', the result is '8080/a/b', which seems incorrect HOT 2
- How handle javascript:void(0) HOT 2
- Feature: Addition of `is_domain` flag
- Maximum recursion depth exceeded error for url with large list HOT 3
- Adding README to PyPi HOT 2
- '127.0.0.1:8329' parsed wrong in Python 3.9+ HOT 3
- Add SECURITY.md HOT 3
- Feature request: Convert to unescaped string HOT 2
- .join() and .path.normalize() incorrectly interpret base paths ending /. or /..
- furl.path object is NOT of type: pathlib.Path HOT 1
- Feature requests: get url relative to another url
- Setting multiple query params for the same key and resultant url format
- Seems to be a bug when paring multiple '@'s ?
- Use the `ipaddress` module to detect valid IPV6 HOT 1
- a method to set an individual query parameter without removing the rest or keeping the original value
- Removes "git@" from git remote URL HOT 1
- URL fragment percent-encoding
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from furl.