HTML Uniform Resource Locators Encoding

A URL is another word for a web address.

A URL can be composed of words (e.g. Examples.com), or an Internet Protocol (IP) address (e.g. 192.68.20.50).

The vast majority enter the name when surfing, since names are simpler to recall than numbers.

URL – Uniform Resource Locator

Internet browsers demand pages from web workers by utilizing a URL.

A Uniform Resource Locator (URL) is utilized to address a record (or different information) on the web.

Explanation:

  • scheme – defines the type of Internet service (most common is http or https)
  • prefix – defines a domain prefix (default for http is www)
  • domain – defines the Internet domain name (like examples.com)
  • port – defines the port number at the host (default for http is 80)
  • path – defines a path at the server (If omitted: the root directory of the site)
  • filename – defines the name of a document or resource

Common URL Schemes

The table below lists some common schemes:

SchemeShort forUsed for
httpHyperText Transfer ProtocolCommon web pages. Not encrypted
httpsSecure HyperText Transfer ProtocolSecure web pages. Encrypted
ftpFile Transfer ProtocolDownloading or uploading files
file A file on your computer
URL

URL Encoding

URLs can only be sent over the Internet using the ASCII character-set. If a URL contains characters outside the ASCII set, the URL has to be converted.

URL encoding converts non-ASCII characters into a format that can be transmitted over the Internet.

URL encoding replaces non-ASCII characters with a “%” trailed by hexadecimal digits.

URLs cannot contain spaces. URL encoding normally replaces a space with a plus (+) sign, or %20.

ASCII URL Encoding Examples

Your browser will encode input, according to the character-set used in your page.

The default character-set in HTML5 is UTF-8.

CharacterFrom Windows-1252From UTF-8
%80%E2%82%AC
£%A3%C2%A3
©%A9%C2%A9
®%AE%C2%AE
À%C0%C3%80
Á%C1%C3%81
Â%C2%C3%82
Ã%C3%C3%83
Ä%C4%C3%84
Å%C5%C3%85
URL