Question 1

Why does &amp; sometimes appear in URLs inside HTML?

Accepted Answer

When URL parameters separated by & appear inside an HTML attribute value, each & must be encoded as &amp; so the HTML parser does not try to interpret it as the start of an entity. The encoded HTML still produces the correct URL when the link is followed, because the browser decodes &amp; back to & before using the URL; the escaping exists only at the HTML layer.
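As a rough illustration of that round trip, here is a minimal Python sketch using the standard library's html module. The example.com URL is made up for the demonstration.

```python
import html

# A URL with two query parameters separated by a raw ampersand.
url = "https://example.com/search?q=cats&page=2"

# Escape it for use inside an HTML attribute value: & becomes &amp;
escaped = html.escape(url, quote=True)
link = f'<a href="{escaped}">results</a>'

# An HTML parser reverses this step before the URL is ever requested.
decoded = html.unescape(escaped)

print(link)
print(decoded == url)
```

The second print shows that decoding the attribute value gives back the original URL unchanged.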

Question 2

When should I use named entities versus numeric?

Accepted Answer

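Named entities (&amp; &lt; &gt; &quot; &apos;) are easier to read and write. Numeric references (&#38; and so on) are more portable across HTML and XML parsers. The five above are the only named entities predefined in XML, and four of them work in every HTML version; &apos; was not defined in HTML 4. Less common names (like &lambda;) are undefined in XML unless a DTD declares them. When in doubt, prefer named entities for readability in HTML, and numeric references when the same markup must also parse as XML.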
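The portability difference can be seen with Python's standard library, used here only as a stand-in for "an HTML5-aware decoder" and "a plain XML parser". This is a sketch, not a statement about any particular browser or tool.

```python
import html
import xml.etree.ElementTree as ET

# An HTML5-aware decoder knows the full named-entity table.
html_named = html.unescape("&lambda;")
html_numeric = html.unescape("&#955;")

# A plain XML parser accepts the numeric reference...
xml_numeric = ET.fromstring("<p>&#955;</p>").text

# ...but rejects the named one, because XML predefines only
# amp, lt, gt, quot, and apos.
try:
    ET.fromstring("<p>&lambda;</p>")
    xml_named_ok = True
except ET.ParseError:
    xml_named_ok = False

print(html_named == html_numeric == xml_numeric, xml_named_ok)
```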

Question 3

Is HTML entity encoding sufficient to prevent XSS?

Accepted Answer

Only for HTML text content and quoted attribute values, and only when applied uniformly. URL contexts (href, src) need URL encoding plus careful scheme handling (no javascript: URLs). JavaScript contexts (script tags, event handlers) need JavaScript string escaping. Auto-escaping at render time, handled by your templating engine, is the right layer; manual entity encoding is a fallback.
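To make the "different context, different escaper" point concrete, here is a hedged Python sketch. The helper names (escape_text, safe_href, js_string_literal) and the scheme allowlist are assumptions invented for this example. A real application should rely on its templating engine's auto-escaping instead of hand-rolled helpers like these.

```python
import html
import json
from urllib.parse import urlsplit

# Assumed allowlist for illustration; adjust to what your site links to.
ALLOWED_SCHEMES = {"http", "https", "mailto"}

def escape_text(value: str) -> str:
    """Escape for HTML text content or a quoted attribute value."""
    return html.escape(value, quote=True)

def safe_href(url: str) -> str:
    """Reject disallowed schemes, then HTML-escape for the attribute."""
    candidate = url.strip()
    # Browsers ignore tabs and newlines inside a URL, which can hide
    # a scheme, so this sketch refuses control characters outright.
    if any(ord(ch) < 0x20 for ch in candidate):
        return "#"
    scheme = urlsplit(candidate).scheme.lower()
    if scheme and scheme not in ALLOWED_SCHEMES:
        return "#"
    return html.escape(candidate, quote=True)

def js_string_literal(value: str) -> str:
    """Produce a JS string literal that is safe inside a script tag."""
    literal = json.dumps(value)
    # Escape markup characters so "</script>" cannot end the block.
    return (literal.replace("<", "\\u003c")
                   .replace(">", "\\u003e")
                   .replace("&", "\\u0026"))

print(escape_text("<b>hi</b>"))
print(safe_href("javascript:alert(1)"))
print(js_string_literal("</script>"))
```

Entity encoding alone would let the javascript: URL through unchanged, which is why the href case needs its own check.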

Question 4

Why is &apos; not always recognized?

Accepted Answer

&apos; is an XML entity and was not defined in HTML 4. HTML5 added it, but older HTML parsers may not recognize it. The numeric form &#39; is universally supported. Modern browsers handle both.
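Python's standard library makes the same conservative choice, which serves as a small illustration: its escaper emits a numeric reference for the apostrophe, while its HTML5-aware decoder accepts every spelling.

```python
import html

# The standard-library escaper avoids the named form and emits
# a hexadecimal numeric reference for the apostrophe.
escaped = html.escape("it's", quote=True)

# An HTML5-aware decoder treats named, decimal, and hex forms alike.
decoded = [html.unescape(s) for s in ("&apos;", "&#39;", "&#x27;")]

print(escaped)
print(decoded)
```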

Question 5

What is the difference between encodeURIComponent and HTML entity encoding?

Accepted Answer

They serve different layers. URL percent-encoding handles characters with special meaning in URLs; HTML entity encoding handles characters with special meaning in HTML markup. A single URL may need both: an & inside a query parameter value first becomes %26 (so it is not mistaken for a parameter separator), and then each & that still separates parameters in the resulting URL becomes &amp; in the surrounding href attribute.
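The two layers can be sketched in Python, where urllib.parse.quote with safe="" plays roughly the role of encodeURIComponent (the two differ on a few punctuation characters). The search URL and the value are made up for the example.

```python
import html
from urllib.parse import quote

company = "Tom & Jerry"

# Layer 1: percent-encode the parameter VALUE so its & is not read
# as a separator between parameters.
value = quote(company, safe="")
url = f"https://example.com/search?q={value}&page=2"

# Layer 2: HTML-escape the finished URL for the href attribute, so
# the remaining separator & becomes &amp;
attr = html.escape(url, quote=True)

print(url)
print(f'<a href="{attr}">search</a>')
```

The order matters: percent-encode individual values first, then entity-encode the finished URL once, at the point where it is written into the markup.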

Question 6

Does decoding here handle every HTML entity?

Accepted Answer

This tool decodes the five common named entities (&amp; &lt; &gt; &quot; &apos;), plus &nbsp;, plus all decimal (&#NNN;) and hexadecimal (&#xHHHH;) numeric forms. Less common named entities like &eacute; or &lambda; are not in the lookup table and would need to be in numeric form to decode correctly.
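A decoder with that coverage might look roughly like the Python sketch below. It is a hypothetical reimplementation for illustration, not this tool's actual source; the names NAMED, ENTITY, and decode_entities are invented here.

```python
import re

# Lookup table limited to the six named entities described above.
NAMED = {
    "amp": "&", "lt": "<", "gt": ">",
    "quot": '"', "apos": "'", "nbsp": "\u00a0",
}

# Matches &#xHH; (hex), &#NN; (decimal), or &name; (named).
ENTITY = re.compile(r"&(#[xX][0-9a-fA-F]+|#[0-9]+|[A-Za-z]+);")

def decode_entities(text: str) -> str:
    def replace(match: re.Match) -> str:
        body = match.group(1)
        try:
            if body[:2] in ("#x", "#X"):
                return chr(int(body[2:], 16))
            if body.startswith("#"):
                return chr(int(body[1:]))
        except (ValueError, OverflowError):
            # Code point out of range: leave the reference untouched.
            return match.group(0)
        # Unknown names (e.g. &eacute;) are passed through unchanged.
        return NAMED.get(body, match.group(0))

    # One pass only, so "&amp;lt;" decodes to "&lt;" and not to "<".
    return ENTITY.sub(replace, text)

print(decode_entities("&lt;p&gt;caf&#233; &amp; &#x3bb; &eacute;"))
```

In this sketch &eacute; survives undecoded while &#233; decodes, which matches the behavior described in the answer.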

HTML Entity Encoder & Decoder
