DOM CDATA – Handle Unescaped Text Blocks in XML

Introduction – Why Learn About DOM CDATA?

In XML, there are situations where you want to include special characters—like <, &, or even HTML/JavaScript code—without parsing them as XML. That’s what CDATA sections are for. The DOM treats CDATA as a special type of text node (nodeType = 4) that preserves raw text content.

In this guide, you’ll learn:

What a CDATA section is and how the DOM handles it
How to create and access CDATASection nodes
Differences between CDATA and normal Text nodes
When to use CDATA and its best practices in XML

What Is CDATA in XML?

CDATA (Character Data) is a block of text in XML that is not parsed by the XML processor. Everything inside <![CDATA[ ... ]]> is treated as literal text.

Example:

<code><![CDATA[if (a < b && b > c) { return true; }]]></code>

The <, >, and && symbols are not escaped and are preserved exactly as-is.

DOM CDATA Section – Properties and Type

Property	Description
`nodeType`	`4` (`CDATA_SECTION_NODE`)
`nodeName`	`"#cdata-section"`
`nodeValue`	The actual CDATA text
`data`	Alias for `nodeValue`
`length`	Length of the character data

A CDATA section behaves similarly to a Text node but is distinct in XML.

Example – Creating CDATA in JavaScript

const xmlDoc = document.implementation.createDocument("", "", null);
const book = xmlDoc.createElement("book");
const code = xmlDoc.createElement("code");

const cdata = xmlDoc.createCDATASection("if (a < b && b > c) { return true; }");

code.appendChild(cdata);
book.appendChild(code);
xmlDoc.appendChild(book);

console.log(new XMLSerializer().serializeToString(xmlDoc));

Output:

<book>
  <code><![CDATA[if (a < b && b > c) { return true; }]]></code>
</book>

CDATA vs Text Node

Feature	`Text` Node	`CDATASection` Node
`nodeType`	`3`	`4`
Parsing	Text content is parsed by XML	Text is preserved literally
Use case	Normal text content	Code snippets, symbols, special characters
Read/write	Same (`textContent`, `nodeValue`)	Same

Limitations of CDATA

You cannot include ]]> inside a CDATA block (must be escaped or split)
CDATA is not supported in HTML—only XML
CDATA can confuse downstream processors if not expected

Best Practices with CDATA

✔️ Use for unescaped strings like JavaScript, code samples, formulas
✔️ Use createCDATASection() when working in the XML DOM
✔️ Serialize with XMLSerializer if needed
Don’t overuse—prefer plain text unless escaping is an issue
Never include ]]> directly—split or escape it

Summary – Recap & Next Steps

CDATA allows you to safely include raw, unescaped text in your XML documents. The DOM represents CDATA blocks as distinct node types (CDATASection) that behave similarly to text nodes but are preserved verbatim.

Key Takeaways:

CDATA is useful for embedding unescaped code or special characters
DOM treats it as nodeType = 4 (CDATA_SECTION_NODE)
Use createCDATASection() to create them dynamically

Real-world relevance: Used in XML APIs, config files, embedded scripts, e-learning packages, and document editors.

FAQs – DOM CDATA

What is the nodeType of CDATA?
4 – CDATA_SECTION_NODE

Can I use CDATA in HTML?
No. CDATA is XML-only and not valid in HTML.

Can CDATA contain </ or ]]>?
It can contain </ but not ]]>. That must be split.

How do I convert CDATA to string?
Use node.nodeValue or textContent.

Can I style or format CDATA content?
Not directly. It’s meant for raw text, not formatting.

« Previous Next »

Share Now :