--- a/rdf-turtle/index.html Wed Feb 22 18:19:29 2012 +0100
+++ b/rdf-turtle/index.html Wed Feb 22 18:25:15 2012 +0100
@@ -540,10 +540,10 @@
</p>
</section>
- <section id="sec-strings">
- <h3>String Escape Sequences</h3>
+ <section id="sec-strings" class="issue">
+ <h3><span style="color:red;">REMOVE: </span>String Escape Sequences</h3>
- <p>Turtle strings and IRIs can use <code>\</code>-escape sequences to
+ <p>Turtle strings, IRIs and <a href="#term-turtle2-PN_LOCAL">local names</a> can use <code>\</code>-escape sequences to
represent Unicode code points.</p>
<p class="note">
@@ -553,6 +553,11 @@
<code>~.-!$&'()*+,;=:/?#@%_</code>. It is not necessary to escape
these characters in strings or IRIs.</p>
+ <p class="note">The <a href="#term-turtle2-PERCENT"><PERCENT></a> terminal matches %-encoding
+ sequences. These sequences are <em>not</em> decoded during processing; a term written as
+ <code><http://a.example/%66oo\-bar></code> in Turtle designates the RDF IRI
+ <code>http://a.example/%66oo-bar</code>.</p>
+
<p>The following table describes all the escape sequences
allowed inside a <a href="#prod-turtle2-String">string</a>
@@ -621,6 +626,115 @@
</blockquote>
</section>
+ <section id="sec-escapes" class="issue">
+ <h3><span style="color:red;">ADD: </span>Escape Sequences</h3>
+
+ <p><span style="font-weight:bold;">IRI</span>s, used as <a href="#prod-turtle2-IRIref">RDF terms</a> or as in <a href="#prod-turtle2-prefixID">@prefix</a> or <a href="#prod-turtle2-base">@base</a> declarations, may include <code>\</code>-escape sequences to represent Unicode code points:</p>
+
+ <table>
+ <thead>
+ <tr>
+ <th>Escape sequence</th>
+
+ <th>Unicode code point</th>
+
+ </tr>
+ </thead>
+ <tbody>
+
+ <tr>
+ <td>'\u' <a href="#rHEX">hex</a> <a href="#rHEX">hex</a> <a href="#rHEX">hex</a> <a href="#rHEX">hex</a></td>
+ <td>A Unicode codepoint in the range U+0 to U+FFFF inclusive
+ corresponding to the value encoded by the four hexadecimal digits interpreted from most significant to least significant digit.</td>
+ </tr>
+
+
+ <tr>
+ <td>'\U' <a href="#rHEX">hex</a> <a href="#rHEX">hex</a> <a href="#rHEX">hex</a> <a href="#rHEX">hex</a> <a href="#rHEX">hex</a> <a href="#rHEX">hex</a> <a href="#rHEX">hex</a> <a href="#rHEX">hex</a></td>
+
+ <td>A Unicode codepoint in the range U+10000 to U+10FFFF inclusive
+ corresponding to the value encoded by the eight hexadecimal digits interpreted from most significant to least significant digit.</td>
+ </tr>
+
+ </tbody>
+ </table>
+
+ <p>where <a href="#rHEX">HEX</a> is a hexadecimal character</p>
+ <blockquote>
+ <p><span style="font-family: monospace; font-size: 85%;"><a id="rHEX">HEX</a>
+ ::= [0-9] | [A-F] | [a-f]</span></p>
+
+ </blockquote>
+
+ <p>
+ <a href="#term-turtle2-BLANK_NODE_LABEL"><span style="font-weight:bold;">Blank node label</span>s</a> and <a href="#term-turtle2-PN_LOCAL"><span style="font-weight:bold;">local name</span>s</a> may include:
+ </p>
+ <ul>
+ <li>%-encoded sequences. A '%' followed by two hex characters represents that same sequence of three characters. These sequences are <em>not</em> decoded during processing; a term written as <code><http://a.example/%66oo\-bar></code> in Turtle designates the RDF IRI <code>http://a.example/%66oo-bar</code>.</li>
+ <li>escaped punctuation: A '\' followed by one of <code>~.-!$&'()*+,;=:/?#@%_</code> represents the character to the right of the '\'.</li>
+ </ul>
+
+ <p>
+ <a href="#prod-turtle2-String"><span style="font-weight:bold;">String</span>s</a> may include the <code>\</code>-escape sequences allowed in IRIs, as well as the the following escape sequences, which also map to Unicode code points:</p>
+
+ <table>
+ <thead>
+ <tr>
+ <th>Escape sequence</th>
+
+ <th>Unicode code point</th>
+
+ </tr>
+ </thead>
+ <tbody>
+
+ <tr>
+ <td>'\t'</td>
+ <td>U+0009</td>
+ </tr>
+
+ <tr>
+ <td>'\b'</td>
+ <td>U+0008</td>
+ </tr>
+
+ <tr>
+
+ <td>'\n'</td>
+ <td>U+000A</td>
+ </tr>
+
+ <tr>
+ <td>'\r'</td>
+ <td>U+000D</td>
+ </tr>
+
+ <tr>
+ <td>'\f'</td>
+ <td>U+000C</td>
+ </tr>
+
+ <tr>
+ <td>'\"'</td>
+ <td>U+0022</td>
+ </tr>
+
+ <tr>
+ <td>'\''</td>
+ <td>U+0027</td>
+ </tr>
+
+ <tr>
+ <td>'\\'</td>
+
+ <td>U+005C</td>
+ </tr>
+
+ </tbody>
+ </table>
+
+
+ </section>
<section id="sec-grammar-grammar">
<h3>Grammar</h3>
<p>The <abbr title="Extended Backus–Naur Form">EBNF</abbr> used here is defined in XML 1.0 (Third Edition)
@@ -951,10 +1065,10 @@
<section id="n-triple-changes" class="informative">
<h3>Changes from N-Triples</h3>
<ul>
- <li>Default encoding is UTF-8 rather then US-ASCII only
- <li>Uses IRIs rather then RDF URI References
+ <li>Default encoding is UTF-8 rather than US-ASCII only
+ <li>Uses IRIs rather than RDF URI References
<li>Defines a unique media type <code>text/ntriples+turtle</code>
- <li>Subset of Turtle rather then Notation 3
+ <li>Subset of Turtle rather than Notation 3
<li>Comments may occur after a triple production
</ul>
</section>