+ <a href="#sec-escapes">Escape Sequences</a>
authorEric Prud'hommeaux <eric@w3.org>
Wed, 22 Feb 2012 01:55:28 -0500
changeset 203 59fc36595622
parent 202 6844aa998a5a
child 205 9a37a406f2ca
child 206 fade92348740
+ <a href="#sec-escapes">Escape Sequences</a>
rdf-turtle/index.html
--- a/rdf-turtle/index.html	Wed Feb 15 18:33:11 2012 -0800
+++ b/rdf-turtle/index.html	Wed Feb 22 01:55:28 2012 -0500
@@ -540,8 +540,8 @@
 
             </p>
           </section>
-          <section  id="sec-strings">
-            <h3>String Escape Sequences</h3>
+          <section id="sec-strings" class="issue">
+            <h3><span style="color:red;">REMOVE: </span>String Escape Sequences</h3>
 
             <p>Turtle strings, IRIs and <a href="#term-turtle2-PN_LOCAL">local names</a> can use <code>\</code>-escape sequences to
             represent Unicode code points.</p>
@@ -626,6 +626,115 @@
 
             </blockquote>
           </section>
+          <section id="sec-escapes" class="issue">
+            <h3><span style="color:red;">ADD: </span>Escape Sequences</h3>
+
+            <p><span style="font-weight:bold;">IRI</span>s, used as <a href="#prod-turtle2-IRIref">RDF terms</a> or as in <a href="#prod-turtle2-prefixID">@prefix</a> or <a href="#prod-turtle2-base">@base</a> declarations, may include <code>\</code>-escape sequences to represent Unicode code points:</p>
+
+            <table>
+              <thead>
+                <tr>
+                  <th>Escape sequence</th>
+
+                  <th>Unicode code point</th>
+
+                </tr>
+              </thead>
+              <tbody>
+
+                <tr>
+                  <td>'\u' <a href="#rHEX">hex</a> <a href="#rHEX">hex</a> <a href="#rHEX">hex</a> <a href="#rHEX">hex</a></td>
+                  <td>A Unicode codepoint in the range U+0 to U+FFFF inclusive
+              corresponding to the value encoded by the four hexadecimal digits interpreted from most significant to least significant digit.</td>
+                </tr>
+
+                                                              
+                <tr>
+                  <td>'\U' <a href="#rHEX">hex</a> <a href="#rHEX">hex</a> <a href="#rHEX">hex</a> <a href="#rHEX">hex</a> <a href="#rHEX">hex</a> <a href="#rHEX">hex</a> <a href="#rHEX">hex</a> <a href="#rHEX">hex</a></td>
+
+                  <td>A Unicode codepoint in the range U+10000 to U+10FFFF inclusive
+              corresponding to the value encoded by the eight hexadecimal digits interpreted from most significant to least significant digit.</td>
+                </tr>
+
+              </tbody>
+            </table>
+
+            <p>where <a href="#rHEX">HEX</a> is a hexadecimal character</p>
+            <blockquote>
+              <p><span style="font-family: monospace; font-size: 85%;"><a id="rHEX">HEX</a> 
+            ::= [0-9] | [A-F] | [a-f]</span></p>
+
+            </blockquote>
+
+	    <p>
+	      <a href="#term-turtle2-BLANK_NODE_LABEL"><span style="font-weight:bold;">Blank node label</span>s</a> and <a href="#term-turtle2-PN_LOCAL"><span style="font-weight:bold;">local name</span>s</a> may include:
+	    </p>
+	    <ul>
+	      <li>%-encoded sequences. A '%' followed by two hex characters represents that same sequence of three characters. These sequences are <em>not</em> decoded during processing; a term written as <code>&lt;http://a.example/%66oo\-bar&gt;</code> in Turtle designates the RDF IRI <code>http://a.example/%66oo-bar</code>.</li>
+	      <li>escaped punctuation: A '\' followed by one of <code>~.-!$&amp;'()*+,;=:/?#@%_</code> represents the character to the right of the '\'.</li>
+	    </ul>
+
+	    <p>
+	      <a href="#prod-turtle2-String"><span style="font-weight:bold;">String</span>s</a> may include the <code>\</code>-escape sequences allowed in IRIs, as well as the the following escape sequences, which also map to Unicode code points:</p>
+
+            <table>
+              <thead>
+                <tr>
+                  <th>Escape sequence</th>
+
+                  <th>Unicode code point</th>
+
+                </tr>
+              </thead>
+              <tbody>
+
+                <tr>
+                  <td>'\t'</td>
+                  <td>U+0009</td>
+                </tr>
+                                                              
+                <tr>
+                  <td>'\b'</td>
+                  <td>U+0008</td>
+                </tr>
+                                                              
+                <tr>
+
+                  <td>'\n'</td>
+                  <td>U+000A</td>
+                </tr>
+                                                              
+                <tr>
+                  <td>'\r'</td>
+                  <td>U+000D</td>
+                </tr>
+                                                              
+                <tr>
+                  <td>'\f'</td>
+                  <td>U+000C</td>
+                </tr>
+                                                              
+                <tr>
+                  <td>'\"'</td>
+                <td>U+0022</td>
+                </tr>
+                                                                                                                            
+                <tr>
+                  <td>'\''</td>
+                <td>U+0027</td>
+                </tr>
+                                                                                                                            
+                <tr>
+                  <td>'\\'</td>
+
+                  <td>U+005C</td>
+                </tr>
+
+              </tbody>
+            </table>
+
+
+          </section>
           <section id="sec-grammar-grammar">
           <h3>Grammar</h3>
             <p>The <abbr title="Extended Backus–Naur Form">EBNF</abbr> used here is defined in XML 1.0 (Third Edition)