Merge
author fgandon@strongy.inria.fr
Wed, 22 Feb 2012 18:25:15 +0100
changeset 205 9a37a406f2ca
parent 204 f6311ba4d3bb (current diff)
parent 203 59fc36595622 (diff)
child 210 24ffdddd66ec
child 215 3110bc07e3cb
Merge
--- a/rdf-turtle/index.html	Wed Feb 22 18:19:29 2012 +0100
+++ b/rdf-turtle/index.html	Wed Feb 22 18:25:15 2012 +0100
@@ -540,10 +540,10 @@
 
             </p>
           </section>
-          <section  id="sec-strings">
-            <h3>String Escape Sequences</h3>
+          <section id="sec-strings" class="issue">
+            <h3><span style="color:red;">REMOVE: </span>String Escape Sequences</h3>
 
-            <p>Turtle strings and IRIs can use <code>\</code>-escape sequences to
+            <p>Turtle strings, IRIs and <a href="#term-turtle2-PN_LOCAL">local names</a> can use <code>\</code>-escape sequences to
             represent Unicode code points.</p>
 
             <p class="note">
@@ -553,6 +553,11 @@
 				<code>~.-!$&amp;'()*+,;=:/?#@%_</code>. It is not necessary to escape
 				these characters in strings or IRIs.</p>
 
+            <p class="note">The <a href="#term-turtle2-PERCENT">&lt;PERCENT&gt;</a> terminal matches %-encoding
+	    sequences. These sequences are <em>not</em> decoded during processing; a term written as
+	    <code>&lt;http://a.example/%66oo\-bar&gt;</code> in Turtle designates the RDF IRI
+	    <code>http://a.example/%66oo-bar</code>.</p>
+
 
             <p>The following table describes all the escape sequences
             allowed inside a <a href="#prod-turtle2-String">string</a>
@@ -621,6 +626,115 @@
 
             </blockquote>
           </section>
+          <section id="sec-escapes" class="issue">
+            <h3><span style="color:red;">ADD: </span>Escape Sequences</h3>
+
+            <p><span style="font-weight:bold;">IRI</span>s, used as <a href="#prod-turtle2-IRIref">RDF terms</a> or in <a href="#prod-turtle2-prefixID">@prefix</a> or <a href="#prod-turtle2-base">@base</a> declarations, may include <code>\</code>-escape sequences to represent Unicode code points:</p>
+
+            <table>
+              <thead>
+                <tr>
+                  <th>Escape sequence</th>
+
+                  <th>Unicode code point</th>
+
+                </tr>
+              </thead>
+              <tbody>
+
+                <tr>
+                  <td>'\u' <a href="#rHEX">hex</a> <a href="#rHEX">hex</a> <a href="#rHEX">hex</a> <a href="#rHEX">hex</a></td>
+                  <td>A Unicode code point in the range U+0 to U+FFFF inclusive
+              corresponding to the value encoded by the four hexadecimal digits interpreted from most significant to least significant digit.</td>
+                </tr>
+
+                                                              
+                <tr>
+                  <td>'\U' <a href="#rHEX">hex</a> <a href="#rHEX">hex</a> <a href="#rHEX">hex</a> <a href="#rHEX">hex</a> <a href="#rHEX">hex</a> <a href="#rHEX">hex</a> <a href="#rHEX">hex</a> <a href="#rHEX">hex</a></td>
+
+                  <td>A Unicode code point in the range U+10000 to U+10FFFF inclusive
+              corresponding to the value encoded by the eight hexadecimal digits interpreted from most significant to least significant digit.</td>
+                </tr>
+
+              </tbody>
+            </table>
+
+            <p>where <a href="#rHEX">HEX</a> is a hexadecimal character.</p>
+            <blockquote>
+              <p><span style="font-family: monospace; font-size: 85%;"><a id="rHEX">HEX</a> 
+            ::= [0-9] | [A-F] | [a-f]</span></p>
+
+            </blockquote>
+
+	    <p>
+	      <a href="#term-turtle2-BLANK_NODE_LABEL"><span style="font-weight:bold;">Blank node label</span>s</a> and <a href="#term-turtle2-PN_LOCAL"><span style="font-weight:bold;">local name</span>s</a> may include:
+	    </p>
+	    <ul>
+	      <li>%-encoded sequences. A '%' followed by two hex characters represents that same sequence of three characters. These sequences are <em>not</em> decoded during processing; a term written as <code>&lt;http://a.example/%66oo\-bar&gt;</code> in Turtle designates the RDF IRI <code>http://a.example/%66oo-bar</code>.</li>
+	      <li>escaped punctuation: A '\' followed by one of <code>~.-!$&amp;'()*+,;=:/?#@%_</code> represents the character to the right of the '\'.</li>
+	    </ul>
+
+	    <p>
+	      <a href="#prod-turtle2-String"><span style="font-weight:bold;">String</span>s</a> may include the <code>\</code>-escape sequences allowed in IRIs, as well as the following escape sequences, which also map to Unicode code points:</p>
+
+            <table>
+              <thead>
+                <tr>
+                  <th>Escape sequence</th>
+
+                  <th>Unicode code point</th>
+
+                </tr>
+              </thead>
+              <tbody>
+
+                <tr>
+                  <td>'\t'</td>
+                  <td>U+0009</td>
+                </tr>
+                                                              
+                <tr>
+                  <td>'\b'</td>
+                  <td>U+0008</td>
+                </tr>
+                                                              
+                <tr>
+
+                  <td>'\n'</td>
+                  <td>U+000A</td>
+                </tr>
+                                                              
+                <tr>
+                  <td>'\r'</td>
+                  <td>U+000D</td>
+                </tr>
+                                                              
+                <tr>
+                  <td>'\f'</td>
+                  <td>U+000C</td>
+                </tr>
+                                                              
+                <tr>
+                  <td>'\"'</td>
+                <td>U+0022</td>
+                </tr>
+                                                                                                                            
+                <tr>
+                  <td>'\''</td>
+                <td>U+0027</td>
+                </tr>
+                                                                                                                            
+                <tr>
+                  <td>'\\'</td>
+
+                  <td>U+005C</td>
+                </tr>
+
+              </tbody>
+            </table>
+
+
+          </section>
           <section id="sec-grammar-grammar">
           <h3>Grammar</h3>
             <p>The <abbr title="Extended Backus–Naur Form">EBNF</abbr> used here is defined in XML 1.0 (Third Edition)
@@ -951,10 +1065,10 @@
         <section id="n-triple-changes" class="informative">
           <h3>Changes from N-Triples</h3>
           <ul>
-            <li>Default encoding is UTF-8 rather then US-ASCII only
-            <li>Uses IRIs rather then RDF URI References
+            <li>Default encoding is UTF-8 rather than US-ASCII only
+            <li>Uses IRIs rather than RDF URI References
             <li>Defines a unique media type <code>text/ntriples+turtle</code>
-            <li>Subset of Turtle rather then Notation 3
+            <li>Subset of Turtle rather than Notation 3
             <li>Comments may occur after a triple production
           </ul>
         </section>
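
Reviewer's sketch (not part of the changeset): the escape rules added in `sec-escapes` could be exercised with something like the Python below. It is a minimal illustration under stated assumptions, not the parser behaviour the spec mandates. The function names `unescape_iri`, `unescape_string` and `unescape_local_name` are hypothetical, and the code assumes the input has already been tokenized, so it does no grammar validation.

```python
import re

# Single-character string escapes, taken from the table added in this diff.
STRING_ESCAPES = {
    "t": "\t", "b": "\b", "n": "\n", "r": "\r", "f": "\f",
    '"': '"', "'": "'", "\\": "\\",
}

# Punctuation that may follow a backslash in local names (list from the diff).
LOCAL_ESCAPE_CHARS = "~.-!$&'()*+,;=:/?#@%_"

# \u + 4 hex digits, or \U + 8 hex digits.
_NUMERIC = r"\\u([0-9A-Fa-f]{4})|\\U([0-9A-Fa-f]{8})"


def _numeric(match):
    # Exactly one of the two groups is set; chr() raises ValueError
    # for values above U+10FFFF.
    return chr(int(match.group(1) or match.group(2), 16))


def unescape_iri(text):
    """Decode \\u and \\U escapes; %-encoded sequences are left untouched."""
    return re.sub(_NUMERIC, _numeric, text)


def unescape_string(text):
    """Decode numeric escapes plus the single-character string escapes."""
    pattern = _NUMERIC + r"|\\([tbnrf\"'\\])"

    def repl(match):
        if match.group(3) is not None:
            return STRING_ESCAPES[match.group(3)]
        return _numeric(match)

    return re.sub(pattern, repl, text)


def unescape_local_name(text):
    """Drop the backslash before escaped punctuation; keep %XX as written."""
    pattern = r"\\([" + re.escape(LOCAL_ESCAPE_CHARS) + r"])"
    return re.sub(pattern, r"\1", text)
```

Applied to the example in the diff, `unescape_local_name(r"%66oo\-bar")` drops the backslash but leaves `%66` undecoded, which matches the note that %-encoded sequences are not decoded during processing.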