merge
authorDave Reynolds <dave@epimorphics.com>
Thu, 29 Mar 2012 15:28:23 +0100
changeset 143 ef72bc3cb3e2
parent 142 e976be5d510c (current diff)
parent 141 29a3dd6dc12c (diff)
child 144 d85e49aab14d
merge
--- a/bp/index.html	Thu Mar 29 15:27:57 2012 +0100
+++ b/bp/index.html	Thu Mar 29 15:28:23 2012 +0100
@@ -25,6 +25,32 @@
   <p>This document is work in progress. The GLD WG ongoing discussions are recorded on the <a href="http://www.w3.org/2011/gld/wiki/Best_Practices_Discussion_Summary">Best Practices Wiki page</a></p>
 </section>
 
+<!-- Structure of Best 
+
+<!-- List of Best Practices -->
+
+The following best practices are discussed in this document and listed here for convenience.
+
+Select data sets that other people may wish to re-use.
+
+Re-use vocabularies whenever possible
+
+Identify relevant words that describe the main ideas or concepts
+
+Search for vocabularies using semantic search sites and dataset catalogues
+
+Ensure new vocabularies you create pass the "Vocabulary Creation Criteria Checklist"
+* Vocabulary is self-descriptive
+* Vocabulary is described in more than one language, ideally
+* Vocabulary will be accessible for a long period (has longevity)
+
+Ensure vocabularies you choose pass the "Vocabulary Selection Criteria Checklist"
+* Ensure vocabularies you use are published by a trusted group or organization
+* Ensure vocabularies have permanent URI
+* Confirm the versioning policy 
+
+
+
 
 <!--    INTRODUCTION    -->
 
@@ -51,29 +77,21 @@
 Linked Data uses a family of international standards and best practices for the publication, dissemination and reuse of structured data. Linked Data, unlike previous data formatting and publication approaches, provides a simple mechanism for combining data from multiple sources across the Web. 
 </p>
 
-<h2>Government Motivation for Publishing Linked Open Data</h2>
+<h2>Government Motivation to Publish Linked Open Data</h2>
 <p>
-Many governments have mandated publication of open government data to the public Web. The intention of these mandates are to facilitate the maintenance of open societies and support governmental accountability and transparency initiatives. However, publication of unstructured data on the World Wide Web is in itself insufficient; in order to realize the goals of efficiency, transparency and accountability, re-use of published data means members of the public must be able to absorb data in ways that can be readily found via search, visualized and absorbed programmatically.
+Governments worldwide have mandated publication of open government data to the public Web. The intention of these mandates is to facilitate the maintenance of open societies and support governmental accountability and transparency initiatives. However, publication of unstructured data on the World Wide Web is in itself insufficient; in order to realize the goals of efficiency, transparency and accountability, re-use of published data means members of the public must be able to absorb data in ways that can be readily found via search, visualized and absorbed programmatically.
 </p>
 
+The best practices provided provide a methodical approach for the creation, publication and dissemination of government Linked Data. Guidance on the life cycle of a Linked Data project, beginning with identification of suitable data sets, modeling, vocabulary selection, through publication and ongoing maintenance are provided.  Since publishing data to the Web following Linked Data principles is considering an emerging publication mode for governments, guidance for procurement of necessary services is provided to help inform government stakeholders.
 </section>
 
 
-<section>
-<h3>Motivation</h3>
-The best practices provided here are provide a methodical approach for the creation, publication and dissemination of government Linked Data, including:
-<ul>
-	<li> Description of the life cycle of a Linked Data project, starting with identification of suitable data sets, modeling, vocabulary selection, through publication and ongoing maintenance.</li>
-	<li> Definition of proven steps to create and maintain government data sets using Linked Data principles.</li>
-	<li> Guidance in the procurement process for publishing Linked Open Data.
-</ul>
 
+<h3> Linked Open Data Lifecycle </h3>
 <p class='issue'>Does it make sense to base the GLD life cycle on one of the general LD life cycles? See <a href="https://www.w3.org/2011/gld/track/issues/15">ISSUE-15</a></p>
 </section>
 
 <section>
-<p>Currently we have identified the following Government Linked Data Life Cycles
-</p>
 <p class="todo"> Include a brief description for each one of them.
 </p>
 <p>Hyland et al.</p>
@@ -84,8 +102,9 @@
 <img src="img/GLF_Hausenblas.PNG" width="600"/>
 <p>Villazon-terrazas</p>
 <img src="img/GLF_Villazon-terrazas.PNG" width="600" />
+</section>
 
-</section>
+
 <!--    PROCUREMENT   -->
 <section>
 <h3>Procurement</h3>
@@ -95,12 +114,22 @@
 </p>
 
 <p>
-Linked Open Data (LOD) offers novel approaches for publishing and consuming data on the Web. This procurement overview and companion glossary is intended to help contract officers and their technical representatives understand LOD activities, and their associated products and services. It is hoped that this will aid government officials in procuring LOD related products and services.
+Linked Open Data (LOD) offers novel approaches for publishing and consuming data on the Web. This procurement overview and companion glossary is intended to help contract officers and their technical representatives understand LOD activities, and their associated products and services. This guidance is intended to aid government officials in the procurement process.
 </p>
 
 <h4>Overview</h4>
 <p>
-LOD provisions data into the Web so it can be interlinked with other linked data, making it easier to discover, and more useful and reusable. LOD leverages World Wide Web standards such as Hypertext Transfer Protocol (HTTP), Resource Description Framework (RDF), and Uniform Resource Identifiers (URIs), which make data self-describing so that it is both human and machine readable. Self-describing data is important because most government data comes from relational data systems that do not fully describe the source data schema needed for application development by third parties.
+LOD provisions data into the Web so it can be interlinked with other linked data, making it easier to discover, and more useful and reusable. Linked Open Data is based on W3C including Hypertext Transfer Protocol (HTTP), Uniform Resource Identifiers (URIs), and Resource Description Framework (RDF) family of standards.  
+</p>
+
+</p>
+Publish structured content as <em>self-describing</em> Linked Data.
+
+Currently, the majority of structured data collected and curated by governments worldwide resides in relational data systems.  A logical schema for this structured content is typically maintained external to the data itself.  
+
+Understanding the schema, among other characteristics of the data, is necessary for data re-use.  Regardless of whether data is being re-used internally or externally to the original data owner, the <em>schema</em> or structure and organization of the data must be understood by those analyzing the data itself.
+
+Data published in an RDF format <em>combines the schema with the data</em> itself, which is what makes it self-describing.
 </p>
 
 <h5>LOD Production through Consumption Lifecycle</h5>
@@ -358,7 +387,7 @@
 
 <!-- << URI CONSTRUCTION   -->
 <section>
-<h3>URI Construction - Boris</h3>
+<h3>URI Construction</h3>
 <p class='responsible'>Ghislain Atemezing (INSTITUT TELECOM), Michael Hausenblas (DERI), Boris Villazon-Terrazas (UPM), Daniel Vila (UPM), John Erickson (RPI), Martin Alvarez (CTIC)</p>
 <p>
 This section specifies how to create good URIs for use in government linked data. Inputs include 
--- a/bp/respec-config.js	Thu Mar 29 15:27:57 2012 +0100
+++ b/bp/respec-config.js	Thu Mar 29 15:28:23 2012 +0100
@@ -1,7 +1,7 @@
 var respecConfig = {
     // specification status (e.g. WD, LCWD, NOTE, etc.). If in doubt use ED.
     specStatus:           "ED",
-    publishDate:          "2012-02-23",
+    publishDate:          "2012-03-22",
     //copyrightStart:       "2010",
 
     // the specification's short name, as in http://www.w3.org/TR/short-name/
--- a/glossary/index.html	Thu Mar 29 15:27:57 2012 +0100
+++ b/glossary/index.html	Thu Mar 29 15:28:23 2012 +0100
@@ -16,30 +16,18 @@
 
 <section id="abstract">
 
-<p> This document is a glossary of terms defined and used in the <a href="https://dvcs.w3.org/hg/gld/raw-file/default/bp/index.html">Best Practice Document </a>Working Note. It is intended for use by Government Linked Data publishers and consumers in order to refer to a common coherent set of terminologies. 
+<p> This document is a glossary of terms defined and used in the <a href="https://dvcs.w3.org/hg/gld/raw-file/default/bp/index.html">Best Practices for Publishing Linked Data</a> document published by the <a href="http://www.w3.org/2011/gld/charter">W3C Government Linked Data Working Group</a>. It is intended for use by Linked Data publishers and consumers in order to refer to a common glossary of terms. 
 </p>
-<p class="todo">@@TODO EXPAND MORE @@</p>
 </section>
 
 <section id="sotd">
-
 </section>
 
 <section class="introductory">
 <h2>Scope</h2>
-
-<p>
-This document is aimed at assisting government IT managers, procurement officers, Web developers, vendors, and researchers who are interested in publishing open government data using W3C standards.  The benefits of using international standards for data exchange is to significantly increase interoperability of data.
-</p>
 <p>
-Readers of this document are expected to be familiar with delivery of content via the Web, and to have a general familiarity with the technologies involved, but are not required to have a background in semantic technologies or previous experience with Linked Data. Data stewards, curators, database administrators and other personnel involved in Open Government initiatives are encouraged to read this Linked Open Data Best Practices document. 
-</section>
-
-
-
-<section id="abox">
-<h4> Abox </h4>
-One of two types of statements in an <a href="#ontology">ontology</a> (the other being <a href="#tbox">Tbox</a>).  Abox statements represent facts (or "assertions", hence the "A"), e.g. John is a Person (where Person is a defined class).
+This glossary is aimed at anyone interested in publishing Linked Data using W3C standards.  The benefits of using international standards for data exchange is to significantly increase interoperability and re-use of data.
+</p>
 </section>
 
 <section id="api">
@@ -53,17 +41,31 @@
 The part if the application that uses the content of an RDF store in an interaction with some user.
 </section>
 
-<section >
+<section>
 <h4>Blank Nodes</h4>
 RDF nodes that have no URI and thus cannot be referenced globally. They are used to stand in for anonymous entities.
 </section >
 
+<section>
+<h4>CC-BY-SA License</h4>
+Work available under a <a href="http://creativecommons.org/licenses/by-sa/3.0/" taget="blank">CC-BY-SA license</a> means you can include it in any other work under the condition that you give proper attribution. If you create derivative works (such as modified or extended versions), then you must also license them as CC-BY-SA.
+
 <section id="close">
 <h4>Closed World</h4>
 The presumption that what is not known to be true must be false.  The assumption underlying relational databases, most forms of logical programming, OWL DL and OWL Lite.
 </section>
 
 <section >
+<h4>Connection</h4>
+A transport layer virtual circuit established between two programs for the purpose of communication.
+</section>
+
+<section>
+<h4>Content Negotiation</h4>
+The mechanism for selecting the appropriate representation when servicing a request. The representation of entities in any response can be negotiated (including error responses).  See details from RFC 2616bis for an <a ref="http://www.w3.org/Protocols/HTTP/1.1/rfc2616bis/draft-lafon-rfc2616bis-03.html#content.negotiation" target="blank">Content Negotation</a>.
+</section>
+
+<section >
 <h4>Controlled Vocabularies</h4>
 Carefully selected sets of terms that are used to describe units of information; used to create thesauri,  taxonomies and ontologies.
 </section>
@@ -79,11 +81,6 @@
 </section >
 
 <section >
-<h4>Data Cloud</h4>
-<p class="todo">@@TODO: [email protected]@</p>
-</section >
-
-<section >
 <h4>Data Hub</h4>
 The <a href="http://thedatahub.org/">Data Hub</a> is a community-run catalogue of useful sets of data on the Internet, powered using an open-source data cataloguing software called <a href="http://ckan.org/">CKAN</a>. It is an openly editable open data catalogue, in the style of Wikipedia.
 </section >
@@ -93,7 +90,6 @@
 <p class="todo">@@TODO: [email protected]@</p>
 </section >
 
-
 <section >
 <h4>DBpedia</h4>
 An RDF representation of the metadata held in Wikipedia and made available for SPARQL query on the World Wide Web.
@@ -170,22 +166,22 @@
 
 <section>
 <h4>Linked Data</h4>
-It refers to a set of best practices for publishing and interlinking structured data on the Web.
-</section>
-
-<section>
-<h4>Knowledge Extraction</h4>
-It is the creation of knowledge from structured (relational databases, XML) and unstructured (text, documents, images) sources. The resulting knowledge needs to be in a machine-readable and machine-interpretable format and must represent knowledge in a manner that facilitates inferencing.
-</section>
-
-<section>
-<h4>Knowledge Discovery</h4>
-It describes the process of automatically searching large volumes of data for patterns that can be considered knowledge about the data.
+Refers to a set of best practices for publishing and interlinking structured data for access by both humans and machines via the use of the RDF family of syntaxes (e.g., RDF/XML, N3, Turtle and N-Triples) and HTTP URIs. Linked Data can be published by an person or organization behind the firewall or on the public Web.  If Linked Data is published on the public Web, it is generally called <em>Linked Open Data</em>.
 </section>
 
 <section>
 <h4>Linked Open Data</h4>
- A pattern for hyper-linking machine-readable data sets to each other using Semantic Web techniques, especially via the use of RDF and URIs. Enables distributed SPAQL queries of the data sets and a “browsing” or “discovery” approach to finding information (as compared to a search strategy. 
+Refers to Linked Data published on the public Web.  Publishing Linked Open Data enables distributed SPAQL queries of the data sets and a “browsing” or “discovery” approach to finding information, as compared to a search strategy.
+</section>
+
+<section>
+<h4>Linked Open Data Cloud</h4>
+The Linked Data Cloud represents datasets that have been published as Linked Data on the public Web.
+</section >
+
+<section>
+<h4> Linked Open Data Cloud diagram</h4>
+Refers to the pictoral depiction of the <a href=" http://lod-cloud.net/ target="blank">Linked Data Cloud</a>.  There are various depictions of the Linked Open Data Cloud including color-by-theme describing the various data domains including government, geographic, publications, life sciences and media content.  The Linked Open Data Cloud diagrams are maintained by <span rel="dc:creator"><a typeof="foaf:Person" href="http://richard.cyganiak.de/#me" property="foaf:name">Richard Cyganiak</a></span> (<a href="http://www.deri.ie/">DERI, NUI Galway</a>) and <span rel="dc:contributor"><a typeof="foaf:Person" href="http://www.wiwiss.fu-berlin.de/en/institute/pwo/bizer/team/JentzschAnja.html" resource="#anja" property="foaf:name">Anja Jentzsch</a></span> (<a href="http://www.wiwiss.fu-berlin.de/en/institute/pwo/bizer/">Freie Universität Berlin</a>). 
 </section>
 
 <section>
@@ -194,11 +190,6 @@
 </section>
 
 <section>
-<h4>Linked Open Data Cloud</h4>
-Linked Open Data that has been published is depicted in a LOD cloud diagram. The diagram shows connections between linked data sets and color codes them based on data type (e.g., government, media, life sciences, etc.). The diagram can be viewed at: <a href="http://richard.cyganiak.de/2007/10/lod/" target="blank">Linking Open Data cloud diagram, by Richard Cyganiak and Anja Jentzsch.</a>
-</section>
-
-<section>
 <h4>Machine-Readable Cataloging system</h4> 
 A family of formats for the representation of bibliographic information in libraries (ISO 2709, ANSI/NISO Z39.2).
 </section>
@@ -209,6 +200,12 @@
 </section>
 
 <section>
+<h4>Message</h4> 
+The basic unit of HTTP communication, consisting of a structured sequence of octets matching the syntax defined as an <a href="http://www.w3.org/Protocols/HTTP/1.1/rfc2616bis/draft-lafon-rfc2616bis-03.html#httpmessage" target="blank">HTTP Message</a> and transmitted via the connection.
+</section>
+
+
+<section>
 <h4>Metadata</h4> 
 Information used to administer, describe, preserve, present, use or link other information held in resources, especially knowledge resources, be they physical or virtual.
 </section>
@@ -287,9 +284,22 @@
 
 <section>
 <h4>RDFS</h4>
-RDFS is the schema language for RDF; it describes constructs for types of objects (Classes), relating 
-types to one another (subClasses), properties that describe objects (Properties), and relationships
-between them (subProperty).
+RDFS is the schema language for RDF; it describes constructs for types of objects (Classes), relating types to one another (subClasses), properties that describe objects (Properties), and relationships between them (subProperty).
+</section>
+
+<section>
+<h4>Request</h4>
+A request message from a client to a server includes, within the first line of that message, the method to be applied to the resource, the identifier of the resource, and the protocol version in use.  See details from RFC 2616bis for a <a ref="http://www.w3.org/Protocols/HTTP/1.1/rfc2616bis/draft-lafon-rfc2616bis-03.html#request" target="blank">HTTP Request</a>.
+</section>
+
+<section>
+<h4>Resource</h4>
+A network data object or service that can be identified by an HTTP URI. Resources may be available in multiple representations (e.g. multiple languages, data formats, size, and resolutions) or vary in other ways.  See details from RFC 2616bis for details on <a ref="http://www.w3.org/Protocols/HTTP/1.1/rfc2616bis/draft-lafon-rfc2616bis-03.html#uri.303" target="blank">Uniform Resource Identifiers</a>.
+</section>
+
+<section>
+<h4>Response</h4>
+After receiving and interpreting a request message, a server responds with an HTTP response message.  See details from RFC 2616bis for an <a ref="http://www.w3.org/Protocols/HTTP/1.1/rfc2616bis/draft-lafon-rfc2616bis-03.html#response" target="blank">HTTP Response</a> message.
 </section>
 
 <section>
@@ -327,11 +337,6 @@
 </section>
 
 <section>
-<h4>Silk</h4>
-Silk(Silk Link Discovery Framework) is a tool for discovering relationships between data items within different Linked Data sources.
-</section>
-
-<section>
 <h4>Simple Knowledge Organisation System</h4> 
 A vocabulary description language for RDF designed for representing traditional knowledge organization systems such as enterprise taxonomies in RDF. A W3C standard.
 </section>
@@ -398,7 +403,9 @@
 
 <section >
 <h4>Uniform Resource Identifiers (URIs)</h4>
- URI’s play a key role in enabling linked data. To publish data on the Web, the items in a domain of interest must first be identified. These are the things whose properties and relationships will be described in the data, and may include Web documents as well as real-world entities and abstract concepts. As Linked Data builds directly on Web architecture [[!WEBARCH]], the Web architecture term resource is used to refer to these things of interest, which are, in turn, identified by HTTP URIs. 
+URI’s play a key role in enabling Linked Data. To publish data on the Web, the items in a domain of interest must first be identified. These are the things whose properties and relationships will be described in the data, and may include Web documents as well as real-world entities and abstract concepts. As Linked Data builds directly on Web architecture, the Web architecture term "resource" is used to refer to these things of interest, which are, in turn, identified by HTTP URIs. 
+
+URIs have been known by many names: Web addresses, Universal Document Identifiers, Universal Resource Identifiers, and finally the combination of Uniform Resource Indetifiers.  As far as HTTP is concerned, Uniform Resource Identifiers (URI's) are simply formatted strings which identify via name, location, or any other characteristic, a resource.  For definitive information on URL syntax and semantics, see "Uniform Resource Identifiers (URI): Generic Syntax and Semantics," [[!RFC2396]]
 </section >
 
 <section >
@@ -447,16 +454,13 @@
 <h4>XML Schema</h4> 
 Limitations on the content of an XML document that defines what structural elements are allowed.
 </section >
-<section >
+<
+section >
 <h4>eXtensible Stylesheet Language Transformations</h4> 
  Declarative programs to transform one XML document into another XML document.
-
 </section>
 
 
-
-
-
 <!--    ACK   -->
 <section class="appendix">
 <h2>Acknowledgments</h2>
--- a/glossary/respec-config.js	Thu Mar 29 15:27:57 2012 +0100
+++ b/glossary/respec-config.js	Thu Mar 29 15:28:23 2012 +0100
@@ -1,11 +1,11 @@
 var respecConfig = {
     // specification status (e.g. WD, LCWD, NOTE, etc.). If in doubt use ED.
     specStatus:           "ED",
-    publishDate:          "2012-03-15",
-    //copyrightStart:       "2010",
+    publishDate:          "2012-03-26",
+    //copyrightStart:       "2011",
 
     // the specification's short name, as in http://www.w3.org/TR/short-name/
-    shortName:            "gld-gloss",
+    shortName:            "gld-glossary",
     //subtitle:             "",
     // if you wish the publication date to be other than today, set this
     // publishDate:  "2009-08-06",
@@ -18,7 +18,7 @@
     diffTool:             "http://www.aptest.com/standards/htmldiff/htmldiff.pl",
 
     // if there a publicly available Editor's Draft, this is the link
-    edDraftURI:           "http://dvcs.w3.org/hg/gld/gld-gloss/",
+    edDraftURI:           "http://dvcs.w3.org/hg/gld/gld-glossary/",
 
     // if this is a LCWD, uncomment and set the end of its review period
     // lcEnd: "2009-08-05",
@@ -33,7 +33,7 @@
     // only "name" is required
     editors:  [
 			{ name: "David Wood", url: "http://3roundstones.com/about-us/leadership-team/david-wood/",  company: "3 Round Stones", companyURL: "http://3roundstones.com/"},
-        		{ name: "Bernadette Hyland", url: "https://twitter.com/bernhyland",  company: "3 Round Stones", companyURL: "http://3roundstones.com/"},
+        		{ name: "Bernadette Hyland", url: "http://3roundstones.com/about-us/leadership-team/bernadette-hyland/",  company: "3 Round Stones", companyURL: "http://3roundstones.com/"},
 		{ name: "Ghislain Atemezing", url: "http://www.eurecom.fr/~atemezin",  company: "INSTITUT TELECOM, Eurecom", companyURL: "http://www.eurecom.fr"}
     ],
 
--- a/glossary/respec-ref.js	Thu Mar 29 15:27:57 2012 +0100
+++ b/glossary/respec-ref.js	Thu Mar 29 15:28:23 2012 +0100
@@ -12,7 +12,9 @@
                 berjon.biblio["VOID-GUIDE"] = "<cite><a href=\"http://www.w3.org/TR/void/\">Describing Linked Datasets with the VoID Vocabulary</a></cite>, K. Alexander, R. Cyganiak, M. Hausenblas, and J. Zhao, W3C Interest Group Note 03 March 2011. URL: http://www.w3.org/TR/void/";
                 berjon.biblio["RDFA-CORE-PROFILE"] = "<cite><a href=\"http://www.w3.org/profile/rdfa-1.1\">RDFa Core Default Profile</a></cite>, I. Herman, W3C RDF Web Applications Working Group 02 June 2011. URL: http://www.w3.org/profile/rdfa-1.1";
                 berjon.biblio["XHTML-RDFA-PROFILE"] = "<cite><a href=\"http://www.w3.org/profile/html-rdfa-1.1\">HTML+RDFa Core Default Profile</a></cite>, I. Herman, W3C RDF Web Applications Working Group 24 May 2011. URL: http://www.w3.org/profile/html-rdfa-1.1";
-                berjon.biblio["RFC2616"] = "<cite><a href=\"http://www.w3.org/Protocols/rfc2616/rfc2616.html\">Hypertext Transfer Protocol -- HTTP/1.1</a></cite>, R. Fielding; et al. June 1999. Internet RFC 2616. URL: http://www.w3.org/Protocols/rfc2616/rfc2616.html."
+                berjon.biblio["RFC2616"] = "<cite><a href=\"http://www.w3.org/Protocols/rfc2616/rfc2616.html\">Hypertext Transfer Protocol -- HTTP/1.1</a></cite>, R. Fielding; et al. June 1999. Internet RFC 2616. URL: http://www.w3.org/Protocols/rfc2616/rfc2616.html."; 
+                berjon.biblio["RFC2396"] = "<cite><a ref=\"http://tools.ietf.org/html/rfc2396\">Uniform Resource Identifiers (URI): Generic Syntax</a></cite>, Berners-Lee, T., Fielding, R.T., and L. Masinter.  August 1998. Internet RFC 2396. URL: http://tools.ietf.org/html/rfc2396";
+
 
                 // process the document before anything else is done
                 var refs = document.querySelectorAll('adef') ;