Added brief informative paragraph on LD data modeling.
authorbhyland
Thu, 12 Dec 2013 09:53:10 -0500
changeset 725 c588deb9bcef
parent 724 c1741b7f0618
child 726 47986f4fe9e8
Added brief informative paragraph on LD data modeling.
bp/index.html
--- a/bp/index.html	Thu Dec 12 09:20:33 2013 -0500
+++ b/bp/index.html	Thu Dec 12 09:53:10 2013 -0500
@@ -98,6 +98,14 @@
         	,		publisher: "W3C"
         	},
 
+            "WOOD2013": {
+            			title: "Linked Data: Structured Data on the Web" ,
+            			href: "http://www.manning.com/dwood/",
+            			authors: ["Wood, D.", "Zaidman, M.", "Ruth, L.", and "Hausenblas, M."] ,
+            			publisher: "Manning Publications Co, New York, 2013"
+            	        } ,
+
+
             "howto-lodp": {
             			title: "How to Publish Linked Data on the Web" ,
             			href: "http://linkeddata.org/docs/how-to-publish",
@@ -167,9 +175,6 @@
                         }
             
             }
-           
-
-
 
       };
     </script> 
@@ -254,16 +259,13 @@
 <p class='stmt'><a href="#WORKFLOW">CHOOSE A WORKFLOW:</a><br />Determine which workflow to use for your Linked Open Data use case.
  </p>
 
-<p class='stmt'><a href="#SELECT">SELECT A DATASET:</a> <br />Select a dataset that provides 
+<p class='stmt'>SELECT A DATASET:<br />Select a dataset that provides 
 benefit to others for re-use.  Consider a dataset that is uniquely collected by your organization:
 information that when combined with other open data provides greater value.  For example, 
 publishing facilities that can then be linked with geographic information including post<del>al</del> codes is a popular example.
 </p>
 
-<p class='stmt'><a href="#PII">PERSONALLY IDENTIFIABLE DATA:</a><br /> Do not publish 
-personally identifiable information as Linked Open Data as it can potentially be misused.  
-Examples of personally identifiable data include: individual names, national 
-identification numbers, phone numbers and office extensions, and driver's license numbers. 
+<p class='stmt'>PERSONALLY IDENTIFIABLE DATA:<br /> Do not publish personally identifiable information as Linked Open Data as it can potentially be misused.  Examples of personally identifiable data include: individual names, national identification numbers, phone numbers and office extensions, and driver's license numbers. 
 </p>
 
 <p class='stmt'><a href="#LICENSE">ASSOCIATE A LICENSE:</a> <br />Associate an appropriate open 
@@ -271,10 +273,7 @@
 </p>
 
 <p class='stmt'><a href="#MODEL">MODEL:</a> <br /><a href="http://www.w3.org/TR/ld-glossary/#modeling-process">Model</a> the 
-data in an application-independent way.  This means that if the data was originally organized for a 
-specific application, you're likely to remove irrelevant content.  In many cases, denormalizing 
-the data may be necessary and is appropriate. 
-</p>
+data in an application-independent way.</p>
 
 <p class='stmt'><a href="#STANDARD-VOCABULARIES">STANDARD_VOCABULARIES:</a> <br />Describe objects with 
 standard <a href="http://www.w3.org/TR/ld-glossary/#vocabulary">vocabularies</a> whenever possible.
@@ -343,18 +342,15 @@
 <p>
 The process of publishing  Government Linked Open Data (GLD) should be comprised of 
 tractable and manageable steps, forming a life cycle in the same way software engineering 
-uses life cycles in development projects. A GLD life cycle should cover 
-all steps from identifying appropriate datasets to actually publishing and maintaining them. 
+uses life cycles in development projects. The life cycle for Government Linked Data includes 
+all steps starting with identifying appropriate datasets, through publication and ongoing maintenance. 
 In the following paragraph three different life cycle models are presented, however it 
 is evident that they all share common (and sometimes overlapping) characteristics in 
 their constituents. For example, they all identify the need to specify, model and publish data 
-in acceptable LOD formats. In essence, they capture the same tasks that are needed in 
+in standard open Web formats. In essence, they capture the same tasks that are needed in 
 the process, but provide different boundaries between these tasks.
 </p>
 
-<!--<p class="issue"> (Editors) - Please provide a brief description of lifecycle diagrams.
-</p> -->
-
 <ul>
 	<li>
 	<p>Hyland et al. [[BHYLAND2011]] provide a six-step “cookbook” to model, create, 
@@ -396,6 +392,14 @@
 <img src="img/GLF_Villazon-terrazas.PNG" width="600" />
 </section>
 
+<!--   DATA MODELING   -->
+<section id="MODEL">
+<h2>Data Modeling</h2>
+
+<p>
+Modeling Linked Data requires representation of data objects and how they are related in an application-independent way.  For example, if the data was originally organized for a specific application, Linked Data modeling may involve removing government program or agency-related content.  When modeling Linked Data, we are representing facts and how they are for more general purpose consumption by the public.  In many cases, denormalizing the data may be necessary and is appropriate. [[WOOD2013]]
+</p>
+</section>
 
 <!--   VOCABULARY SELECTION   -->
 <section id="STANDARD-VOCABULARIES">
@@ -458,7 +462,6 @@
 </section>
 
 
-
 <!-- Discovery Checklist -->
 <section id='LINK'>
 <h2>Vocabulary Discovery Checklist</h2>
@@ -472,11 +475,7 @@
 <i>Examples of domain: Geography, Environment, Administrations, State Services, Statistics, People, Organization.</i> </p>
 
 <p class="highlight"><b>Identify relevant keywords in the dataset</b><br />
-	<i>What it means:</i> Identify words that describe the main ideas or concepts. 
-By identifying the relevant keywords or categories of your dataset, it helps the 
-searching process using a Semantic Web search engine. <br /><br />
-	<!-- If you have raw data in <a href='http://www.w3.org/TR/gld-glossary/#csv'>CSV</a>, the columns of the tables can be used for the searching process.--> 
-	<i>Examples: commune, county, feature. </i>	
+	<i>What it means:</i> Identify words that describe the main ideas or concepts. By identifying the relevant keywords or categories of your dataset, it helps search engines that employ algorithms that utilize structured data to improve query results. <br /><br />
 </p>
 
 
@@ -487,16 +486,14 @@
 
 
 <p class="highlight"><b>How to find vocabularies</b><br />
-	<i>What it means:</i>There are some specific search tools (<a href="http://ws.nju.edu.cn/falcons/">Falcons</a>, 
+There are search tools that collect, analyze and index vocabularies and semantic data available online for efficient access.  Search tools that use structured data represented as Linked Data include: (<a href="http://ws.nju.edu.cn/falcons/">Falcons</a>, 
 <a href="http://watson.kmi.open.ac.uk/WatsonWUI/">Watson</a>, 
 <a href="http://sindice.com/">Sindice</a>, <a href="http://swse.deri.org/">Semantic Web Search Engine</a>, 
-<a href="http://swoogle.umbc.edu/">Swoogle</a>, <a href="http://schemapedia.com/">Schemapedia</a>) that collect, 
-analyze and index vocabularies and semantic data available online for efficient access.<br /><br />. 
+<a href="http://swoogle.umbc.edu/">Swoogle</a>, and <a href="http://schemapedia.com/">Schemapedia</a>).<br /><br />. 
 Others include the <a href="http://lov.okfn.org/">LOV</a> directory, 
 <a href="http://prefix.cc">Prefix.cc</a>, 
 <a href="http://bioportal.bioontology.org/">Bioportal (biological domain)</a> and the European Commission's
 <a href="https://joinup.ec.europa.eu/catalogue/repository">Joinup platform.</a>
-	<i>Examples: These tools make it possible to perform a search on a relevant term or category present in your data.</i>
 </p>
 
 <p class="highlight"><b>Where to find existing vocabularies in datasets catalogues</b><br />