[フレーム]

Understanding Data

The document presents an in-depth exploration of data, its representation, access, and integration, emphasizing the importance of understanding data through the lenses of entities, relationships, and observations. It discusses the roles of identifiers and URIs in denoting entities and the significance of linked data, especially linked open data, in enhancing structured data representation. Additionally, it highlights the shift from open database connectivity to open data connectivity as a means of democratizing data access across various platforms.

Downloaded 663 times
UNDERSTANDING DATA By Kingsley Idehen Founder & CEO, OpenLink Software
Presentation Goals Deconstruct Data Understand Data Representation Understand Data Access Understand Data Integration License CC-BY-SA 4.0 (International).
SITUATION ANALYSIS License CC-BY-SA 4.0 (International).
EVERY DAY WE HEAR License CC-BY-SA 4.0 (International). DATA IS BIG DATA IS OPEN DATA IS LINKED
WE ALMOST NEVERHEAR ABOUT License CC-BY-SA 4.0 (International). WHAT DATA ACTUALLY IS HOW DATA IS REPRESENTED HOW DATA IS ACCESSED, SHARED, & INTEGRATED
Why is Data Important? Data is the basis of Information, Knowledge, and Wisdom. WISDOM KNOWLEDGE INFORMATION DATA License CC-BY-SA 4.0 (International).
What is Data? Data is how we express Observation in reusable form. License CC-BY-SA 4.0 (International).
What is Observation? Observation is the Perception of Relationships between Entities. YOUR OBSERVATIONS PEOPLE, PLACES, MUSIC, DOCUMENTS, CALENDARS, DIARIES, ADDRESS BOOKS & MORE... License CC-BY-SA 4.0 (International).
What is an Entity? An Entity is a Distinctly Identifiable Thing License CC-BY-SA 4.0 (International).
How is an Entity Identified (Named) ? An Entity is Identified (or named) through the combined effects of Identifier based denotation (signification) and document content based connotation (description). License CC-BY-SA 4.0 (International).
How is an Entity Denoted? An Entity is Denoted (Signified) through the use of an Identifier. License CC-BY-SA 4.0 (International).
What is an Identifier? An Identifier is a Sign (or Token) that Signifies (Denotes, or "Stands For") an Entity License CC-BY-SA 4.0 (International).
Identifier Types? Quoted Literals such as: "Kingsley Idehen" or ‘Kingsley Idehen’ Absolute References: <http://kingsley.idehen.net/dataspace/person/kidehen#this> Relative References: <#KingsleyIdehen> License CC-BY-SA 4.0 (International).
How is an Entity Described? Through entity relationships that are represented in reusable form via document content (sentences and statements). License CC-BY-SA 4.0 (International).
What is a Relationship? A Relationship is an Association between two or more Entities, where each has a specific Role. License CC-BY-SA 4.0 (International).
What is a Relationship Role? A Relationship Role is a Function performed by an Entity in a Relationship License CC-BY-SA 4.0 (International).
Relationship Role Types? • Entity Attribute Value EAV  Entity -- observation focal point  Attribute -- observation attribute name (relationship type determinant)  Value -- observation attribute value • RDF (WC3’s Resource Description Framework)  Subject -- observation focal point  Predicate -- observation attribute name (relationship type determinant)  Object -- observation attribute value License CC-BY-SA 4.0 (International).
Relationship Role: Predicate The Relationship Predicate is the Connector that associates an observation focal point (Subject) with something, in the form of an observation value (Object). License CC-BY-SA 4.0 (International).
Relationship Role: Subject Actual Entity being Observed License CC-BY-SA 4.0 (International).
Relationship Role: Object Value associated with an observation focal point (Subject) via a Relationship Predicate. License CC-BY-SA 4.0 (International).
Types of Values? • Untyped Literals (Strings) • Typed Literals  Numbers  Dates  Booleans  Etc. • References (Local and Global Hyperlinks) License CC-BY-SA 4.0 (International).
How are Relationships Expressed? Relationships are Expressed using a Language, i.e., a system of signs [for denotation], syntax [arrangement of signs to form sentences], and entity relation semantics [meaning of relationship roles] for encoding and decoding information. Example: Subject, Predicate, Object – Used by W3C’s Resource Description Framework (RDF) and Natural Language. License CC-BY-SA 4.0 (International).
How Are Entity Relationships Represented ? Entity Relationships are Represented using notations associated with a specific language. Examples include: • Entity Relationship Model (Network /Graph) Diagrams. • Tables (CSV files, Spreadsheets, and SQL Relational Database Management Systems). • RDF-Turtle, JSON-LD, RDF/XML, HTML+Microdata, HTML+RDFa etc.. License CC-BY-SA 4.0 (International).
Entity Relationship Diagram <#hasCapital> License CC-BY-SA 4.0 (International). <#PopulatedPlace> "France" <#Paris> <#type> <#hasLabel> <#France>
Turtle Notation Based Entity Relationship Statements <#France> <#Type> <#PopulatedPlace> . <#France> <#hasLabel> "France" . <#France> <#hasCapital> <#Paris> . <#Paris> <#Type> <#PopulatedPlace> . <#Paris> <#hasLabel> "Paris" . <#PopulatedPlace> <#Type> <#Place> . License CC-BY-SA 4.0 (International).
Entity Relationship Tables Delimiter: e.g., Comma Identifier Quote Character: Double-quotes Relation Header Row: Entity,Attribute,Value Relation Body Example: "Entity", "Attribute" "Value" "France", "Type" "PopulatedPlace" "France" , "hasLabel" "France" "France" , "hasCapital" "Paris" License CC-BY-SA 4.0 (International).
Statement Representation: Spreadsheet Tables Entity (Subject) Attribute (Predicate) Value (Object) #France #Type #PopulatedPlace #France #hasLabel "France" #France #hasCapital #Paris #Paris #Type #PopulatedPlace #Paris #hasLabel "Paris" #PopulatedPlace #Type #Place License CC-BY-SA 4.0 (International).
How are Statements Persisted & Transmitted? • Persistence:  To paper based documents  To digital realm documents (e.g., operating system files, web pages, etc.) • Transmission:  Text oriented serialization formats  Binary serialization formats License CC-BY-SA 4.0 (International).
Understanding Data (Recap) • The term "Data" refers to observation expressed in reusable form. • The term "Observation" refers to our perception of Entity Relationships. • Entity Relationships are expressed using a language. • Statements are represented using a variety of notations; persisted to paper or digital documents; and transmissible using a variety of serialization formats. License CC-BY-SA 4.0 (International).
DATA ACCESS License CC-BY-SA 4.0 (International).
Fundamental Challenge Access to Data Independent of: • Location (File or Database Management System) • Representation Notation • Serialization Format • Transmission Protocol • Host Operating Systems • Consumer Applications License CC-BY-SA 4.0 (International).
Critical Components • Identifiers that denote (signify) each entity associated with the following relationship roles:  Entity (Subject)  Attribute (Predicate)  Value (Object) • Identifiers that denote entity description documents (Descriptors) • Identifiers that provide entity naming (identification) via implicit or explicit [denotation]  [description document content] resolution using indirection (i.e., combined effect of denotation & connotation to deliver identification or sense) • Name Resolution Protocols • Document Content Serialization Formats License CC-BY-SA 4.0 (International).
Entity Identifiers (Names) Uniform Resource Identifier (URI) <http://kingsley.idehen.net/dataspace/person/kidehen#this> – WebID (i.e., an HTTP URI that denotes an Entity of Type: Agent (Person, Organization, Software, Robot etc) ODBC Data Source Name (DSN) DSN=CRM JDBC Data Source Name (DSN) DSN=CRM License CC-BY-SA 4.0 (International).
Entity Description Document Locators • Uniform Resource (Data) Locator (URL) o <http://kingsley.idehen.net/dataspace/person/kidehen> – an HTTP URI that denotes a Document on an HTTP Network • ODBC Data Source Name o DSN=CRM;HOST=crm.example.org;SVT=Oracle;DATABASE=CRM;TABLE=CUSTOMER – denotes an ODBC accessible Table in a SQL RDBMS • JDBC Data Source URL o jdbc:openlink://crm.example.org/SVT=Oracle/DATABASE=CRM/TABLE=CUSTOMER – denotes a JDBC accessible Table in a SQL RDBMS License CC-BY-SA 4.0 (International).
ODBC Data Source Name Challenges • SQL Relational Database Specific. • Identifiers are x.500 names that are only understood by operating system locked applications. • Identifiers denote RDBMS specific tables, views, users, and stored procedures. License CC-BY-SA 4.0 (International).
JDBC Data Source Name Challenges • SQL Relational Database Specific. • Identifiers are "jdbc:" scheme URIs that are only understood by JDBC compliant applications constrained by Java Virtual Machine (JVM). • Identifiers denote RDBMS specific tables, views, users, and stored procedures. License CC-BY-SA 4.0 (International).
HTTP URI based Data Source Name Virtues • Database Engine Independent. • Data Access Protocol Independent. • Data Representation Format Independent. • Identifiers are Literals and/or References (which globalize lookup scope). • Identifiers denote anything, i.e., an kind of entity. • Identifiers are "terms" that resolve to referent description documents, globally. License CC-BY-SA 4.0 (International).
Data Source Name Resolution Protocols • Internet based Computer Network – Domain Name Services (DNS) protocol provides Name Resolution for Computers. • World Wide Web Document Network – HTTP provides Name Resolution for Web Documents via HTTP URLs. • World Wide Web Data Network – HTTP provides Name Resolution for Entities via HTTP URIs . License CC-BY-SA 4.0 (International).
DNS based Linked Computer Network (Internet) Linked Computer Network (e.g., Internet) 1. Computer (DNS CNAMES) Names are Data Source Name 2. Actual Data Model and Data Access is Local and Machine OS hosted App. specific. License CC-BY-SA 4.0 (International). Internet
HTTP based Linked Document Network (Web 1.0 & 2.0) Linked Document Network (e.g., World Wide Web) 1. Computer (DNS CNAMES) Names become irrelevant. 2. Document Locators / Addresses (HTTP URLs) are Data Source Names (DSNs). 3. One kind of Relation i.e., "LinksTo" is what connects the Documents. 4. To machines: actual Data Model, Entity Relation Semantics, and Representation Notations are indecipherable from content. Internet Web License CC-BY-SA 4.0 (International).
HTTP based Linked Data Network (Web 3.0) Linked Data Network (e.g., Linked Open Data Cloud) 1. Entity Names (HTTP URIs) are Data Source Names (DSNs) 2. Computer (DNS CNAMES) & Document Names (HTTP URLs) become irrelevant 3. Actual Data Model and Representation Notations are loosely coupled. License CC-BY-SA 4.0 (International). Internet Web Linked Data
LINKED DATA (WEBBY STRUCTURED DATA) License CC-BY-SA 4.0 (International).
Linked Data Fundamentals • Denote ("refer to" or name) entities unambiguously using URIs – similar to the role of "words" in natural language. • Use HTTP URIs so that the description of any entity can be looked up using any HTTP user agent – similar to the role of "terms " in natural language. • Use human and machine readable statements (via open standards e.g., RDF) to create document content that describes entities. • Refer to other entities using their HTTP URI based names in your entity description documents – i.e., – expand the Web! License CC-BY-SA 4.0 (International).
Understanding HTTP URI Entity Name and Description Doc Address Duality An HTTP URI is a kind of identifier that denotes ("Refers To") an entity while also resolving to its description document, over an HTTP Network. License CC-BY-SA 4.0 (International).
What is Linked Data? Linked Data is the use of Resolvable URIs to enhance Structured Data Representation. Basically: Representing Entity Relationships using Statements where the relationship role participants [Subject, Predicate, and Object (optionally)] are unambiguously "referred to" using Resolvable URIs. License CC-BY-SA 4.0 (International).
What is Linked Open Data? Linked Open Data is the use of HTTP URIs to enhance Structured Data Representation. Basically: Representing Entity Relationships using Statements where the relationship role participants [Subject, Predicate, and Object (optionally)] are unambiguously "referred to" using HTTP URIs. Note: URIs and HTTP are Open Standards License CC-BY-SA 4.0 (International).
Why is Linked Open Data Important? • It turns HTTP URIs (Hyperlinks) into Data Source Names. • It moves us from Open Database Connectivity to Open Data Connectivity – that scales from Private Data Spaces to the World Wide Web. • It delivers a powerful mechanism for virtualization of disparate and heterogeneous data sources (big or small) i.e., Data De-Silo-Fication. • It is inherently Platform Agnostic. • It delivers a Linked Open Data Cloud that scales to the World Wide Web. License CC-BY-SA 4.0 (International).
What is RDF based Linked Data? RDF-based Linked Data is the use of IRIs and Entity Relationship Type (aka. Relations) Semantics to enhance Structured Data Representation. Basically: Representing Entity Relationships using Statements where the relationship role participants [Subject, Predicate, and Object (optionally)] are unambiguously "referred to" using IRIs. Note: RDF and IRIs are Open Standards License CC-BY-SA 4.0 (International).
What is RDF based Linked Open Data? RDF-based Linked Open Data is the use of HTTP URIs & Entity Relationship Type (Relations) Semantics to enhance Structured Data Representation. Basically: Representing Entity Relationships and Relation Semantics using Statements where the relationship role participants [Subject, Predicate, and Object (optionally)] are unambiguously "referred to" using HTTP URIs. Note: RDF, HTTP and URIs are Open Standards License CC-BY-SA 4.0 (International).
What is RDF based Linked Data? RDF-based Linked Data is Web-Like Structured Data enhanced with RDF’s *explicit* machine-and human-comprehensible Entity Relationship Semantics. Identifiers, Structured Data Representation, and Logic License CC-BY-SA 4.0 (International). Linked Data RDF Predicate Logic (Entity Relationship Semantics)
RDF based Linked Open Data (Semantic Web) Semantically Enhanced Linked Data Network (e.g., Semantic Web of Big Linked Open Data) License CC-BY-SA 4.0 (International). Internet Web Linked Data Relation 1. Entity Names (HTTP URIs) are Semantics Data Source Names (DSNs) 2. Computer (DNS CNAMES) & Document Names (HTTP URLs) become irrelevant 3. Actual Data Model and Representation Notations are loosely coupled 4. RDF & RDF Schema Relation Semantics are accessible and comprehendible to humans and machines.
Local Linked Data (Inaccessible) Entity (Subject) Attribute (Predicate) Value (Object) urn:data:object:id:France urn:data:object:id:Type urn:data:object:id:Popula tedPlace urn:data:object:id:France urn:data:object:id:hasLabel "France" urn:data:object:id:France urn:data:object:id:hasCapital urn:data:object:id:Paris urn:data:object:id:Paris urn:data:object:id:Type urn:data:object:id:Popula tedPlace urn:data:object:id:Paris urn:data:object:id:hasLabel "Paris" urn:data:object:id:Populate dPlace urn:data:object:id:Type urn:data:object:id:Place License CC-BY-SA 4.0 (International).
Linked Data (Accessible Webby Data) Entity (Subject) Attribute (Predicate) Value (Object) http://dbpedia.org/resource/France http://www.w3.org/1999/02/22-rdf-syntax- ns#type http://dbpedia.org/ontology/Popula tedPlace http://dbpedia.org/resource/France http://www.w3.org/2000/01/rdf-schema# label "France" http://dbpedia.org/resource/France http://dbpedia.org/ontology/capital http://dbpedia.org/resource/Paris http://dbpedia.org/resource/Paris http://www.w3.org/1999/02/22-rdf-syntax- ns#type http://dbpedia.org/ontology/Popula tedPlace http://dbpedia.org/resource/Paris http://www.w3.org/2000/01/rdf-schema# label "Paris" http://dbpedia.org/ontology/Popula tedPlace http://www.w3.org/2000/01/rdf-schema# subClassOf http://dbpedia.org/ontology/Place License CC-BY-SA 4.0 (International).
Massive Linked Open Data Cloud License CC-BY-SA 4.0 (International).
NATURAL LANGUAGE & DATA "Natural Languages are the most sophisticated systems of communication ever developed." – John F. Sowa "Once you have a truly massive amount of information integrated as knowledge, then the human-software system will be superhuman, in the same sense that mankind with writing is superhuman compared to mankind before writing." – Douglas Lenat License CC-BY-SA 4.0 (International).
Natural Language & Data • A Word or Phrase is an identifier that names an Entity (thing) via implicit [denotation][referent description document content] resolution • A Term is a Word or Phrase that names an Entity via explicit, [denotation][referent description document content] resolution, using indirection. • A Sentence is a syntax rules constrained arrangement of Words and Phrases that represent types of Entity Relationships. • A Statement is a kind of Sentence constructed from Terms. License CC-BY-SA 4.0 (International).
Data (Recap) • A IRI is an Internationalized Identifier that has the entity naming characteristics of a Word or Phrase. • An HTTP URI is a kind of IRI that has the entity naming characteristics of a Term i.e., denotation (signification) and connotation (description) reference duality. • RDF enables digital sentence construction where IRIs are used to name Entities participating in the Subject, Predicate, and Object relationship roles. • RDF based Linked Data enables digital statement construction where HTTP URIs are used to denote Entities participating in the Subject, Predicate, and Object relationship roles. License CC-BY-SA 4.0 (International).
Natural Language & Data Connection • An RDF triple represents a "Datum" – a Sentence compromised of Words or Phrases. • An RDF based Linked Open Data Triple represents a "Webby Datum" – a Statement comprised of Terms. • RDF triple collections represent Data – Sentences. • RDF based Linked Open Data triple collections represent "Webby Data" – Statements. License CC-BY-SA 4.0 (International).
Live Additional Information Links An Glossary of terms, in Linked Data form: • Data • Big Data • Open Data • Public Open Data • Linked Data • Linked Open Data • Semantic Web • Resource Description Framework (RDF) License CC-BY-SA 4.0 (International).
References • The Role of Logic and Ontology in Language and Reasoning --- John F. Sowa • Blogic – Pat Hayes • Unified View of Data – Peter Chen • Levels of Abstraction: Net, Web, Graph – Tim Berners- Lee • What is Data? What is a Datum – Ontolog Forum Thread • Data & Relations – Ontolog Forum Thread. License CC-BY-SA 4.0 (International).
Additional Information Web Sites OpenLink Software YouID – Digital Identity Card (Certificate) Generator OpenLink Data Spaces – Semantically enhanced Personal & Enterprise Data Spaces & Collaboration Platform OpenLink Virtuoso - Hybrid Data Management, Integration, Application, and Identity Server Universal Data Access Drivers - High-Performance ODBC, JDBC, ADO.NET, and OLE-DB Drivers Social Media Data spaces http://kidehen.blogspot.com (weblog) http://www.openlinksw.com/blog/~kidehen/ (weblog) https://plus.google.com/112399767740508618350/posts (Google+) https://twitter.com/#!/kidehen (Twitter) Hashtag: #LinkedData (Anywhere). License CC-BY-SA 4.0 (International).

More Related Content

Introduction to Computational Social Science - Lecture 1
PPTX
Introduction to Computational Social Science - Lecture 1
Data Science Lifecycle
PPTX
Data Science Lifecycle
BIG DATA and USE CASES
PPTX
BIG DATA and USE CASES
Machine Learning and Data Mining
PPTX
Predictive analytics
PPTX
Predictive analytics
Introductio to Data Science and types of data
PPTX
Introductio to Data Science and types of data
Gui programming
PPTX
Gui programming
Multidimensional schema of data warehouse
PPTX
Multidimensional schema of data warehouse
Introduction to Computational Social Science - Lecture 1
Introduction to Computational Social Science - Lecture 1
Data Science Lifecycle
Data Science Lifecycle
BIG DATA and USE CASES
BIG DATA and USE CASES
Machine Learning and Data Mining
Predictive analytics
Predictive analytics
Introductio to Data Science and types of data
Introductio to Data Science and types of data
Gui programming
Gui programming
Multidimensional schema of data warehouse
Multidimensional schema of data warehouse

What's hot

pandas: Powerful data analysis tools for Python
PDF
pandas: Powerful data analysis tools for Python
Introduction to Django Rest Framework
PPTX
Introduction to Django Rest Framework
Introduction to PySpark
PDF
Introduction to PySpark
01 Data Mining: Concepts and Techniques, 2nd ed.
PPT
01 Data Mining: Concepts and Techniques, 2nd ed.
Data Analysis and Visualization using Python
PDF
Data Analysis and Visualization using Python
An Introduction to Information Retrieval and Applications
PPTX
An Introduction to Information Retrieval and Applications
PPT on Data Science Using Python
PPTX
PPT on Data Science Using Python
null.pptx
PPTX
null.pptx
Introduction to MongoDB.pptx
PPTX
Introduction to MongoDB.pptx
Id and class selector
PPTX
Id and class selector
Data mining :Concepts and Techniques Chapter 2, data
PPT
Data mining :Concepts and Techniques Chapter 2, data
multi dimensional data model
PPTX
multi dimensional data model
PySpark dataframe
PPTX
PySpark dataframe
Probabilistic information retrieval models & systems
PPTX
Probabilistic information retrieval models & systems
Python for Data Science
PDF
Python for Data Science
The Basics of MongoDB
PPTX
The Basics of MongoDB
Information Retrieval
PPTX
Information Retrieval
Probabilistic Retrieval
PDF
Probabilistic Retrieval
Data visualization
PPTX
Data visualization
Classification of data
PPTX
Classification of data
pandas: Powerful data analysis tools for Python
pandas: Powerful data analysis tools for Python
Introduction to Django Rest Framework
Introduction to Django Rest Framework
Introduction to PySpark
Introduction to PySpark
01 Data Mining: Concepts and Techniques, 2nd ed.
01 Data Mining: Concepts and Techniques, 2nd ed.
Data Analysis and Visualization using Python
Data Analysis and Visualization using Python
An Introduction to Information Retrieval and Applications
An Introduction to Information Retrieval and Applications
PPT on Data Science Using Python
PPT on Data Science Using Python
null.pptx
null.pptx
Introduction to MongoDB.pptx
Introduction to MongoDB.pptx
Id and class selector
Id and class selector
Data mining :Concepts and Techniques Chapter 2, data
Data mining :Concepts and Techniques Chapter 2, data
multi dimensional data model
multi dimensional data model
PySpark dataframe
PySpark dataframe
Probabilistic information retrieval models & systems
Probabilistic information retrieval models & systems
Python for Data Science
Python for Data Science
The Basics of MongoDB
The Basics of MongoDB
Information Retrieval
Information Retrieval
Probabilistic Retrieval
Probabilistic Retrieval
Data visualization
Data visualization
Classification of data
Classification of data

Viewers also liked

Data vs. information
PPT
Data vs. information
BII The Internet Of Everything 2015
PPTX
BII The Internet Of Everything 2015
Data and information
PPTX
Data and information
Pp1 data, information & knowledge
PPT
Pp1 data, information & knowledge
Data presentation 2
PPT
Data presentation 2
Nature of Inquiry and Research
PPT
Nature of Inquiry and Research
CRISP-DM - Agile Approach To Data Mining Projects
PDF
CRISP-DM - Agile Approach To Data Mining Projects
Financing Innovations For The Bottom Of The Pyramid
PPT
Financing Innovations For The Bottom Of The Pyramid
DMann-SQLDeveloper4Reporting
PDF
DMann-SQLDeveloper4Reporting
Sql rally amsterdam Aanalysing data with Power BI and Hive
PPTX
Sql rally amsterdam Aanalysing data with Power BI and Hive
Oracle OpenWorld 2011 - Oracle Application Express within the Oracle SOA Suite
PPTX
Oracle OpenWorld 2011 - Oracle Application Express within the Oracle SOA Suite
OpenLink Virtuoso - Management & Decision Makers Overview
PPTX
OpenLink Virtuoso - Management & Decision Makers Overview
Sql saturday denmark power bi for pdf
PDF
Sql saturday denmark power bi for pdf
Oracle Apex Technical Introduction
PPTX
Oracle Apex Technical Introduction
Oracle Apex Overview
PPT
Oracle Apex Overview
How to save 30% on your internal printing expenses
PPT
How to save 30% on your internal printing expenses
Quality tools
PDF
Quality tools
Ways in stating research problem.report
PPTX
Ways in stating research problem.report
Crisp-DM
PDF
Crisp-DM
Chapter 1 types of research
PPTX
Chapter 1 types of research
Data vs. information
Data vs. information
BII The Internet Of Everything 2015
BII The Internet Of Everything 2015
Data and information
Data and information
Pp1 data, information & knowledge
Pp1 data, information & knowledge
Data presentation 2
Data presentation 2
Nature of Inquiry and Research
Nature of Inquiry and Research
CRISP-DM - Agile Approach To Data Mining Projects
CRISP-DM - Agile Approach To Data Mining Projects
Financing Innovations For The Bottom Of The Pyramid
Financing Innovations For The Bottom Of The Pyramid
DMann-SQLDeveloper4Reporting
DMann-SQLDeveloper4Reporting
Sql rally amsterdam Aanalysing data with Power BI and Hive
Sql rally amsterdam Aanalysing data with Power BI and Hive
Oracle OpenWorld 2011 - Oracle Application Express within the Oracle SOA Suite
Oracle OpenWorld 2011 - Oracle Application Express within the Oracle SOA Suite
OpenLink Virtuoso - Management & Decision Makers Overview
OpenLink Virtuoso - Management & Decision Makers Overview
Sql saturday denmark power bi for pdf
Sql saturday denmark power bi for pdf
Oracle Apex Technical Introduction
Oracle Apex Technical Introduction
Oracle Apex Overview
Oracle Apex Overview
How to save 30% on your internal printing expenses
How to save 30% on your internal printing expenses
Quality tools
Quality tools
Ways in stating research problem.report
Ways in stating research problem.report
Crisp-DM
Crisp-DM
Chapter 1 types of research
Chapter 1 types of research

Similar to Understanding Data

Understanding data -latest
PPTX
Understanding data -latest
DBMS Unit 1 nice content please download it
PDF
DBMS Unit 1 nice content please download it
DBMS: Week 03 - Data Models and ER Model
PPTX
DBMS: Week 03 - Data Models and ER Model
Graphics designing.pptx
PPTX
Graphics designing.pptx
Chapter – 2 Data Models.pdf
PDF
Chapter – 2 Data Models.pdf
dbms ppt parul university dbms course for
PPTX
dbms ppt parul university dbms course for
Introduction to Application Profiles
PPTX
Introduction to Application Profiles
Topics In Data Science With Practical Examples Abdolreza Abhari
PDF
Topics In Data Science With Practical Examples Abdolreza Abhari
Presentation1
PPTX
Presentation1
Linked Data and RDA: Looking at Next-Generation Cataloging
PPTX
Linked Data and RDA: Looking at Next-Generation Cataloging
Introduction to database
PPTX
Introduction to database
Linked data for Libraries, Archives, Museums
PPTX
Linked data for Libraries, Archives, Museums
Publishing Linked Data using Schema.org
PDF
Publishing Linked Data using Schema.org
DATA MODELS type discussion samples for schools
PPTX
DATA MODELS type discussion samples for schools
DATA MODELS samples and expmples detailed
PPTX
DATA MODELS samples and expmples detailed
Using MongoDB as a high performance graph database
PDF
Using MongoDB as a high performance graph database
DATA MODEL PRESENTATION UNIT I-BCA I.pptx
PPTX
DATA MODEL PRESENTATION UNIT I-BCA I.pptx
Database Systems - Entity Relationship Modeling (Chapter 4/2)
PDF
Database Systems - Entity Relationship Modeling (Chapter 4/2)
Publishing and Using Linked Data
PDF
Publishing and Using Linked Data
Linked Data Modeling for Beginner
PPTX
Linked Data Modeling for Beginner
Understanding data -latest
Understanding data -latest
DBMS Unit 1 nice content please download it
DBMS Unit 1 nice content please download it
DBMS: Week 03 - Data Models and ER Model
DBMS: Week 03 - Data Models and ER Model
Graphics designing.pptx
Graphics designing.pptx
Chapter – 2 Data Models.pdf
Chapter – 2 Data Models.pdf
dbms ppt parul university dbms course for
dbms ppt parul university dbms course for
Introduction to Application Profiles
Introduction to Application Profiles
Topics In Data Science With Practical Examples Abdolreza Abhari
Topics In Data Science With Practical Examples Abdolreza Abhari
Presentation1
Presentation1
Linked Data and RDA: Looking at Next-Generation Cataloging
Linked Data and RDA: Looking at Next-Generation Cataloging
Introduction to database
Introduction to database
Linked data for Libraries, Archives, Museums
Linked data for Libraries, Archives, Museums
Publishing Linked Data using Schema.org
Publishing Linked Data using Schema.org
DATA MODELS type discussion samples for schools
DATA MODELS type discussion samples for schools
DATA MODELS samples and expmples detailed
DATA MODELS samples and expmples detailed
Using MongoDB as a high performance graph database
Using MongoDB as a high performance graph database
DATA MODEL PRESENTATION UNIT I-BCA I.pptx
DATA MODEL PRESENTATION UNIT I-BCA I.pptx
Database Systems - Entity Relationship Modeling (Chapter 4/2)
Database Systems - Entity Relationship Modeling (Chapter 4/2)
Publishing and Using Linked Data
Publishing and Using Linked Data
Linked Data Modeling for Beginner
Linked Data Modeling for Beginner

More from Kingsley Uyi Idehen

Virtuoso Platform Overview
PPTX
Virtuoso Platform Overview
LOD Cloud Knowledge Graph vs COVID-19
PPTX
LOD Cloud Knowledge Graph vs COVID-19
Enterprise & Web based Federated Identity Management & Data Access Controls
PPTX
Enterprise & Web based Federated Identity Management & Data Access Controls
Virtuoso, The Prometheus of RDF -- Sematics 2014 Conference Keynote
PPTX
Virtuoso, The Prometheus of RDF -- Sematics 2014 Conference Keynote
HTML5 based PivotViewer for Visualizing LInked Data
PPT
HTML5 based PivotViewer for Visualizing LInked Data
Sigma Knowledge Engineering Environment
PDF
Sigma Knowledge Engineering Environment
Linked Open Data (LOD) Cloud & Ontology Life Cycles
PPTX
Linked Open Data (LOD) Cloud & Ontology Life Cycles
ISWC 2012 - Linked Data Meetup
PPT
ISWC 2012 - Linked Data Meetup
Knowledge Design Patterns (by John F. Sowa)
PDF
Knowledge Design Patterns (by John F. Sowa)
Accessing the Linked Open Data Cloud via ODBC
PPT
Accessing the Linked Open Data Cloud via ODBC
Virtuoso ODBC Driver Configuration & Usage (Mac OS X)
PPT
Virtuoso ODBC Driver Configuration & Usage (Mac OS X)
Virtuoso ODBC Driver Configuration & Usage (Windows)
PPT
Virtuoso ODBC Driver Configuration & Usage (Windows)
Exploiting Linked Data via Filemaker
PPT
Exploiting Linked Data via Filemaker
Tableau Desktop as a Linked (Open) Data Front-End via ODBC
PPT
Tableau Desktop as a Linked (Open) Data Front-End via ODBC
Using SAP Crystal Reports as a Linked (Open) Data Front-End via ODBC
PPT
Using SAP Crystal Reports as a Linked (Open) Data Front-End via ODBC
Exploiting Linked (Open) Data via Microsoft Access using ODBC File DSNs
PPT
Exploiting Linked (Open) Data via Microsoft Access using ODBC File DSNs
Using Tibco SpotFire (via Virtuoso ODBC) as Linked Data Front-end
PPT
Using Tibco SpotFire (via Virtuoso ODBC) as Linked Data Front-end
Exploiting Linked (Open) Data via Microsoft Access
PPT
Exploiting Linked (Open) Data via Microsoft Access
Integrating Semantic Systems
PDF
Integrating Semantic Systems
Understanding Linked Data via EAV Model based Structured Descriptions
PPT
Understanding Linked Data via EAV Model based Structured Descriptions
Virtuoso Platform Overview
Virtuoso Platform Overview
LOD Cloud Knowledge Graph vs COVID-19
LOD Cloud Knowledge Graph vs COVID-19
Enterprise & Web based Federated Identity Management & Data Access Controls
Enterprise & Web based Federated Identity Management & Data Access Controls
Virtuoso, The Prometheus of RDF -- Sematics 2014 Conference Keynote
Virtuoso, The Prometheus of RDF -- Sematics 2014 Conference Keynote
HTML5 based PivotViewer for Visualizing LInked Data
HTML5 based PivotViewer for Visualizing LInked Data
Sigma Knowledge Engineering Environment
Sigma Knowledge Engineering Environment
Linked Open Data (LOD) Cloud & Ontology Life Cycles
Linked Open Data (LOD) Cloud & Ontology Life Cycles
ISWC 2012 - Linked Data Meetup
ISWC 2012 - Linked Data Meetup
Knowledge Design Patterns (by John F. Sowa)
Knowledge Design Patterns (by John F. Sowa)
Accessing the Linked Open Data Cloud via ODBC
Accessing the Linked Open Data Cloud via ODBC
Virtuoso ODBC Driver Configuration & Usage (Mac OS X)
Virtuoso ODBC Driver Configuration & Usage (Mac OS X)
Virtuoso ODBC Driver Configuration & Usage (Windows)
Virtuoso ODBC Driver Configuration & Usage (Windows)
Exploiting Linked Data via Filemaker
Exploiting Linked Data via Filemaker
Tableau Desktop as a Linked (Open) Data Front-End via ODBC
Tableau Desktop as a Linked (Open) Data Front-End via ODBC
Using SAP Crystal Reports as a Linked (Open) Data Front-End via ODBC
Using SAP Crystal Reports as a Linked (Open) Data Front-End via ODBC
Exploiting Linked (Open) Data via Microsoft Access using ODBC File DSNs
Exploiting Linked (Open) Data via Microsoft Access using ODBC File DSNs
Using Tibco SpotFire (via Virtuoso ODBC) as Linked Data Front-end
Using Tibco SpotFire (via Virtuoso ODBC) as Linked Data Front-end
Exploiting Linked (Open) Data via Microsoft Access
Exploiting Linked (Open) Data via Microsoft Access
Integrating Semantic Systems
Integrating Semantic Systems
Understanding Linked Data via EAV Model based Structured Descriptions
Understanding Linked Data via EAV Model based Structured Descriptions

Recently uploaded

UiPath Automation Suite Installation (Hands-On) [2/3]
PPTX
UiPath Automation Suite Installation (Hands-On) [2/3]
Application Monitoring and Observability: Elastic Stack Solution for Producti...
PDF
Application Monitoring and Observability: Elastic Stack Solution for Producti...
UnityNet Digital Sovereignty Checklist 2025年10月13日
PDF
UnityNet Digital Sovereignty Checklist 2025年10月13日
A fool with a tool is still a fool - Plone Conference 2025
PDF
A fool with a tool is still a fool - Plone Conference 2025
Fairness and Bias in AI Ethics and Explainability
PDF
Fairness and Bias in AI Ethics and Explainability
Data Structure - 12 Graph
PDF
Data Structure - 12 Graph
Hybrid Active Directory Cyber Resiliency
PPTX
Hybrid Active Directory Cyber Resiliency
BioVault.net ADW 2025 CODATA: SyftBox: a General Purpose Solution for Data Vi...
PPTX
BioVault.net ADW 2025 CODATA: SyftBox: a General Purpose Solution for Data Vi...
Using Copilot with Microsoft Office Apps
PDF
Using Copilot with Microsoft Office Apps
Dragino商品カタログ 2025.7 LoRaWAN・NB-IoT・LTE-M(LTE Cat.M1)対応センサーリスト
PDF
Dragino商品カタログ 2025.7 LoRaWAN・NB-IoT・LTE-M(LTE Cat.M1)対応センサーリスト
What is Google Cloud Platform (GCP)? A 5-Minute Guide for Beginners (2025)
PDF
What is Google Cloud Platform (GCP)? A 5-Minute Guide for Beginners (2025)
Ethical Initiatives in AI and accountability.pdf
PDF
Ethical Initiatives in AI and accountability.pdf
Getting the Best of TrueDEM – October News & Updates
PDF
Getting the Best of TrueDEM – October News & Updates
KV Caching Strategies for Latency-Critical LLM Applications by John Thomson
PDF
KV Caching Strategies for Latency-Critical LLM Applications by John Thomson
Unlock This Brilliant App Freezur That Builds Brainrot Videos That Captivate ...
PDF
Unlock This Brilliant App Freezur That Builds Brainrot Videos That Captivate ...
GPUS and How to Program Them by Manya Bansal
PDF
GPUS and How to Program Them by Manya Bansal
Meta and Apple close to settling EU cases.pdf
PDF
Meta and Apple close to settling EU cases.pdf
Beyond Generative AI: Creating Demand for Compute in the 2030s
PDF
Beyond Generative AI: Creating Demand for Compute in the 2030s
The Journey Is The Reward - Apple Maps Travel Companion
PDF
The Journey Is The Reward - Apple Maps Travel Companion
NSSHOEU-J 4x35mm2 Cable Specification.pdf
PDF
NSSHOEU-J 4x35mm2 Cable Specification.pdf
UiPath Automation Suite Installation (Hands-On) [2/3]
UiPath Automation Suite Installation (Hands-On) [2/3]
Application Monitoring and Observability: Elastic Stack Solution for Producti...
Application Monitoring and Observability: Elastic Stack Solution for Producti...
UnityNet Digital Sovereignty Checklist 2025年10月13日
UnityNet Digital Sovereignty Checklist 2025年10月13日
A fool with a tool is still a fool - Plone Conference 2025
A fool with a tool is still a fool - Plone Conference 2025
Fairness and Bias in AI Ethics and Explainability
Fairness and Bias in AI Ethics and Explainability
Data Structure - 12 Graph
Data Structure - 12 Graph
Hybrid Active Directory Cyber Resiliency
Hybrid Active Directory Cyber Resiliency
BioVault.net ADW 2025 CODATA: SyftBox: a General Purpose Solution for Data Vi...
BioVault.net ADW 2025 CODATA: SyftBox: a General Purpose Solution for Data Vi...
Using Copilot with Microsoft Office Apps
Using Copilot with Microsoft Office Apps
Dragino商品カタログ 2025.7 LoRaWAN・NB-IoT・LTE-M(LTE Cat.M1)対応センサーリスト
Dragino商品カタログ 2025.7 LoRaWAN・NB-IoT・LTE-M(LTE Cat.M1)対応センサーリスト
What is Google Cloud Platform (GCP)? A 5-Minute Guide for Beginners (2025)
What is Google Cloud Platform (GCP)? A 5-Minute Guide for Beginners (2025)
Ethical Initiatives in AI and accountability.pdf
Ethical Initiatives in AI and accountability.pdf
Getting the Best of TrueDEM – October News & Updates
Getting the Best of TrueDEM – October News & Updates
KV Caching Strategies for Latency-Critical LLM Applications by John Thomson
KV Caching Strategies for Latency-Critical LLM Applications by John Thomson
Unlock This Brilliant App Freezur That Builds Brainrot Videos That Captivate ...
Unlock This Brilliant App Freezur That Builds Brainrot Videos That Captivate ...
GPUS and How to Program Them by Manya Bansal
GPUS and How to Program Them by Manya Bansal
Meta and Apple close to settling EU cases.pdf
Meta and Apple close to settling EU cases.pdf
Beyond Generative AI: Creating Demand for Compute in the 2030s
Beyond Generative AI: Creating Demand for Compute in the 2030s
The Journey Is The Reward - Apple Maps Travel Companion
The Journey Is The Reward - Apple Maps Travel Companion
NSSHOEU-J 4x35mm2 Cable Specification.pdf
NSSHOEU-J 4x35mm2 Cable Specification.pdf
In this document
Powered by AI

The presentation introduces the topic of data and its importance, aiming to deconstruct data, representation, access, and integration.

The slides elaborate on data's significance, concepts of observation, entities, identifiers, and denotation in data representation.

These slides explain the relationships between entities and their components, focusing on subjects, predicates, and objects.

Different methods of expressing relationships and representing entity relationships are discussed, including various notation systems.

A discussion on data access challenges, identifiers, and resolution protocols related to entity access over networks.

Fundamentals of linked data, important concepts, and benefits of linked open data and its role in enhancing data representation.

Details on RDF-based linked data, its importance, and how it enhances structured data representation for semantic web applications.

The connection between natural languages and data representation, concluding with resources and references for further exploration.

Understanding Data

  • 1.
    UNDERSTANDING DATA By Kingsley Idehen Founder & CEO, OpenLink Software
  • 2.
    Presentation Goals Deconstruct Data Understand Data Representation Understand Data Access Understand Data Integration License CC-BY-SA 4.0 (International).
  • 3.
    SITUATION ANALYSIS License CC-BY-SA 4.0 (International).
  • 4.
    EVERY DAY WE HEAR License CC-BY-SA 4.0 (International). DATA IS BIG DATA IS OPEN DATA IS LINKED
  • 5.
    WE ALMOST NEVERHEAR ABOUT License CC-BY-SA 4.0 (International). WHAT DATA ACTUALLY IS HOW DATA IS REPRESENTED HOW DATA IS ACCESSED, SHARED, & INTEGRATED
  • 6.
    Why is Data Important? Data is the basis of Information, Knowledge, and Wisdom. WISDOM KNOWLEDGE INFORMATION DATA License CC-BY-SA 4.0 (International).
  • 7.
    What is Data? Data is how we express Observation in reusable form. License CC-BY-SA 4.0 (International).
  • 8.
    What is Observation? Observation is the Perception of Relationships between Entities. YOUR OBSERVATIONS PEOPLE, PLACES, MUSIC, DOCUMENTS, CALENDARS, DIARIES, ADDRESS BOOKS & MORE... License CC-BY-SA 4.0 (International).
  • 9.
    What is an Entity? An Entity is a Distinctly Identifiable Thing License CC-BY-SA 4.0 (International).
  • 10.
    How is an Entity Identified (Named) ? An Entity is Identified (or named) through the combined effects of Identifier based denotation (signification) and document content based connotation (description). License CC-BY-SA 4.0 (International).
  • 11.
    How is an Entity Denoted? An Entity is Denoted (Signified) through the use of an Identifier. License CC-BY-SA 4.0 (International).
  • 12.
    What is an Identifier? An Identifier is a Sign (or Token) that Signifies (Denotes, or "Stands For") an Entity License CC-BY-SA 4.0 (International).
  • 13.
    Identifier Types? Quoted Literals such as: "Kingsley Idehen" or ‘Kingsley Idehen’ Absolute References: <http://kingsley.idehen.net/dataspace/person/kidehen#this> Relative References: <#KingsleyIdehen> License CC-BY-SA 4.0 (International).
  • 14.
    How is an Entity Described? Through entity relationships that are represented in reusable form via document content (sentences and statements). License CC-BY-SA 4.0 (International).
  • 15.
    What is a Relationship? A Relationship is an Association between two or more Entities, where each has a specific Role. License CC-BY-SA 4.0 (International).
  • 16.
    What is a Relationship Role? A Relationship Role is a Function performed by an Entity in a Relationship License CC-BY-SA 4.0 (International).
  • 17.
    Relationship Role Types? • Entity Attribute Value EAV  Entity -- observation focal point  Attribute -- observation attribute name (relationship type determinant)  Value -- observation attribute value • RDF (WC3’s Resource Description Framework)  Subject -- observation focal point  Predicate -- observation attribute name (relationship type determinant)  Object -- observation attribute value License CC-BY-SA 4.0 (International).
  • 18.
    Relationship Role: Predicate The Relationship Predicate is the Connector that associates an observation focal point (Subject) with something, in the form of an observation value (Object). License CC-BY-SA 4.0 (International).
  • 19.
    Relationship Role: Subject Actual Entity being Observed License CC-BY-SA 4.0 (International).
  • 20.
    Relationship Role: Object Value associated with an observation focal point (Subject) via a Relationship Predicate. License CC-BY-SA 4.0 (International).
  • 21.
    Types of Values? • Untyped Literals (Strings) • Typed Literals  Numbers  Dates  Booleans  Etc. • References (Local and Global Hyperlinks) License CC-BY-SA 4.0 (International).
  • 22.
    How are Relationships Expressed? Relationships are Expressed using a Language, i.e., a system of signs [for denotation], syntax [arrangement of signs to form sentences], and entity relation semantics [meaning of relationship roles] for encoding and decoding information. Example: Subject, Predicate, Object – Used by W3C’s Resource Description Framework (RDF) and Natural Language. License CC-BY-SA 4.0 (International).
  • 23.
    How Are Entity Relationships Represented ? Entity Relationships are Represented using notations associated with a specific language. Examples include: • Entity Relationship Model (Network /Graph) Diagrams. • Tables (CSV files, Spreadsheets, and SQL Relational Database Management Systems). • RDF-Turtle, JSON-LD, RDF/XML, HTML+Microdata, HTML+RDFa etc.. License CC-BY-SA 4.0 (International).
  • 24.
    Entity Relationship Diagram <#hasCapital> License CC-BY-SA 4.0 (International). <#PopulatedPlace> "France" <#Paris> <#type> <#hasLabel> <#France>
  • 25.
    Turtle Notation Based Entity Relationship Statements <#France> <#Type> <#PopulatedPlace> . <#France> <#hasLabel> "France" . <#France> <#hasCapital> <#Paris> . <#Paris> <#Type> <#PopulatedPlace> . <#Paris> <#hasLabel> "Paris" . <#PopulatedPlace> <#Type> <#Place> . License CC-BY-SA 4.0 (International).
  • 26.
    Entity Relationship Tables Delimiter: e.g., Comma Identifier Quote Character: Double-quotes Relation Header Row: Entity,Attribute,Value Relation Body Example: "Entity", "Attribute" "Value" "France", "Type" "PopulatedPlace" "France" , "hasLabel" "France" "France" , "hasCapital" "Paris" License CC-BY-SA 4.0 (International).
  • 27.
    Statement Representation: Spreadsheet Tables Entity (Subject) Attribute (Predicate) Value (Object) #France #Type #PopulatedPlace #France #hasLabel "France" #France #hasCapital #Paris #Paris #Type #PopulatedPlace #Paris #hasLabel "Paris" #PopulatedPlace #Type #Place License CC-BY-SA 4.0 (International).
  • 28.
    How are Statements Persisted & Transmitted? • Persistence:  To paper based documents  To digital realm documents (e.g., operating system files, web pages, etc.) • Transmission:  Text oriented serialization formats  Binary serialization formats License CC-BY-SA 4.0 (International).
  • 29.
    Understanding Data (Recap) • The term "Data" refers to observation expressed in reusable form. • The term "Observation" refers to our perception of Entity Relationships. • Entity Relationships are expressed using a language. • Statements are represented using a variety of notations; persisted to paper or digital documents; and transmissible using a variety of serialization formats. License CC-BY-SA 4.0 (International).
  • 30.
    DATA ACCESS License CC-BY-SA 4.0 (International).
  • 31.
    Fundamental Challenge Access to Data Independent of: • Location (File or Database Management System) • Representation Notation • Serialization Format • Transmission Protocol • Host Operating Systems • Consumer Applications License CC-BY-SA 4.0 (International).
  • 32.
    Critical Components • Identifiers that denote (signify) each entity associated with the following relationship roles:  Entity (Subject)  Attribute (Predicate)  Value (Object) • Identifiers that denote entity description documents (Descriptors) • Identifiers that provide entity naming (identification) via implicit or explicit [denotation]  [description document content] resolution using indirection (i.e., combined effect of denotation & connotation to deliver identification or sense) • Name Resolution Protocols • Document Content Serialization Formats License CC-BY-SA 4.0 (International).
  • 33.
    Entity Identifiers (Names) Uniform Resource Identifier (URI) <http://kingsley.idehen.net/dataspace/person/kidehen#this> – WebID (i.e., an HTTP URI that denotes an Entity of Type: Agent (Person, Organization, Software, Robot etc) ODBC Data Source Name (DSN) DSN=CRM JDBC Data Source Name (DSN) DSN=CRM License CC-BY-SA 4.0 (International).
  • 34.
    Entity Description Document Locators • Uniform Resource (Data) Locator (URL) o <http://kingsley.idehen.net/dataspace/person/kidehen> – an HTTP URI that denotes a Document on an HTTP Network • ODBC Data Source Name o DSN=CRM;HOST=crm.example.org;SVT=Oracle;DATABASE=CRM;TABLE=CUSTOMER – denotes an ODBC accessible Table in a SQL RDBMS • JDBC Data Source URL o jdbc:openlink://crm.example.org/SVT=Oracle/DATABASE=CRM/TABLE=CUSTOMER – denotes a JDBC accessible Table in a SQL RDBMS License CC-BY-SA 4.0 (International).
  • 35.
    ODBC Data Source Name Challenges • SQL Relational Database Specific. • Identifiers are x.500 names that are only understood by operating system locked applications. • Identifiers denote RDBMS specific tables, views, users, and stored procedures. License CC-BY-SA 4.0 (International).
  • 36.
    JDBC Data Source Name Challenges • SQL Relational Database Specific. • Identifiers are "jdbc:" scheme URIs that are only understood by JDBC compliant applications constrained by Java Virtual Machine (JVM). • Identifiers denote RDBMS specific tables, views, users, and stored procedures. License CC-BY-SA 4.0 (International).
  • 37.
    HTTP URI based Data Source Name Virtues • Database Engine Independent. • Data Access Protocol Independent. • Data Representation Format Independent. • Identifiers are Literals and/or References (which globalize lookup scope). • Identifiers denote anything, i.e., an kind of entity. • Identifiers are "terms" that resolve to referent description documents, globally. License CC-BY-SA 4.0 (International).
  • 38.
    Data Source Name Resolution Protocols • Internet based Computer Network – Domain Name Services (DNS) protocol provides Name Resolution for Computers. • World Wide Web Document Network – HTTP provides Name Resolution for Web Documents via HTTP URLs. • World Wide Web Data Network – HTTP provides Name Resolution for Entities via HTTP URIs . License CC-BY-SA 4.0 (International).
  • 39.
    DNS based Linked Computer Network (Internet) Linked Computer Network (e.g., Internet) 1. Computer (DNS CNAMES) Names are Data Source Name 2. Actual Data Model and Data Access is Local and Machine OS hosted App. specific. License CC-BY-SA 4.0 (International). Internet
  • 40.
    HTTP based Linked Document Network (Web 1.0 & 2.0) Linked Document Network (e.g., World Wide Web) 1. Computer (DNS CNAMES) Names become irrelevant. 2. Document Locators / Addresses (HTTP URLs) are Data Source Names (DSNs). 3. One kind of Relation i.e., "LinksTo" is what connects the Documents. 4. To machines: actual Data Model, Entity Relation Semantics, and Representation Notations are indecipherable from content. Internet Web License CC-BY-SA 4.0 (International).
  • 41.
    HTTP based Linked Data Network (Web 3.0) Linked Data Network (e.g., Linked Open Data Cloud) 1. Entity Names (HTTP URIs) are Data Source Names (DSNs) 2. Computer (DNS CNAMES) & Document Names (HTTP URLs) become irrelevant 3. Actual Data Model and Representation Notations are loosely coupled. License CC-BY-SA 4.0 (International). Internet Web Linked Data
  • 42.
    LINKED DATA (WEBBY STRUCTURED DATA) License CC-BY-SA 4.0 (International).
  • 43.
    Linked Data Fundamentals • Denote ("refer to" or name) entities unambiguously using URIs – similar to the role of "words" in natural language. • Use HTTP URIs so that the description of any entity can be looked up using any HTTP user agent – similar to the role of "terms " in natural language. • Use human and machine readable statements (via open standards e.g., RDF) to create document content that describes entities. • Refer to other entities using their HTTP URI based names in your entity description documents – i.e., – expand the Web! License CC-BY-SA 4.0 (International).
  • 44.
    Understanding HTTP URI Entity Name and Description Doc Address Duality An HTTP URI is a kind of identifier that denotes ("Refers To") an entity while also resolving to its description document, over an HTTP Network. License CC-BY-SA 4.0 (International).
  • 45.
    What is Linked Data? Linked Data is the use of Resolvable URIs to enhance Structured Data Representation. Basically: Representing Entity Relationships using Statements where the relationship role participants [Subject, Predicate, and Object (optionally)] are unambiguously "referred to" using Resolvable URIs. License CC-BY-SA 4.0 (International).
  • 46.
    What is Linked Open Data? Linked Open Data is the use of HTTP URIs to enhance Structured Data Representation. Basically: Representing Entity Relationships using Statements where the relationship role participants [Subject, Predicate, and Object (optionally)] are unambiguously "referred to" using HTTP URIs. Note: URIs and HTTP are Open Standards License CC-BY-SA 4.0 (International).
  • 47.
    Why is Linked Open Data Important? • It turns HTTP URIs (Hyperlinks) into Data Source Names. • It moves us from Open Database Connectivity to Open Data Connectivity – that scales from Private Data Spaces to the World Wide Web. • It delivers a powerful mechanism for virtualization of disparate and heterogeneous data sources (big or small) i.e., Data De-Silo-Fication. • It is inherently Platform Agnostic. • It delivers a Linked Open Data Cloud that scales to the World Wide Web. License CC-BY-SA 4.0 (International).
  • 48.
    What is RDF based Linked Data? RDF-based Linked Data is the use of IRIs and Entity Relationship Type (aka. Relations) Semantics to enhance Structured Data Representation. Basically: Representing Entity Relationships using Statements where the relationship role participants [Subject, Predicate, and Object (optionally)] are unambiguously "referred to" using IRIs. Note: RDF and IRIs are Open Standards License CC-BY-SA 4.0 (International).
  • 49.
    What is RDF based Linked Open Data? RDF-based Linked Open Data is the use of HTTP URIs & Entity Relationship Type (Relations) Semantics to enhance Structured Data Representation. Basically: Representing Entity Relationships and Relation Semantics using Statements where the relationship role participants [Subject, Predicate, and Object (optionally)] are unambiguously "referred to" using HTTP URIs. Note: RDF, HTTP and URIs are Open Standards License CC-BY-SA 4.0 (International).
  • 50.
    What is RDF based Linked Data? RDF-based Linked Data is Web-Like Structured Data enhanced with RDF’s *explicit* machine-and human-comprehensible Entity Relationship Semantics. Identifiers, Structured Data Representation, and Logic License CC-BY-SA 4.0 (International). Linked Data RDF Predicate Logic (Entity Relationship Semantics)
  • 51.
    RDF based Linked Open Data (Semantic Web) Semantically Enhanced Linked Data Network (e.g., Semantic Web of Big Linked Open Data) License CC-BY-SA 4.0 (International). Internet Web Linked Data Relation 1. Entity Names (HTTP URIs) are Semantics Data Source Names (DSNs) 2. Computer (DNS CNAMES) & Document Names (HTTP URLs) become irrelevant 3. Actual Data Model and Representation Notations are loosely coupled 4. RDF & RDF Schema Relation Semantics are accessible and comprehendible to humans and machines.
  • 52.
    Local Linked Data (Inaccessible) Entity (Subject) Attribute (Predicate) Value (Object) urn:data:object:id:France urn:data:object:id:Type urn:data:object:id:Popula tedPlace urn:data:object:id:France urn:data:object:id:hasLabel "France" urn:data:object:id:France urn:data:object:id:hasCapital urn:data:object:id:Paris urn:data:object:id:Paris urn:data:object:id:Type urn:data:object:id:Popula tedPlace urn:data:object:id:Paris urn:data:object:id:hasLabel "Paris" urn:data:object:id:Populate dPlace urn:data:object:id:Type urn:data:object:id:Place License CC-BY-SA 4.0 (International).
  • 53.
    Linked Data (Accessible Webby Data) Entity (Subject) Attribute (Predicate) Value (Object) http://dbpedia.org/resource/France http://www.w3.org/1999/02/22-rdf-syntax- ns#type http://dbpedia.org/ontology/Popula tedPlace http://dbpedia.org/resource/France http://www.w3.org/2000/01/rdf-schema# label "France" http://dbpedia.org/resource/France http://dbpedia.org/ontology/capital http://dbpedia.org/resource/Paris http://dbpedia.org/resource/Paris http://www.w3.org/1999/02/22-rdf-syntax- ns#type http://dbpedia.org/ontology/Popula tedPlace http://dbpedia.org/resource/Paris http://www.w3.org/2000/01/rdf-schema# label "Paris" http://dbpedia.org/ontology/Popula tedPlace http://www.w3.org/2000/01/rdf-schema# subClassOf http://dbpedia.org/ontology/Place License CC-BY-SA 4.0 (International).
  • 54.
    Massive Linked Open Data Cloud License CC-BY-SA 4.0 (International).
  • 55.
    NATURAL LANGUAGE & DATA "Natural Languages are the most sophisticated systems of communication ever developed." – John F. Sowa "Once you have a truly massive amount of information integrated as knowledge, then the human-software system will be superhuman, in the same sense that mankind with writing is superhuman compared to mankind before writing." – Douglas Lenat License CC-BY-SA 4.0 (International).
  • 56.
    Natural Language & Data • A Word or Phrase is an identifier that names an Entity (thing) via implicit [denotation][referent description document content] resolution • A Term is a Word or Phrase that names an Entity via explicit, [denotation][referent description document content] resolution, using indirection. • A Sentence is a syntax rules constrained arrangement of Words and Phrases that represent types of Entity Relationships. • A Statement is a kind of Sentence constructed from Terms. License CC-BY-SA 4.0 (International).
  • 57.
    Data (Recap) • A IRI is an Internationalized Identifier that has the entity naming characteristics of a Word or Phrase. • An HTTP URI is a kind of IRI that has the entity naming characteristics of a Term i.e., denotation (signification) and connotation (description) reference duality. • RDF enables digital sentence construction where IRIs are used to name Entities participating in the Subject, Predicate, and Object relationship roles. • RDF based Linked Data enables digital statement construction where HTTP URIs are used to denote Entities participating in the Subject, Predicate, and Object relationship roles. License CC-BY-SA 4.0 (International).
  • 58.
    Natural Language & Data Connection • An RDF triple represents a "Datum" – a Sentence compromised of Words or Phrases. • An RDF based Linked Open Data Triple represents a "Webby Datum" – a Statement comprised of Terms. • RDF triple collections represent Data – Sentences. • RDF based Linked Open Data triple collections represent "Webby Data" – Statements. License CC-BY-SA 4.0 (International).
  • 59.
    Live Additional Information Links An Glossary of terms, in Linked Data form: • Data • Big Data • Open Data • Public Open Data • Linked Data • Linked Open Data • Semantic Web • Resource Description Framework (RDF) License CC-BY-SA 4.0 (International).
  • 60.
    References • The Role of Logic and Ontology in Language and Reasoning --- John F. Sowa • Blogic – Pat Hayes • Unified View of Data – Peter Chen • Levels of Abstraction: Net, Web, Graph – Tim Berners- Lee • What is Data? What is a Datum – Ontolog Forum Thread • Data & Relations – Ontolog Forum Thread. License CC-BY-SA 4.0 (International).
  • 61.
    Additional Information Web Sites OpenLink Software YouID – Digital Identity Card (Certificate) Generator OpenLink Data Spaces – Semantically enhanced Personal & Enterprise Data Spaces & Collaboration Platform OpenLink Virtuoso - Hybrid Data Management, Integration, Application, and Identity Server Universal Data Access Drivers - High-Performance ODBC, JDBC, ADO.NET, and OLE-DB Drivers Social Media Data spaces http://kidehen.blogspot.com (weblog) http://www.openlinksw.com/blog/~kidehen/ (weblog) https://plus.google.com/112399767740508618350/posts (Google+) https://twitter.com/#!/kidehen (Twitter) Hashtag: #LinkedData (Anywhere). License CC-BY-SA 4.0 (International).

Editor's Notes

  • #30 Title previously read "Data Access Challenge"
  • #32 Previously read: A WebID that denotes a Person Entity Typo in word organization
  • #33 Previously read: Uniform Resource (Data) Locator (URL) http://kingsley.idehen.net/dataspace/person/kidehen – denotes an HTTP accessible Document comprised of structured data ODBC Data Source Name DSN=CRM;HOST=crm.example.org;SVT=Oracle;DATABASE=CRM;TABLE=CUSTOMER – denotes an ODBC accessible Table JDBC Data Source URL jdbc:openlink://crm.example.org/SVT=Oracle/DATABASE=CRM/TABLE=CUSTOMER – denotes a JDBC accessible Table
  • #42 Previously read: Linked data principles "Refer To" (Name or Denote) Entities unambiguously using URIs. Use resolvable URIs (e.g., HTTP URIs) so that Entity Names resolve to Entity Description Documents (Descriptors). Use Structured Data to enhance the content of Entity Description Documents. Expand the Web by referring to other entities using their HTTP URIs.
  • #43 Previously read: Understanding HTTP URI Duality Duality endowed Identifiers that denote ("Refer To") an entity while also resolving to its description, over an HTTP Network.
  • #55 Please check over colour keyed words Deleted: Words play Subject, Predicate, or Object roles in Sentences.

AltStyle によって変換されたページ (->オリジナル) /