From OntologPSMW

Revision as of 11:56, 23 June 2020 by Ravisharma (Talk | contribs)

Jump to: navigation, search
[ ]

Ontology Summit 2020 Whence Working Group     (1)

  • What brought the use of graphs in as persistence mechanisms. Will need to address other non-relational persistence mechanisms; historical background; this can include parts of 'why'.     (1A)

First Draft     (1B)

As a possible starting point on our overview, here are some excerpts from Gutierrez and Sequeda A Brief History of Knowledge Graph's Main Ideas. They give accomplishments and foci for each of the eras followed by ‘realizations’ and ‘limitations’. A related report by Juan Sequeda is at Also Sequeda and others presented Whence related material at the Stanford Course Lectures (CS520)     (1B1)

  • Introduction “Those who cannot remember the past are condemned to repeat it”- George Santayana     (1B2)
    • Knowledge Graphs can be considered to be fulfilling an early vision in Computer Science of creating intelligent systems that integrate knowledge and data at large scale. The term “Knowledge Graph” was introduced by researchers at the turn of this century and has rapidly gained popularity in academia and industry since Google popularized it in 2012. It is paramount to note that, regardless of the discussions on, and definitions of the term “Knowledge Graph”, it stems from scientific advancements in diverse research areas such as Semantic Web, Databases, Knowledge Representation and Reasoning, NLP, Machine Learning, among others. … The integration of ideas and techniques from such disparate disciplines give the richness to the notion of Knowledge Graph, but at the same time presents a challenge to practitioners and researchers to know how current advances develop from, and are rooted in, early techniques.     (1B2A)
  • How is this paper written?     (1B3)
    • The essential elements involved in the notion of knowledge graphs can be traced to ancient history. If one would like to dig into their origins, several disciplines should be considered, among them mathematics, philosophy, linguistics, and psychology.[2] However, we do not have the time to go back to ancient times[3] and revisit broad areas of science. Thus, from a temporal point of view, we will concentrate on the evolution after the advent of computing in its modern sense (1950s). … We periodized by decades, but are conscious that the boundaries are much more blurry.[4]     (1B3A)
  • Advent of the digital age (1950s and 1960s)     (1B4)
    • Realizations during the decades of the 50s and 60s:     (1B4A)
      • Importance and possibility of automated reasoning.     (1B4A1)
      • The problem of dealing with large search spaces.     (1B4A2)
      • Need to understand natural language and other human representations of knowledge     (1B4A3)
      • Potential of semantic nets (and graphical representations in general) as abstraction layers     (1B4A4)
      • Relevance of systems and high level languages to manage data.     (1B4A5)
    • Limitations of contemporary (50s and 60s) techniques:     (1B4B)
      • Physical, technical and cost limitations of hardware     (1B4B1)
      • Gap between graphical representation and linear implementation     (1B4B2)
      • Gap between the logic of human language and data as handled by computer systems     (1B4B3)
  • Foundations Data and Knowledge (1970s)     (1B5)
    • Realizations:     (1B5A)
      • The need of representational independence, having the relational model as the first example. This approach could also be implemented in practical systems.     (1B5A1)
      • The need to formalize semantic networks using the tools of formal logic.     (1B5A2)
      • The possibilities of combining logic and data by means of networks.     (1B5A3)
    • Contemporary Limitations:     (1B5B)
      • On the DATA side, more flexible data structures were needed to represent new forms of data giving rise to Object Oriented and Graph data structures.     (1B5B1)
      • On the KNOWLEDGE side, more understanding was needed on the formalization of knowledge in logic giving rise to Description Logics.     (1B5B2)
  • Managing Data and Knowledge (1980s)     (1B6)
    • Realizations:     (1B6A)
      • Combining logic and data needs to be tightly coupled (not just layer prolog/expert system on top of a database)     (1B6A1)
      • Tradeoff between expressive power of logical languages and computational complexity of reasoning tasks     (1B6A2)
    • Contemporary Limitations:     (1B6B)
      • Negation was a killer. It was not well understood at this time.     (1B6B1)
      • Reasoning at large scale was still hard. Hardware was not going to be up to the task.     (1B6B2)
      • Realization of what would be known as the knowledge acquisition bottleneck     (1B6B3)
  • Data, Knowledge and the Web (1990s)     (1B7)
  • Data and Knowledge at Large Scale (2000s)     (1B8)
    • Realizations     (1B8A)
      • We learned to think about data and knowledge in a much bigger way (at Web scale)     (1B8A1)
      • Entering the era of Neural Networks due to new hardware and clever learning techniques     (1B8A2)
    • Limitations     (1B8B)
      • Do not know how to integrate logical and statistical views     (1B8B1)
      • Statistical methods (particularly in neural networks) do not provide information about the process of “reasoning” or “deduction”, which generates problems in areas where explanation is needed     (1B8B2)
  • Where are we now?     (1B9)
    • Throughout this history, we observed two important threads:     (1B9A)
      • Represent and manage data and knowledge at large scale     (1B9A1)
      • Integrate the most diverse, disparate and almost unlimited amount of sources of data and knowledge (structured data text, rules, images, voice, videos, etc.).     (1B9A2)
      • Furthermore, all of this must be available and accessible for “normal” users.     (1B9A3)
    • In 2012, Google announced a product called the Knowledge Graph, which is based on representing data in the form of graph connected with knowledge. … Later on, a myriad of companies ( e.g. Microsoft, Facebook, IBM) and organizations started to use the Knowledge Graph keyword to refer to the integration of data given rise to entities and relations forming graphs.[37] Academia began to use this keyword to designate loosely systems that integrate data with some structure of graphs, a reincarnation of the Semantic Web and Linked Data.     (1B9B)