Actions

Ontolog Forum

Ontology Summit 2012: Session-05, Thursday 2012-02-09

Summit Theme: OntologySummit2012: "Ontology for Big Systems"

Track 3 Title: Meeting Big Data Challenges through Ontology

Session Topic: I - Big Data domain experts and ontologists; II - Big Data that would benefit from ontological technology

Session Chairs: Mr. ErnieLucier (NCO/NITRD) and Ms. MaryBrady (NIST) - intro-slides

Opening Remarks by - Dr. GeorgeStrawn (Director, NCO_NITRD)

Panelists:

  • Professor BarrySmith (University at Buffalo) - "Big Data that might benefit from ontology technology, but why this usually fails" - slides
  • Mr. ChrisMusialek (GSA) (for Dr. Jeanne Holm, Evangelist, Data.gov) - "Driving Innovation with Open Data" - slides
  • Mr. Bryan Thompson and Mr. MikePersonick (SYSTAP) - "Big Data Challenges: Managing Scale in Ontological Systems" - slides . slides+notes
  • Mr. JamesKirby (Naval Research Laboratory) - "Ontology for Software Production" - slides

Archives

Conference Call Details

  • Date: Thursday, 9-Feb-2012
  • Start Time: 9:30am PST / 12:30pm EST / 6:30pm CET / 17:30 UTC
  • Expected Call Duration: ~2.0 hours
  • Dial-in:
    • Phone (US): +1 (206) 402-0100 ... (long distance cost may apply)
      • ... [ backup nbr: (415) 671-4335 ]
    • when prompted enter PIN: 141184#
    • Skype: joinconference (use the PIN above) ... generally free-of-charge, when connecting from your computer)
      • for skype users who have trouble with finding the Skype Dial pad ... it's under the "Call" dropdown menu as "Show Dial pad"
  • Shared-screen support (VNC session), if applicable, will be started 5 minutes before the call at: http://vnc2.cim3.net:5800/
    • view-only password: "ontolog"
    • if you plan to be logging into this shared-screen option (which the speaker may be navigating), and you are not familiar with the process, please try to call in 5 minutes before the start of the session so that we can work out the connection logistics. Help on this will generally not be available once the presentation starts.
    • people behind corporate firewalls may have difficulty accessing this. If that is the case, please download the slides above (where applicable) and running them locally. The speaker(s) will prompt you to advance the slides during the talk.
  • In-session chat-room url: http://webconf.soaphub.org/conf/room/summit_20120209
    • Instructions: once you got access to the page, click on the "settings" button, and identify yourself (by modifying the Name field from "anonymous" to your real name, like "JaneDoe").
    • You can indicate that you want to ask a question verbally by clicking on the "hand" button, and wait for the moderator to call on you; or, type and send your question into the chat window at the bottom of the screen.
    • thanks to the soaphub.org folks, one can now use a jabber/xmpp client (e.g. gtalk) to join this chatroom. Just add the room as a buddy - (in our case here) summit_20120209@soaphub.org ... Handy for mobile devices!
  • Discussions and Q & A:
    • Nominally, when a presentation is in progress, the moderator will mute everyone, except for the speaker.
    • To un-mute, press "*7" ... To mute, press "*6" (please mute your phone, especially if you are in a noisy surrounding, or if you are introducing noise, echoes, etc. into the conference line.)
    • we will usually save all questions and discussions till after all presentations are through. You are encouraged to jot down questions onto the chat-area in the meantime (that way, they get documented; and you might even get some answers in the interim, through the chat.)
    • During the Q&A / discussion segment (when everyone is muted), If you want to speak or have questions or remarks to make, please raise your hand (virtually) by clicking on the "hand button" (lower right) on the chat session page. You may speak when acknowledged by the session moderator (again, press "*7" on your phone to un-mute). Test your voice and introduce yourself first before proceeding with your remarks, please. (Please remember to click on the "hand button" again (to lower your hand) and press "*6" on your phone to mute yourself after you are done speaking.)
  • RSVP to peter.yim@cim3.com appreciated, ... or simply just by adding yourself to the "Expected Attendee" list below (if you are a member of the team.)
  • Please note that this session may be recorded, and if so, the audio archive is expected to be made available as open content, along with the proceedings of the call to our community membership and the public at-large under our prevailing open IPR policy.

Attendees

ABSTRACT

I - Big Data domain experts and ontologists

II - Big Data that would benefit from ontological technology

This is our 7th Ontology Summit, a joint initiative by NIST, Ontolog, NCOR, NCBO, IAOA & NCO_NITRD with the support of our co-sponsors. The theme adopted for this Ontology Summit is "Ontology for Big Systems." The event today is our 5th virtual session.

The principal goal of the summit is to bring together and foster collaboration among the ontology community, systems community, and stakeholders of some of "big systems." Together, the summit participants will exchange ideas on how ontological analysis and ontology engineering might make a difference, when applied in these "big systems." We will aim towards producing a series of recommendations describing how ontologies can create an impact; as well as providing illustrations where these techniques have been, or could be, applied in domains such as bioinformatics, electronic health records, intelligence, the smart electrical grid, manufacturing and supply chains, earth and environmental, e-science, cyberphysical systems and e-government. As is traditional with the Ontology Summit series, the results will be captured in the form of a communiqué, with expanded supporting material provided on the web.

Meeting Big Data Challenges through Ontology

The mission of this track is to identify appropriate objectives for an "Ontology and Big Data" challenge, prepare problem statements, identify the organizations and people to be advocates, and identify the resources necessary to complete a challenge. The goal will be to select a challenge showing benefits of ontology to big data.

One of the NCO's goals is to enhance collaboration and accelerate agencies' adoption of advanced IT capabilities. NITRD seeks to accelerate deployment of promising research technologies; share protocol information, standards, and best practices; and coordinate and disseminate technology assessment and testbed results. NITRD coordinates federally supported IT research under the leadership of OSTP. Ontologies and the semantic web support Open Government Directive.

The goal of "Meeting Big Data Challenges through Ontology" Track 3 is to identify issues that can be addressed using an ontology challenge. Challenges can take many forms and target many issues.

Potential issues to be addressed by challenges:

  • Enhance collaboration and accelerate agencies' adoption
  • Accelerate the adoption of ontological methods, maximize public awareness, and impact of research.
  • Increase the number of agencies using ontologies, i.e., earlier adoption
  • Where should our focus be to accelerate agencies' adoption of ontology capabilities?
  • How many scientists, physicists, engineers, programmers, big data administrators, etc. have experience with ontologies?
  • Is the growth of ontological implementations and technologies with Big Data constrained by the shortage of qualified personnel?
  • Inform, educate, and include the public in scientific research and discovery. Public involvement could be a critical component of our success
  • A mismatch between those with data and those with the skills to analyze the data
  • Are programmers able to optimize the use of unstructured or semi-structured data sets for scientists and engineers?
  • What are the talent and skill set issues impacting the use of ontologies?
  • The skills important to the growth of ontological technologies with Big Data include a combined understanding of a scientific or engineering discipline and knowledge of ontology-based technologies.
  • Programmers are not able to optimize the use of unstructured data for scientists and engineers
  • Scientists and engineers without ontology training may use brute force programming. This can be inefficient and the scientists and engineers without training may not be aware of options and capabilities using ontology-based technologies
  • Strategic significance to the economy, e.g. enabling competitive products.
  • How long does it take to become productive in the ontology environment?
  • Can universities expand coursework in ontologies and integrate ontological methods into the requirements for science degrees? At the undergraduate level? At the graduate level?
  • Identify individuals who have both domain experience and an understanding of what it means to apply ontology technologies.
  • Increase the number of individuals capable of applying ontology technology
  • Ontology-based technology evolution for big data may be slow or non-existent
  • Advances in the use of ontology technology can be difficult or unattainable without an adequate number of properly trained personnel, including scientists, engineers, programmers, system administrators, technologists, and all others that make up the big data systems.
  • Expanding the markets for ontologies could make the field a more attractive career path. What is the growth rate of the ontology market? Further expansion could spark investment and make ontologies an even more vibrant, attractive market for young people to enter.
  • A Challenge may seed and transform the current status quo
  • Software dilemma analogy with ontology
  • Ontologies and software perceived to be a commodity resulting in little or no investment in research. Projects use ontologies as one of their tasks

Potential challenge directions

1. Increase the awareness of ontology technology among programmers/database managers
2. Accelerate agencies' adoption of ontology capabilities
3. Enable scientists and engineers to make maximum use of big data
4. Enable scientists and engineers to understand the potential of ontology-based systems integration
5. Enable ontologists to understand scientists and engineers needs
6. Ameliorate any mismatch between those with data and those with the skills to analyze it
7. People in the domains of science, engineering, software, computer science, etc. can benefit from a combined knowledge of their domain and application of ontology-based technologies. A combined understanding of these domains and ontology-based technologies may encourage the growth of technology.
8. Improve critical areas of current practice

This first session of Track 3 - http://ontolog.cim3.net/cgi-bin/wiki.pl?ConferenceCall_2012_02_09 - is to understand the relationships between big data challenges and ontologies. The second session, we hope to talk about solutions and benefits including a NASA big data challenge activity. At the face-to-face meeting, we would like to present various approaches to implementing ontologies using challenges and a sample from the NITRD Big Data working group.

Agenda

Ontology Summit 2012 - Panel Session-05

  • Session Format: this is a virtual session conducted over an augmented conference call

Proceedings

Please refer to the above

IM Chat Transcript captured during the session

see raw transcript here.

(for better clarity, the version below is a [ re-organized and lightly edited chat-transcript].)

Participants are welcome to make light edits to their own contributions as they see fit.

-- begin in-session chat-transcript --

-- end of in-session chat-transcript --

  • Further Question & Remarks - please post them to the [ ontology-summit ] listserv
    • all subscribers to the previous summit discussion, and all who responded to today's call will automatically be subscribed to the [ ontology-summit ] listserv
    • if you are already subscribed, post to <ontology-summit [at] ontolog.cim3.net>
    • (if you are not yet subscribed) you may subscribe yourself to the [ ontology-summit ] listserv, by sending a blank email to <ontology-summit-join [at] ontolog.cim3.net> from your subscribing email address, and then follow the instructions you receive back from the mailing list system.
      • please email <peter.yim@cim3.com> if you have any question.

Audio Recording of this Session

  • To download the recording of the session, click here
    • the playback of the audio files require the proper setup, and an MP3 compatible player on your computer.
  • Conference Date and Time: 9-Feb-2012 9:38am~11:12am PST
  • Duration of Recording: 1 Hour 30 Minutes
  • Recording File Size: 10.3 MB (in mp3 format)
  • suggestions:
    • its best that you listen to the session while having the respective presentations opened in front of you. You'll be prompted to advance slides by the speaker.
    • Take a look, also, at the rich body of knowledge that this community has built together, over the years, by going through the archives of noteworthy past Ontolog events. (References on how to subscribe to our podcast can also be found there.)

Additional Resources


For the record ...

How To Join (while the session is in progress)