Knowledge Extraction from the Web (ISEWO). Franz-Josef Katzdobler, Heraldo Pimenta Borges Filho. Project ISKM, 2008/2009

Presentation transcript:

1 Knowledge Extraction from the Web (ISEWO)
Franz-Josef Katzdobler, Heraldo Pimenta Borges Filho
Project ISKM, DEI – FCTUC, 2008/2009

2 Content
– Introduction
– Objectives of the Project
– Architecture
– Demo
– Conclusion
– Future Work

3 Introduction
Increasing Amount of Knowledge
– Extracting relevant information from the Web is a complex and important task

4 Objectives

5 Architecture

6 Web-Harvest
Open Source Web Data Extraction Tool
– Written in Java
– Mainly focused on HTML/XML-based websites
– Supports XSLT, XQuery and regular expressions
– Configured with a config file
– Produces XML as output (convenient for processing in the next stages)

7 XML-Output
…
Pate Bocadelia c/ Caranguejo La Piara 2.10 false
Molho Base p/ Engrossar Molhos Express Maizena 2.49 false
Arroz Basmati Continente 2.13 false
...
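
The XML element names of the extractor output are not preserved in this transcript; only the values survive. As a rough illustration of how such output could be consumed in a later stage, here is a minimal Java sketch that uses only the JDK's DOM parser. The element names (`products`, `product`, `name`, `brand`, `price`, `flag`), the `ProductXmlSketch` class, and the split of each record into name and brand are all assumptions made for this example, not the project's actual schema. The meaning of the trailing boolean is not stated on the slide, so it is simply called `flag`.

```java
import java.io.ByteArrayInputStream;
import java.nio.charset.StandardCharsets;
import java.util.ArrayList;
import java.util.List;
import javax.xml.parsers.DocumentBuilderFactory;
import org.w3c.dom.Document;
import org.w3c.dom.Element;
import org.w3c.dom.NodeList;

class ProductXmlSketch {

    /** One extracted record. Field names are assumptions, not the project's schema. */
    static final class Product {
        final String name;
        final String brand;
        final double price;
        final boolean flag; // the meaning of this boolean is not stated on the slide

        Product(String name, String brand, double price, boolean flag) {
            this.name = name;
            this.brand = brand;
            this.price = price;
            this.flag = flag;
        }
    }

    /** Parses a document of the assumed shape <products><product>...</product></products>. */
    static List<Product> parse(String xml) {
        try {
            Document doc = DocumentBuilderFactory.newInstance()
                    .newDocumentBuilder()
                    .parse(new ByteArrayInputStream(xml.getBytes(StandardCharsets.UTF_8)));
            NodeList nodes = doc.getElementsByTagName("product");
            List<Product> products = new ArrayList<>();
            for (int i = 0; i < nodes.getLength(); i++) {
                Element e = (Element) nodes.item(i);
                products.add(new Product(
                        text(e, "name"),
                        text(e, "brand"),
                        Double.parseDouble(text(e, "price")),
                        Boolean.parseBoolean(text(e, "flag"))));
            }
            return products;
        } catch (Exception ex) {
            // Wrap checked parser exceptions so callers get a single unchecked failure.
            throw new IllegalArgumentException("Could not parse product XML", ex);
        }
    }

    /** Returns the trimmed text of the first child element with the given tag name. */
    private static String text(Element parent, String tag) {
        return parent.getElementsByTagName(tag).item(0).getTextContent().trim();
    }

    public static void main(String[] args) {
        // Values taken from the slide; the markup around them is hypothetical.
        String xml = "<products>"
                + "<product><name>Arroz Basmati</name><brand>Continente</brand>"
                + "<price>2.13</price><flag>false</flag></product>"
                + "</products>";
        for (Product p : parse(xml)) {
            System.out.println(p.name + " | " + p.brand + " | " + p.price + " | " + p.flag);
        }
    }
}
```

Turning each record into a typed object at this point keeps the later stages independent of the XML layout, which would matter if the config files for different markets produced slightly different markup.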

8 Ontology

9 Jena
Framework for Building Semantic Web Applications
– Possibility to load an ontology from an existing file
– Persistent model available
   The only thing the programmer needs to do is provide a database connection
   The OntModel class can be used to manipulate the data

10 Demo

11 Conclusion
Advantages
– Get the cheapest product from several different markets
– No need to visit all the different webpages
Limitations
– The markets are predefined
– Config files are created manually
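
The first advantage, finding the cheapest product across several markets, can be sketched in plain Java as a single pass that keeps the lowest-priced offer per product name. This is only an illustration: the slides indicate the project keeps its data in a Jena ontology model, whereas this sketch swaps in an ordinary in-memory list. The `CheapestOfferSketch` and `Offer` names, the market names, and the prices in `main` are made up for the example.

```java
import java.util.Arrays;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;

class CheapestOfferSketch {

    /** A product as offered by one market (hypothetical structure). */
    static final class Offer {
        final String product;
        final String market;
        final double price;

        Offer(String product, String market, double price) {
            this.product = product;
            this.market = market;
            this.price = price;
        }
    }

    /** Returns, for each product name, the offer with the lowest price. */
    static Map<String, Offer> cheapestByProduct(List<Offer> offers) {
        Map<String, Offer> best = new LinkedHashMap<>();
        for (Offer offer : offers) {
            Offer current = best.get(offer.product);
            // Keep the first offer seen, then replace it only on a strictly lower price.
            if (current == null || offer.price < current.price) {
                best.put(offer.product, offer);
            }
        }
        return best;
    }

    public static void main(String[] args) {
        // Illustrative data only: market names and prices are invented.
        List<Offer> offers = Arrays.asList(
                new Offer("Arroz Basmati", "MarketA", 2.13),
                new Offer("Arroz Basmati", "MarketB", 2.29),
                new Offer("Molho Base", "MarketA", 2.49),
                new Offer("Molho Base", "MarketB", 2.39));
        for (Offer o : cheapestByProduct(offers).values()) {
            System.out.println(o.product + " is cheapest at " + o.market + " for " + o.price);
        }
    }
}
```

Matching on the exact product name is the simplest possible choice. In practice the same product may be labelled differently by each market, which is one reason a shared ontology is useful for this comparison.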

12 Possible Future Work
More Flexibility
– Automatic creation of the config file
– Extend the ontology, e.g. categories for the products
– Give suggestions to the user ("Product X may also be interesting for you...")

13 End of Presentation
Questions / Discussion

