Methods and Tools for Temporal Knowledge Harvesting

Wang, Yafang

doi:10.22028/D291-26419

Item

ITEM ACTIONSEXPORT

Add to Basket

Local TagsRelease HistoryDetailsSummary

Released

Thesis

Methods and Tools for Temporal Knowledge Harvesting

MPS-Authors

/persons/resource/persons45700

Wang, Yafang
Databases and Information Systems, MPI for Informatics, Max Planck Society;
International Max Planck Research School, MPI for Informatics, Max Planck Society;

External Resource

http://scidok.sulb.uni-saarland.de/volltexte/2013/5096/
(Any fulltext)

http://scidok.sulb.uni-saarland.de/doku/lic_ohne_pod.php?la=de
(Copyright transfer agreement)

Fulltext (restricted access)

There are currently no full texts shared for your IP range.

Fulltext (public)

There are no public fulltexts stored in PuRe

Supplementary Material (public)

There is no public supplementary material available

Citation

Wang, Y. (2013). Methods and Tools for Temporal Knowledge Harvesting. PhD Thesis, Universität des Saarlandes, Saarbrücken. doi:10.22028/D291-26419.

Cite as: https://hdl.handle.net/11858/00-001M-0000-0015-3892-2

Abstract

\chapterAbstract}

To extend the traditional knowledge base with temporal dimension, this thesis
offers methods and tools for harvesting temporal facts from both
semi-structured and textual sources.
Our contributions are briefly summarized as follows.

\begin{enumerate}
\item{\bf Timely YAGO:} A temporal knowledge base called Timely YAGO
(T-YAGO) which extends YAGO with temporal attributes is built. We define a
simple RDF-style data model to support temporal knowledge.

\item{\bf PRAVDA:} To be able to harvest as many temporal facts from
free-text as possible, we develop a system PRAVDA.
It utilizes a graph-based semi-supervised learning algorithm to extract fact
observations, which are further cleaned up by an Integer Linear Program based
constraint solver.
We also attempt to harvest spatio-temporal facts to track a person's
trajectory.

\item{\bf PRAVDA-live:} A user-centric interactive knowledge harvesting
system, called PRAVDA-live, is developed for extracting facts from natural
language free-text. It is built on the framework of PRAVDA.
It supports fact extraction of user-defined relations from ad-hoc selected text
documents
and ready-to-use RDF exports.

\item{\bf T-URDF:} We present a simple and efficient representation
model for time-dependent uncertainty in combination with first-order
inference rules and recursive queries over RDF-like knowledge bases.
We adopt the common possible-worlds semantics known from probabilistic
databases and extend it towards histogram-like confidence distributions that
capture the validity of facts across time.

\end{enumerate

All of these components are fully implemented systems, which together form an
integrative architecture.
PRAVDA and PRAVDA-live aim at gathering new facts (particularly temporal facts),
and then T-URDF reconciles them.
Finally these facts are stored in a (temporal) knowledge base, called T-YAGO.
A SPARQL-like time-aware querying language, together with a visualization tool,
are designed for T-YAGO.
Temporal knowledge can also be applied for document summarization.