|
Scope:
PopulusLog
is a web data mining research project developed by two members of the
Database Lab
at Case. This project is not officially affiliated with Case Western
Reserve University, and is conducted solely for research purposes.
See the Frequently Asked Questions
Motivation: In
order to organize the publicly available personal information on the web in a
more structured way, and allow for advanced querying of the collected
information, we have been developing a knowledgebase called
PopulusLog. PopulusLog is a set of tools providing the capability for
crawling, information extraction, and advanced querying. More specifically,
PopulusLog (i) allows grouping of, and provides one-point access to the
information about a person, (ii) provides semantic querying schemes like “Find
the colleagues of person A” rather than just simple syntactical keyword search,
(iii) evaluates the collected information thorough social network analysis and
provide new knowledge like personal impact factors, social cliques that would
otherwise stay implicit, (iv) visualizes the query results.
This prototype is a
version of PopulusLog developed for the CASE (a.k.a. CWRU) domain. The
PopulusLog CASE Edition features:
·
User authentication through Case
Single Sign-on service
·
Allowing people to edit/curate
automatically collected data, or totally remove their record
·
Locating personal home pages
(if any)
·
Improved data cleaning techniques
to minimize the false labeling of entities
·
Locating and listing similar people
·
Improved person-to-person
relationship detection
·
Identifying virtual social network
of a person
·
Computation of top-k most
informative web pages about a person
·
A newly-designed, light-weight and
simple visualization interface
PopuslusLog Database Stats
64,253 People
3,250 Locations
11,235 Organizations
166,523 Affiliations
11,242,354 Pairwise People Relations
Research
Team:
Ali
Cakmak
Mustafa Kirac
|