Thesis Presentation: Named-Entity Recognition on Publications and Raw-Text for Meticulous Insight at Visual Trend Analytics

Where: TU Darmstadt / GRIS, Fraunhoferstr. 5 (Darmstadt), Room tba

!!!!! Due to the Corona crisis and the accompanying restrictions at the TU Darmstadt, the exam will be non-public! !!!!!

Who: Ubaid Rana (Author), Prof. Dr. Arjan Kuijper (Supervisor), Dipl.-Inf. Dirk Burkhardt (Advisor/Co-Supervisor)
What: Master Thesis – “Named-Entity Recognition on Publications and Raw-Text for Meticulous Insight at Visual Trend Analytics”


In the modern data-driven era, a massive amount of research documents are available from publicly accessible digital libraries in the form of academic papers, journals and publications. This plethora of data does not lead to new insights or knowledge. Therefore, suitable analysis techniques and graphical tools are needed to derive knowledge in order to get insight of this big data. To address this issue, researchers have developed visual analytical systems along with machine learning methods, e.g text mining with interactive data visualization, which leads to gain new insights of current and upcoming technology trends. These trends are significant for researchers, business analysts, and decision-makers for innovation, technology management and to make strategic decisions.
Nearly every existing search portal uses the traditional meta-information e.g only about the author and title to find the documents that match a search request and overlook the opportunity of extracting content-related information. It limits the possibility of discovering most relevant publications, moreover it lacks the knowledge required for trend analysis. To collect this very concrete information, named entity recognition must be used to be able to better identify the results and trends. The state-of-the-art systems use static approach for named entity recognition which means that upcoming technologies remain undetected. Modern techniques like distant supervision methods leverage big existing community-maintained data sources, such as Wikipedia, to extract entities dynamically. Nonetheless, these methods are still unstable and have never been tried on complex scenarios such as trend analysis before.
The aim of this thesis is to enable entity recognition on both static tables and dynamic community updated data sources like Wikipedia & DBpedia for trend analysis. To accomplish this goal, a model is suggested which enabled entity extraction on DBpedia and translated the extracted entities into interactive visualizations. The analysts can use these visualizations to gain trend insights, evaluate research trends or to analyze prevailing market moods and industry trends.

The post Thesis Presentation: Named-Entity Recognition on Publications and Raw-Text for Meticulous Insight at Visual Trend Analytics appeared first on Human-Computer Interaction & Visual Analyitics Reasearch Group (vis) at Darmstadt University of Applied Sciences (h_da).

Thesis Presentation: User-Centered Scientific Publication Research and Exploration in Digital Libraries

When: 10/12/2018 15:30
Where: Frauhofer IGD, Fraunhoferstr. 5, Room 220
Who: Namitha Chandrashekara (Author), Dipl.-Inf. Dirk Burkhardt (Advisor/Co-Supervisor), Prof. Dr. Arjan Kuijper (Supervisor)

What: Master Thesis – “User-Centered Scientific Publication Research and Exploration in Digital Libraries”


Scientific research is the basis for innovations. Surveying the research papers is an essential step in the process of research. It is vital to elaborate the intended writing of state of the art. Due to the rapid growth in scientific and technical discoveries, there is an increasing availability of publications. The traditional method of publishing the research papers includes physical libraries and books. These become hard to document with the rise in the number of publications produced. Due to the above mentioned problem, online archives for scientific publications have become more prominent in the scientific community. The availability of the search engines and digital libraries help the researchers in identifying the scientific publications. However, they provide limited search capabilities and visual interface. Most of the search engines have a single field to search and provides basic filtering of the data. Therefore, even with popular search engines, it is hard for the user to survey the research papers as it limits the user to search based on simple keywords. The relationships across multiple fields of the publications are also not considered such as to find the related papers and papers based on the citations or references.
The main aim of the thesis is to develop a visual access to the digital libraries based on the scientific research and exploration. It helps the user in writing scientific papers. A scientific research and exploration model is developed based on the previous information visualization model for visual trend analysis with digital libraries, and with consideration of the research process. The principles from Visual Seeking Mantra are incorporated to have an interactive user interface that enhances the user experience.
In the scope of this work, a research on Human Computer Interaction, particularly considering the aspects of user interface design are done. An overview of the scientific research, its types and various aspects of data analysis are researched. Different research models, existing approaches and tools that help the researchers in literature survey are also researched. The architecture and the implementation details of scientific research and exploration that provides visual access to digital libraries are presented.