Heterogeneous Data Case Study


One of the most common problems that file information systems face today is dealing with heterogeneous data. By heterogeneous data we mean collections of data from diverse sources with varying properties. Heterogeneous data may include text, pictures, music, emails, XML, LaTeX, and MS Office documents scattered across a hierarchy of folders. There is no efficient way to store this large amount of heterogeneous data in the relational databases that most current search systems use.
Thus the challenge is to create means of managing and searching this heterogeneous data in a unified fashion.
The goal of the project is to model this heterogeneous data in a single schema and implement the core features of search software.
As search software, the system provides the following functionality:
1. The GUI is user friendly and can be used easily by novice users.
2. It finds files and folders even when the user has very little information about them.
3. Search is not limited to parameters such as file name and type; it can also use file size and modification date as search parameters.
4. Searches in the file information system should be fast.
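The idea of a single schema queried by name, type, size, and modification date can be illustrated with a minimal Python sketch. It is not the project's actual implementation: the FileRecord fields and the scan and search helpers are hypothetical names chosen for illustration, and the sketch keeps records in memory rather than in whatever store the project uses.

```python
from dataclasses import dataclass
from datetime import datetime
from pathlib import Path
from typing import Iterable, Iterator, List, Optional
import fnmatch
import os


@dataclass(frozen=True)
class FileRecord:
    """One uniform record per file, whatever its format (hypothetical schema)."""
    path: str
    name: str
    file_type: str        # lower-cased extension without the dot, e.g. "xml"
    size_bytes: int
    modified: datetime


def scan(root: str) -> Iterator[FileRecord]:
    """Walk a folder hierarchy and yield one FileRecord per file."""
    for dirpath, _dirnames, filenames in os.walk(root):
        for filename in filenames:
            full = Path(dirpath) / filename
            try:
                info = full.stat()
            except OSError:
                continue  # file vanished or is unreadable; skip it
            yield FileRecord(
                path=str(full),
                name=filename,
                file_type=full.suffix.lower().lstrip("."),
                size_bytes=info.st_size,
                modified=datetime.fromtimestamp(info.st_mtime),
            )


def search(
    records: Iterable[FileRecord],
    name_pattern: Optional[str] = None,     # shell-style pattern, e.g. "rep*"
    file_type: Optional[str] = None,
    min_size: Optional[int] = None,
    max_size: Optional[int] = None,
    modified_after: Optional[datetime] = None,
    modified_before: Optional[datetime] = None,
) -> List[FileRecord]:
    """Return the records matching every criterion that was supplied."""
    results = []
    for rec in records:
        # Lower-case both sides so name matching ignores case on every platform.
        if name_pattern is not None and not fnmatch.fnmatchcase(
            rec.name.lower(), name_pattern.lower()
        ):
            continue
        if file_type is not None and rec.file_type != file_type.lower():
            continue
        if min_size is not None and rec.size_bytes < min_size:
            continue
        if max_size is not None and rec.size_bytes > max_size:
            continue
        if modified_after is not None and rec.modified < modified_after:
            continue
        if modified_before is not None and rec.modified > modified_before:
            continue
        results.append(rec)
    return results
```

For example, search(scan("."), file_type="xml", min_size=1024) would list XML files of at least 1 KB under the current folder. Because every criterion is optional, a user who knows only a rough size or date can still narrow the results. A linear scan like this would not meet the speed goal on a large file system; an index over these fields would be needed for that.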
1.1 Motivation
The main motivation of this project is to handle heterogeneous data efficiently and then use this data collection in search systems. The heterogeneous data is stored in a unified fashion and then queried by the user in a homogeneous manner. We take this as an opportunity to solve a problem faced by every computer user and make computing easier. The scope of the project lies in the improvement of the current search systems making the...

... middle of paper ...

...sis-based XML approach for ontology generation so that dynamic authorization can be performed [10].
Vikas Jindal presented a survey of different ranking approaches for semantic search over the web. The author studied different matching approaches so that the most appropriate links are retrieved. In this paper, the author considers different relevancy vectors and, based on these, adopts a methodology for effective web page classification and ranking [12]. Another work on semantic web search based on natural language was presented by Ivan Habernal in 2013. The author defined a semantic web search system that includes preprocessing analysis, semantic analysis, and interpretation, and presented an evaluation-based accommodation approach to provide effective query filtration.
