Cataloguing to Facilitate Big Data Analytics

dc.contributor.author: Singh, Manish Kumar
dc.contributor.author: Singh, D K
dc.date.accessioned: 2015-03-12T10:16:20Z
dc.date.available: 2015-03-12T10:16:20Z
dc.date.issued: 2015-03-12
dc.description.abstract: “Big Data” is the popular term for the collection of large data sets held across multiple systems. Its inherent characteristics are that it is difficult to process because of the sheer scale and accessibility of the data, and that it cannot be managed through a traditional Database Management System. The size of this data is growing at an ever-increasing pace, with multiple exabytes added per day. In addition, big data normally comprises heterogeneous datasets, both structured and unstructured, in diverse data and file formats. It is very difficult to locate and retrieve relevant information in real time from the universe of big data. Librarians, coming out of the walled library, can be expected to contribute to this task with their expertise in information organization and management. This paper identifies various challenges posed by big data and, to address them, identifies mechanisms for creating a big data catalogue. These mechanisms are discussed and compared, and it is proposed to use the techniques of library classification and cataloguing to catalogue the datasets in big data, thereby facilitating information retrieval in the universe of big data. [en_US]
dc.identifier.isbn: 978-93-81232-05-7
dc.identifier.uri: http://hdl.handle.net/1944/1887
dc.language.iso: en_US [en_US]
dc.publisher: INFLIBNET Centre [en_US]
dc.subject: Big Data [en_US]
dc.subject: Heterogeneous Datasets [en_US]
dc.subject: Big Data Catalogue [en_US]
dc.subject: Metadata [en_US]
dc.title: Cataloguing to Facilitate Big Data Analytics [en_US]
dc.type: Article [en_US]

Files

Original bundle
Name: 48.pdf
Size: 135.06 KB
Format: Adobe Portable Document Format
License bundle
Name: license.txt
Size: 1.67 KB
Format: Item-specific license agreed upon to submission