Efficient Collection And Retrieval For Large Heterogeneous Dataset