In the current digital world, big data concept is increasing very rapidly. Data is generated in very large volume and in a variety of forms. This large and complex dataset is used by the business organizations for finding their customer needs or insights. Therefore, the security and privacy over the large datasets become too much necessary for the organizations and users. This paper mainly deals with the issues related to big data while storing and processing it. In the proposed architecture clients data is distributed among different Hadoop machines and computation is done through a single machine using a random method and joint computation is performed here that announces the final result to the clients. Therefore, our architecture provides the anonymity of users to maintain the high level of privacy, that means machine who performs computation only knows the data as a whole of all clients and does not know to whom the data belongs and thus the privacy of different user during data processing remains anonymous.
Date of publication: