Unit 9 – 1
One way to research the security issues associated with big data is to look into every stage of the life cycle of big data. The entire data life cycle consists of the following 8 stages (Khan et al., 2014):
· Stage 1: Raw data
· Stage 2: Collection
· Stage 3: Filtering and classification
· Stage 4: Data analysis
· Stage 5: Storing
· Stage 6: Sharing and publishing
· Stage 7: Security
· Stage 8: Retrieval, reuse, and discover
There are two items to point out, as follows:
· Stage 7 is an abstract stage.
· Only three stages (5, 6, and 8) are involved with security.
In Stage 5, the security issues associated with this stage are mainly caused by two aspects—the size of data and the place to store the data. Because the size of the data is too big, many companies have to store their data in the cloud. However, because the data are so big, it is really hard to verify if cloud vendors indeed stored all the data. Because the cloud runs under black box mode, the customers really have no way to know where the data are stored, how they are stored, and whether the integrity of the data is preserved. Because of the cost of local storage and network bandwidth, customers cannot even afford to use any simple approach, such as downloading the entire data set, to verify if the data have been stored properly in the cloud.
Complete the reading assignment, and search the Library and Internet to find and study more references that discuss the security issues associated with big data and how to solve them. Based on the results of your research, discuss the following tasks:
· Identify 2 security issues associated with big data.
· What are the root causes of these 2 security issues?
· How can each of these 2 security issues be solved?