In a cloud storage environment, file distribution and storage are handled by storage providers or by physical storage devices rented from third-party companies. Through centralized management and virtualization, files are integrated into resources that users can access. Common storage protocols include iSCSI, Fibre Channel, and CIFS, which transmit or store data at the block level (iSCSI, Fibre Channel) or the file level (CIFS). Moreover, because a cloud network spans a wide range of users and domains, it is quite likely that files saved by different users on the same storage device are highly similar. In addition, as the number of files grows, the administrator cannot guarantee that every storage node operates in an optimal state. The large number of files not only wastes hardware resources but also increases the control complexity of the data center, which further degrades the performance of the cloud storage system.
For this reason, to reduce the workload caused by duplicated files, this paper proposes a new data management structure, the Index Name Server (INS), which integrates data de-duplication with node optimization to enhance the performance of the cloud storage system. INS manages and optimizes the storage nodes according to client-side transmission conditions. With INS, each node can be kept as close as possible to its best operating state and matched to suitable clients. In this way, the performance of the cloud storage network can be improved efficiently, and files can be distributed sensibly to reduce the load on each node.
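To make the idea concrete, the following is a minimal sketch of how an index server could combine de-duplication with node selection. It is an illustration under stated assumptions, not the INS design itself: the names `Node`, `IndexNameServer`, and `store`, the use of SHA-256 as a content fingerprint, and the "least-loaded node" placement policy are all hypothetical choices made for this example.

```python
import hashlib
from dataclasses import dataclass, field


@dataclass
class Node:
    """Hypothetical storage node; `load` is stored bytes, a stand-in for any load metric."""
    name: str
    load: int = 0


@dataclass
class IndexNameServer:
    """Hypothetical index: maps content fingerprints to the node holding the data."""
    nodes: list
    index: dict = field(default_factory=dict)   # fingerprint -> node name
    owners: dict = field(default_factory=dict)  # fingerprint -> set of user ids

    def store(self, user: str, data: bytes) -> str:
        """Store `data` for `user` and return the name of the node that holds it."""
        fingerprint = hashlib.sha256(data).hexdigest()
        if fingerprint not in self.index:
            # New content: place it on the currently least-loaded node (assumed policy).
            node = min(self.nodes, key=lambda n: n.load)
            node.load += len(data)
            self.index[fingerprint] = node.name
        # Duplicate content is not stored again; only the ownership record grows.
        self.owners.setdefault(fingerprint, set()).add(user)
        return self.index[fingerprint]


ins = IndexNameServer([Node("node-a"), Node("node-b")])
first = ins.store("alice", b"report contents")
second = ins.store("bob", b"report contents")  # same bytes, so no second copy is written
```

In this sketch a duplicate upload only adds an ownership record, so storage consumption grows with the amount of distinct content rather than with the number of uploads. A real system would replace the byte-count load metric with measured node and client transmission conditions.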