Development of a method for searсhing for similar account In social networks based on cluster analysis
DOI: 10.31673/2412-9070.2022.063842
DOI:
https://doi.org/10.31673/2412-9070.2022.063842Abstract
In recent decades, the number of Internet users has been growing significantly. At the same time, the number of people using social networks to communicate and receive information is growing. The active development of social networks has also had a significant impact on the advertising market, as the information about users collected by these networks allows advertisers to target ads with great precision, which has a significant impact on the growth of sales of goods and services. After all, by identifying similar accounts, you can easily determine that a product that one user likes is likely to attract another. In order to select the most appropriate method of cluster analysis for the chosen topic, a systematic analysis of cluster analysis methods was carried out. It has been analyzed that one of the approaches to finding similar accounts in social networks is to analyze accounts by certain characteristics and is widely used by modern companies that own social networks for targeted advertising. The analysis of clustering methods showed that there is no specific universal method for analyzing data clustering, which is why various clustering methods were analyzed and it was concluded that the improved k-means algorithm — k-means mini batch is best suited for finding similar accounts.
Keywords: cluster analysis method; information technology; cluster; clustering methods; simplified clustering algorithm; statistical analysis method; system analysis; big data.
References
1. Internet users 2017 forecast. [Електронний ресурс]. URL: https://stats.areppim.com/stats/stats_internetxfcstx2017.htm
2. Hadoop: The Definitive Guide. T. White, 2012.
3. Association Rule Mining: Models and Algorithms. Chengqi Zhang, 2002.
4. Big data market forecast worldwide from 2011 to 2026 [Електронний ресурс]. URL: https://www.statista.com/statistics/255970/global-big-data-market-forecast-by-segment/
5. Seetha Hari, Murty Narasimha, B. K. Tripathy. Modern Technologies for Big Data Classification and Clustering. IGI Global, 2015 р.
6. Grid-Based Clustering - STING, WaveCluster & CLIQUE [Електронний ресурс]. URL: https://www.datamining365.com/2020/04/grid-based-clustering.ht