Multiple-perspective consumer segmentation using improved weighted Fuzzy k-prototypes clustering and swarm intelligence algorithm for fresh apricot market
Main Article Content
Keywords
consumer segmentation, cluster analysis, MDPSO-WFKP algorithm, MDSSA-WFKP algorithm, precision marketing
Abstract
Leveraging clustering technology for consumer segmentation is crucial for discerning the nuanced differences among fresh apricot consumer groups and subsequently executing precise marketing strategies. To achieve a more comprehensive and lucid consumer segmentation and identify typical characteristics of apricot consumers in different clusters, this research constructs a novel multiple-perspective segmentation indicator system for fresh apricot consumers. Given the diverse degrees of importance and types of consumer segmentation variables, and the inherent sensitivity of the original Fuzzy k-prototypes (FKP) algorithm to clustering centers, we proposed the weighted Fuzzy k-prototypes (WFKP) algorithms for mixed data (MD) optimized by the particle swarm optimization (PSO) algorithm (MDPSO-WFKP) and mixed data sparrow search algorithm (SSA) (MDSSA-WFKP), both incorporating information entropy weighting for mixed attributes. We test the proposed algorithms on four University of California Irvine machine learning repository (UCI) datasets and the consumer segmentation dataset, and the performance of all selected evaluation indexes shows significant improvement. These findings unequivocally validate the efficacy of the proposed methodologies. Since the MDSSA-WFKP algorithm has the best comprehensive effect on the evaluation indexes, we use it to conduct in-depth apricot consumer segmentation research and find that the apricot consumers can be subdivided into three groups with differentiation: ‘Buddhist-like youths’, ‘Upscale attribute enthusiasts’, and ‘Quality-oriented consumers’. Finally, this paper gives the corresponding marketing suggestions based on the characteristics of the segmented groups.
References
Bannor R.K., Abele S., Kuwornu J.K., Oppong-Kyeremeh H., and Yeboah E.D. 2022. Consumer segmentation and preference for indigenous chicken products. J Agribus Dev Emerg Econ. 12(1): 75–93. /10.1108/JADEE-08-2020-0162
Bejaei M., Cliff M.A., and Singh A. 2020. Multiple correspondence and hierarchical cluster analyses for the profiling of fresh apple customers using data from two marketplaces. Foods. 9(7): 873. 10.3390/foods9070873
Bhattacharjee P., and Mitra P. 2020. A survey of density based clustering algorithms. Front Comput Sci. 15(1): 151308. 10.1007/s11704-019-9059-3
Chen N., Chen A., and Zhou L. 2001. Fuzzy k-prototypes algorithm for clustering mixed numeric and categorical valued data. J Softw. 12(8): 1107–1119.
Chen T.-C., Ibrahim Alazzawi F.J., Mavaluru D., Mahmudiono T., Enina Y., Chupradit S., et al. 2022. Application of data mining methods in grouping agricultural product customers. Math Probl Eng. 2022: 3942374. 10.1155/2022/3942374
Chen H., Li S., Wang C., and Xu S. 2021. Influencing factors of consumers’ purchase decision in wine online shopping and customer segmentation based on online review data. Liquor Making Sci Technol. (11): 127–132. 10.13746/j.njkj.2021070 (in Chinese)
Du M., and Wu F. 2022. Grid-based clustering using boundary detection. Entropy. 24(11): 1606. 10.3390/e24111606
Ezugwu A.E., Ikotun A.M., Oyelade O.O., Abualigah L., Agushaka J.O., Eke C.I., and Akinyelu A.A. 2022. A comprehensive survey of clustering algorithms: state-of-the-art machine learning applications, taxonomy, challenges, and future research prospects. Eng. Appl. Artif. Intell. 110: 104743. 10.1016/j.engappai.2022.104743
Ghadiri S.M.E., and Mazlumi K. 2020. Adaptive protection scheme for microgrids based on SOM clustering technique. Appl Soft Comput. 88: 1060. 10.1016/j.asoc.2020.106062
Guo Y., Liu R., and Jiang X. 2019. Fresh corn consumption by Beijing urban residents: market segmentation. Chin Agric Sci Bull. 35(32): 153–157. (in Chinese)
Hasan B.M.S., and Abdulazeez A.M. 2021. A review of principal component analysis algorithm for dimensionality reduction. J Soft Comput Data Mining. 2(1): 20–30. https://publisher.uthm.edu.my/ojs/index.php/jscdm/article/view/8032
Hegazi A., Taha A., and Selim M.M. 2021. An improved copy-move forgery detection based on density-based clustering and guaranteed outlier removal. J King Saud University Comput Inform Sci. 33 (9): 1055-1063. 10.1016/j.jksuci.2019.07.007
Huang Z. 1997. Clustering large data sets with mixed numeric and categorical values. In: Proceedings of the 1st Pacific-Asia conference on knowledge discovery and data mining (PAKDD). CiteSeer, pp. 21–34.
Kazbare L., van Trijp H.C., and Eskildsen J.K. 2010. A priori and post hoc segmentation in the design of healthy eating campaigns. J Market Commun. 16(1–2): 21–45. 10.1080/13527260903342712
Kennedy J., and Eberhart R. 1995. Particle swarm optimization. Proceedings of ICNN’95 International Conference on Neural Networks. IEEE. 4: 1942–1948. 10.1109/ICNN.1995.488968
Kiran A., and Vasumathi D. 2020. Data mining: min–max normalization based data perturbation technique for privacy preservation. In: Raju, K., Govardhan, A., Rani, B., Sridevi, R., Murty, M. (eds.) Proceedings of the third international conference on computational intelligence and informatics. Adv Intell Syst Comput. 1090: 723–734. 10.1007/978-981-15-1480-7_66
Kuesten C., Dang J., Nakagawa M., Bi J., and Meiselman H.L. 2022. Japanese consumer segmentation based on general self-efficacy psychographics data collected in a phytonutrient supplement study: influence on health behaviors, well-being, product involvement and liking. Food Quality Pref. 99: 104545. 10.1016/j.foodqual.2022.104545
Lee Y., Song S., Cho S., and Choi J. 2019. Document representation based on probabilistic word clustering in customer-voice classification. Pattern Anal Appl. 22: 221–232. 10.1007/s10044-018-00772-1
Li Y., Chu X., Tian D., Feng J., and Mu W. 2021. Customer segmentation using K-means clustering and the adaptive particle swarm optimization algorithm. Appl Soft Comput. 113: 107924. 10.1016/j.asoc.2021.107924
Mellal M.A., Tamazirt I., Tiar M., and Williams E.J. 2023. Optimal conventional and nonconventional machining processes via particle swarm optimization and flower pollination algorithm. Soft Comput. 28: 3847–3858 10.1007/s00500-023-09320-4
Mollaei S., Minaker L.M., Robinson D.T., Lynes J.K., and Dias G.M. 2023. Including sustainability factors in the derivation of eater profiles of young adults in Canada. Br Food J. 125(5): 1874–1894. 10.1108/BFJ-06-2022-0476
Ouyang H., Wang Z., Dai X., and Liu Z. 2015. A fuzzy K-prototypes clustering algorithm based on information gain. Comput Eng Sci (CES). 37(5): 1009–1014. (in Chinese)
Park H.-J., Ko J.-M., Lim J., and Hong J.-H. 2020. American consumers’ perception and acceptance of an ethnic food with strong flavor: a case study of Kimchi with varying levels of red pepper and fish sauce. J Sci Food Agric. 100(6): 2348–2357. 10.1002/jsfa.10106
Pradana C., Kusumawardani S., and Permanasari A. 2020. Comparison clustering performance based on moodle log mining. In: 3rd International Conference on Engineering Technology for Sustainable Development (ICET4SD) 23–24 October 2019, Yogyakarta, Indonesia. IOP Conference Series: Materials Science and Engineering, vol. 722. IOP Publishing, Philadelphia, PA; p. 722: 012012. 10.1088/1757-899X/722/1/012012
Prasetyo H. 2021. Pengelompokan wilayah menurut potensi fasilitas kesehatan dan kejadian COVID-19 menggunakan algoritma fuzzy k-prototypes. Technol J Ilmiah. 12(4): 223–227. 10.31602/tji.v12i4.5631
Shannon C.E. 1948. A mathematical theory of communication. Bell Syst Tech J. 27(3): 379–423. 10.1002/j.1538-7305.1948.tb01338.x
Shi Y., and Eberhart R.C. 1999. Empirical study of particle swarm optimization. In: Proceedings of the 1999 Congress on Evolutionary Computation – CEC99 (Cat. No. 99TH8406). IEEE. 3: 1945–1950. 10.1109/CEC.1999.785511
Shi Y., He M., Xie H., Tian D., and Feng J. 2022. Chinese consumers’ behavior and preference to fresh apricot. Chin Fruits. 2022(7): 84–90. 10.16626/j.cnki.issn1000-8047.2022.07.019
Singh D., and Singh B. 2020. Investigating the impact of data normalization on classification performance. Appl Soft Comput. 97: 105524. 10.1016/j.asoc.2019.105524
Sun Z., Zuo T., Liang D., Ming X., Chen Z., and Qiu S. 2021. GPHC: a heuristic clustering method to customer segmentation. Appl Soft Comput. 111: 107677. 10.1016/j.asoc.2021.107677
Taherdoost H., Sahibuddin S., and Jalaliyoon N. 2022. Exploratory factor analysis; concepts and theory. Adv Appl Pure Math. 27: 375–382.
Tohidi A., Mousavi S., Dourandish A., and Alizadeh P. 2023. Organic food market segmentation based on the neobehavioristic theory of consumer behavior. Br Food J. 125(3): 810–831. 10.1108/BFJ-12-2021-1269
Wang O., and Scrimgeour F. 2023. Consumer segmentation and motives for choice of cultured meat in two Chinese cities: Shanghai and Chengdu. Br Food J. 125(2): 396–414. 10.1108/BFJ-09-2021-0987
Wang J., and Zhu Y. 2005. Research on the weighting exponent in fuzzy K-prototypes algorithm. Comput Appl. 25(2): 348–351. (in Chinese)
Weber C.M., Ray D., Valverde A.A., Clark J.A., and Sharma K.S. 2022. Gaussian mixture model clustering algorithms for the analysis of high-precision mass measurements. Nucl Instrum Methods Phys Res Sect A. 1027: 166299. 10.1016/j.nima.2021.166299
Xue J., and Shen B. 2020. A novel swarm intelligence optimization approach: sparrow search algorithm. Syst Sci Control Eng. 8(1): 22–34. 10.1080/21642583.2019.1708830
Ye Q., and Liang G. 2010. Fuzzy K-prototypes clustering based on quantum genetic algorithm. Comput Eng Appl. 46(1): 112–115. 10.3778/j.issn.1002-8331.2010.01.035. (in Chinese)
Zhu Y., Tian D., and Yan F. 2020. Effectiveness of entropy weight method in decision-making. Math Probl Eng. 2020: 3564835. 10.1155/2020/3564835