Multiple-perspective consumer segmentation using improved weighted Fuzzy k-prototypes clustering and swarm intelligence algorithm for fresh apricot market

Yan Shi
Siyuan Zhang
Siwen Wang
Hui Xie
Jianying Feng

Keywords

consumer segmentation, cluster analysis, MDPSO-WFKP algorithm, MDSSA-WFKP algorithm, precision marketing

Abstract

Clustering-based consumer segmentation is crucial for discerning the nuanced differences among fresh apricot consumer groups and for executing precise marketing strategies. To achieve a more comprehensive and clearer segmentation and to identify the typical characteristics of apricot consumers in different clusters, this research constructs a novel multiple-perspective segmentation indicator system for fresh apricot consumers. Because consumer segmentation variables differ in importance and in type, and because the original Fuzzy k-prototypes (FKP) algorithm is sensitive to its initial clustering centers, we propose two weighted Fuzzy k-prototypes (WFKP) algorithms for mixed data (MD): one optimized by the particle swarm optimization (PSO) algorithm (MDPSO-WFKP) and one optimized by the sparrow search algorithm (SSA) (MDSSA-WFKP). Both incorporate information entropy weighting for mixed attributes. We test the proposed algorithms on four datasets from the University of California Irvine (UCI) machine learning repository and on the consumer segmentation dataset. All selected evaluation indexes show significant improvement, which validates the effectiveness of the proposed methods. Since the MDSSA-WFKP algorithm performs best overall on the evaluation indexes, we use it to conduct in-depth apricot consumer segmentation and find that apricot consumers can be subdivided into three distinct groups: ‘Buddhist-like youths’, ‘Upscale attribute enthusiasts’, and ‘Quality-oriented consumers’. Finally, this paper gives marketing suggestions based on the characteristics of each segmented group.
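
As a rough illustration of the clustering step described above, the following Python sketch computes a weighted mixed-attribute dissimilarity and the fuzzy membership of one consumer in each cluster. It is a minimal sketch under stated assumptions, not the authors' implementation: the function names, the example attributes, and the weight values are hypothetical, and the weights are passed in directly because the abstract does not give the entropy weighting formula. The swarm-based search for cluster centers (PSO or SSA) is not shown.

```python
def weighted_dissimilarity(x_num, x_cat, proto_num, proto_cat, w_num, w_cat):
    """Weighted squared Euclidean distance on numeric attributes plus
    weighted simple-matching mismatch on categorical attributes."""
    numeric = sum(w * (a - b) ** 2 for w, a, b in zip(w_num, x_num, proto_num))
    categorical = sum(w for w, a, b in zip(w_cat, x_cat, proto_cat) if a != b)
    return numeric + categorical


def fuzzy_memberships(distances, m=2.0):
    """Membership of one sample in each cluster, given its dissimilarity to
    each prototype and a fuzziness exponent m > 1 (fuzzy c-means style)."""
    # A sample lying exactly on one or more prototypes splits full membership among them.
    zeros = [d == 0 for d in distances]
    if any(zeros):
        return [1.0 / sum(zeros) if z else 0.0 for z in zeros]
    power = 1.0 / (m - 1.0)
    return [1.0 / sum((d_i / d_k) ** power for d_k in distances) for d_i in distances]


# Hypothetical consumer: two min-max scaled numeric attributes, two categorical ones.
sample_num, sample_cat = [0.2, 0.8], ["online", "weekly"]
prototypes = [
    ([0.1, 0.9], ["online", "monthly"]),
    ([0.9, 0.1], ["store", "weekly"]),
]
# Placeholder attribute weights (assumed values, not taken from the paper).
w_num, w_cat = [0.3, 0.2], [0.25, 0.25]

distances = [
    weighted_dissimilarity(sample_num, sample_cat, p_num, p_cat, w_num, w_cat)
    for p_num, p_cat in prototypes
]
memberships = fuzzy_memberships(distances)
print([round(u, 2) for u in memberships])  # [0.66, 0.34]
```

Under these assumed weights the consumer belongs mostly to the first cluster but keeps a non-trivial membership in the second, which is the behaviour that separates fuzzy from hard k-prototypes clustering.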
