Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
Next Article in Journal
Improving Elementary Students’ Geometric Understanding Through Augmented Reality and Its Performance Evaluation
Previous Article in Journal
Enhancing Cybersecurity: Hybrid Deep Learning Approaches to Smishing Attack Detection
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

An Inspiration Recommendation System for Automotive Styling Design Based on User Behavior Data and Group Preferences

1
School of Mechanical Engineering, Guizhou University, Guiyang 550000, China
2
Institute of Twenty-Seventh, China Electronics Technology Group Corporation, Zhengzhou 450000, China
*
Author to whom correspondence should be addressed.
Systems 2024, 12(11), 491; https://doi.org/10.3390/systems12110491
Submission received: 30 August 2024 / Revised: 8 November 2024 / Accepted: 12 November 2024 / Published: 14 November 2024

Abstract

:
Group preferences are crucial for Inspirational Solutions of Automotive Design (ISAD). However, sparse individual purchase behavior hinders the identification of group preferences. Therefore, a novel inspiration recommendation (IR) system based on multi-level mining of user behavior data is proposed. Firstly, the K-means algorithm is employed to cluster users based on a variety of features. The fixed association rule is then applied to filter and identify relevant subsets, forming the foundational basis for constructing a user portrait. The Nonlinear Bayesian Personalized Ranking (NBPR) is constructed to explore common preferences using explicit feedback. Finally, the item preference matrix is enriched with implicit feedback to compile a comprehensive recommendation list that caters to group preferences. Using a multi-user joint evaluation approach, we compare the performance of IR with baseline models across multiple metrics. This comparison demonstrates the robust reliability of the IR system and its ability to prioritize ISAD with preference-aligned groups. Our research overcomes data sparsity in the automotive recommendation system, providing a new method for embedding human elements in decision support systems.

1. Introduction

With the advent of smart manufacturing systems and Industry 4.0 technologies, computer systems are experiencing a profound transformation. This transformation not only brings new opportunities for automotive ergonomic design but also reshapes the understanding of the early stages. The early stages represent the most crucial, complex, and creative phases of development [1]. This process not only determines the level of innovation, cost, and market effectiveness of the final automotive product [2] but also significantly impacts the viability of new automotive products. During this phase, designers tackle open-ended and ill-defined problems [3]. Thus, designers must understand emerging design trends early to develop market-aligned solutions. Capturing effective design inspiration during this phase is critical, as the relevance of the problem solution to market needs is highly sensitive to design inspiration. However, this inspiration often arises from the designer’s long-term memory and past perceptions, supplemented by conscious references to existing designs and other information sources [4]. Therefore, designers face the challenge of locating suitable inspiration amidst vast and cluttered information sources [5]. To enhance user satisfaction [6], the automotive industry increasingly focuses on individual user needs and market changes while transitioning from production-based to service-based models [7]. This phenomenon highlights a new approach to the challenges faced by designers: constraining product design evolution to align with customer needs [8]. Consequently, high-fit design inspiration should be extracted from behavioral data reflecting user preferences. This process is crucial for ensuring that solutions are comprehensive and align closely with user preferences.
Simultaneously, designers now seek intelligent design inspiration through the widespread application of modern information technology. This demand has led to a significant research area: creative support systems. For instance, Yan yielded a systematic human–computer interactively creative design system, integrating knowledge base, and computer-aided thinking to enhance the creation of designs [9]. Recently year, Xinggang also proposed a knowledge base construction method to support design. This can be used to describe design inspiration derived from traditional motifs [10]. The goal of this system is to enhance the creativity of designers and intelligent agents by utilizing information technology tools. Efficiently satisfying consumers’ personalized demands has become a design consensus with the development of new economic growth points [11]. Consequently, personalized recommendation systems have emerged as a primary research direction within creative support systems. Currently, personalized recommendation systems generate valuable recommendations for customers based on implicit and explicit feedback data reflecting potential interests [12]. This research developed an effective method for obtaining design references and inspiration [13,14]. For instance, an efficient personalized recommendation system has been proposed based on user feedback data [15,16]. In the aforementioned literature, user preference features are captured and thus applied in the design of electromechanical parts and computers. Based on user reviews, functional suggestions of practical significance for smartphone application design were generated using the recommendation system [17]. Additionally, ridesharing company such as Uber enhances the user experience by heavily focusing on user comments within the recommender system [18]. Thus, recommendation systems can help designers not only save development time but also detect user preference features using computational techniques. Notably, individual customers are the target of personalized recommendation systems, which embed discrete user attribute labels to generate personalized recommendation lists. However, traditional systems underperform when adopted for durable products with obvious sparsity such as automobiles [19]. This is due to the unique requirements of automotive design recommendation systems, which differ from those of social and e-commerce services. The automotive sector must streamline its supply chain, establish production lines, and manufacture vehicles in bulk before market entry. Significant upfront costs necessitate a keen understanding of group preferences to ensure profitability and business sustainability. Furthermore, the diverse personalized consumer demands and the scarcity of purchasing behavior data—given that most consumers purchase only 1–2 vehicles—lead to persistent data sparsity. Designers’ decision-making based on specific or limited user data can reduce the potential customer base for certain car designs [20]. Simultaneously, this structural issue significantly affects the quality and advancement of Inspirational Solution of Automobile Design (ISAD) recommendation systems. ISAD refers to reference materials that stimulate creativity in automotive design, including images, physical objects, videos, and other multimedia sources. Consequently, the durable products sector, particularly the automotive industry, emphasizes the importance of understanding group preferences. However, current recommendation systems face challenges in overcoming data sparsity and identifying common interests among user groups. This renders research into car design inspiration recommendations a challenging and pressing area of human element study.
To address this, we introduce an innovative concept design inspiration recommendation (IR) system for automobiles. IR is a hybrid recommendation system oriented toward user group preferences for automobile modeling solutions, designed to inspire and assist designers. The purpose is to leverage user behavior data to analyze the relationships between user feature information and both implicit and explicit feedback in depth. Initially, the system employs similarity-based clustering based on the feature information of car users. It then identifies the specific association rule between user features and high-scoring styling assessment data to create user portraits. Subsequently, the system integrates a support vector machine (SVM) into the Bayesian recommendation framework, resulting in the Nonlinear Bayesian Personalized Ranking (NBPR). By utilizing NBPR, the system performs multi-dimensional explicit feedback within user profiles to identify preference-aligned groups. Finally, it constructs an item preference matrix using a scoring system based on implicit feedback, enabling the prediction of ISAD items belonging to preference-aligned groups.
The IR system is a novel hybrid recommendation system designed for scenarios with significantly sparse item preference data. Its strength lies in improving the precision of existing recommendation systems in capturing car user preferences, providing designers with a new approach to discovering ISADs. This research aligns with the transition from a production-only focus to human–computer interaction philosophies in car design, addressing the persistent challenge of data sparsity in durable consumer product recommendations.
The contributions of this research are outlined below:
  • The development of an IR system that leverages a three-dimensional integration of car user features, explicit feedback, and implicit feedback. The purpose is to inspire design concepts that resonate with the collective preferences of user groups, creating ISADs with a high match to the common preferences of aligned user groups.
  • Establishing a method for identifying groups with aligned preferences through user portraits. The fixed association rule is applied to filter and identify relevant subsets, forming the foundational basis for constructing a user portrait. Then, groups are shaped with high styling scores, setting the stage for ISAD recommendations based on shared preferences.
  • We introduce a method to address data sparsity in recommending design inspiration for durable products with low repurchase rates. Before NBPR’s biased recommendation, purchasing behavior guides preference identification, and expert scoring assesses similarity for car design elements of interest. The total score determines if the recommended parts align with user preferences. Additionally, a scoring system for preferences expands the range of user-preferred designs, mitigating the sparsity of the item preference matrix due to limited purchases, thereby enhancing the quality of ISAD recommendations.
  • We propose an evaluation approach for group preference recommendations, assessing recommendation performance of individual users within aligned user groups using metrics from CF, HS-CF, and BPR models. This method, coupled with preprocessing through similarity clustering and common demand mining, indirectly evaluates the ability of IR to recommend preferences for the user group. It offers an objective standard for evaluating group preference recommendations, effectively tackling the challenge of diverse user preferences that render existing metrics inapplicable.
The general structure of this article is as follows:
Section 2 presents the status of related work. Section 3 constructs the methodology of IR: framework, key phrase representation and derivation, and the approach to evaluating the performance. Section 4 presents a case study of the IR system, demonstrating the specific process of using the IR system to recommend ISADs. Section 5 evaluates the performance. Section 6 concludes this study by summarizing the key findings and outlining potential directions for future research.

2. Related Work

Section 2.1 provides an overview of applications of mining user behavior data in different areas. Next, Section 2.2 describes the prior studies on recommendation systems. Finally, Section 2.3 summarizes research related to acquiring inspiration for automotive design.

2.1. Applications of Mining User Behavior Data

User behavior data include information generated through interactions between users and the subjects they engage with [21], encompassing feature information, explicit feedback, and implicit feedback related to evaluation attributes. Feature information aids in identifying user characteristics, such as age, gender, and place of residence. Explicit feedback constitutes direct and clear evaluations from users, typically resulting from deliberation and expressed through language, scores, choices, or comments. In contrast, implicit feedback reflects preferences and needs indirectly through behaviors such as likes, comments, retweets, and purchases.
A vast amount of user behavior data from e-commerce and social media [22] has emerged with advancements in Internet technology, presenting significant potential for data mining. Effective mining of these data can inform business strategies and improve the quality of services offered to users [23]. Log analysis, an early data-mining technique, captures various interactive behaviors for subsequent analysis, a widely established practice [24,25]. However, many of these data are implicit—such as browsing and clicking activities—which can contain significant noise and may only indicate potential interests [26]. To fully utilize user attributes, researchers have started incorporating explicit feedback into the analysis of user behavior. The combination of implicit and explicit feedback in analyzing user behavior data has become mainstream. For example, comparing explicit likes and play counts of music tracks has shown that a combined approach can unlock greater potential in online music recommendations [27]. Furthermore, personalized filtering systems that utilize both explicit evaluations and implicit feedback from web pages can effectively prioritize information in the online environment, presenting only users’ interests [28].
Jointly mining implicit and explicit feedback fully leverages the potential of user behavior data. This method capitalizes on the accuracy of explicit feedback in reflecting user interests [29], while also utilizing abundant implicit feedback to enhance the precision of mining outcomes [30]. This approach is widely applied in internet projects involving user behavior data for tasks such as searching, filtering, or recommending.
Notably, user behavior data increasingly serve multiple supportive functions in the product design domain. Innovative designs, such as vacuum cleaners and office chairs, based on explicit feedback, improve the accuracy of gauging consumer purchase intentions [31]. In automotive design, a dynamic demand mining method for explicit feedback data has been proposed to effectively analyze time-related product preferences, guiding the strategic improvement of future product generations [32]. By identifying the link between user attribute information and visual aesthetic experiences, a strategy has been proposed to assist decision-makers in segmenting the laptop market and shaping brand image [33]. In summary, explicit feedback is frequently mined in product design. Valuable product design insights can be gained through the collaborative mining of multiple datasets. User behavior data, as a carrier of feedback that reflects users’ inner needs and experiences, are closely interconnected rather than existing in isolation. Therefore, research on joint mining and establishing connections between data deserves more attention and offers great value for design guidance.
With the advent of intelligent manufacturing [34] and the Internet of Everything [35], the richness of user behavior data in the product design process has greatly exceeded that of the past. Designers can now gather explicit and implicit user feedback through various channels, considering the support and confidence of different preference expressions. This development improves the alignment between feedback data and user feature information, laying the foundation for new user behavior data mining methods. The market value of mining methods tailored to new business needs is immense. This is because related research will leverage the benefits provided by the abundance of user behavior data in modern society, offering reliable design support for the dynamic needs of users.

2.2. Recommendation System

With the growth of the Internet and smart devices, the efficiency of information exchange has significantly improved. However, this has led to an overwhelming amount of information, significantly increasing the burden of choice. To address this challenge, recommendation systems have become essential [36]. These systems analyze both implicit and explicit user data, along with data from similar user groups, to generate a list of items that align with user preferences and characteristics. The goal is to prioritize content of interest and reorganize the information display [37]. Therefore, recommendation systems alleviate the strain of information overload and enhance user retention and the overall Internet experience [38].
The first recommendation systems to emerge were content-based filtering (CB) systems [39]. CB systems analyze the content features of items that users have favored in the past to recommend similar items. As early Internet systems increased network interactions, preference information became more readily available, facilitating the development of CB systems [40]. The widespread use of CB systems can be observed in e-commerce and social networks [41,42]. However, CB systems have a significant drawback: their reliance on user preference similarity makes it difficult to predict new items. This reliance means that all recommendations are similar to previously interacted items, limiting their potential to discover new preferences [41]. Based on this understanding, collaborative filtering (CF) systems [43] have become the foundation for subsequent recommendation system research. Unlike CB systems, CF systems do not predict based on item–user relevance; instead, they utilize a preference database built from user evaluations. This approach effectively addresses the limitations of CB systems by allowing evaluation data to flow between diverse items. Researchers have since expanded CF systems into various applications. User-based collaborative filtering (User-CF) [44] recommends items based on ratings from similar user groups. Its primary application is in social networks with strong time sensitivity. Item-based collaborative filtering (Item-CF) [45] predicts recommendations based on item similarities with a rating matrix of users and items. It is mainly applied in e-commerce, where the user base exceeds the number of items. Bayesian personalized ranking (BPR) [46] is a CF variant that uses implicit feedback data such as clicks and purchases. The idea is to optimize item preference matrices for biased recommendations, making it suitable for scenarios rich in implicit feedback. Although CF systems are more widely applied than CB systems, their reliance on item rating data limits recommendation accuracy. To overcome this limitation, researchers developed hybrid recommendation (HS) systems [47]. HS systems combine CB and CF systems to mitigate individual limitations and enhance performance [48], a widely accepted approach in recommendation systems.
Leveraging HS systems, researchers have developed a variety of filtering techniques applied across numerous service sectors [49,50,51,52]. Particularly in social networking and e-commerce, the HS system has shown a strong ability to capture user preferences, attracting the interest of designers. For designers, the ongoing challenge of aligning user preferences with product design can be addressed by establishing a feedback mechanism. This will enhance user satisfaction starting from the conceptual design phase. As a result, integrating recommendation systems into product design services has produced positive results. SHARMA [53] developed an interactive fashion design recommendation system, which combines customer sentiments, design knowledge bases, and virtual garment visualization. CHEN [54] introduced a computer-aided product pattern design recommendation system that incorporates user-perceived imagery to precisely target user needs. These advancements have not only equipped designers with effective tools for understanding user preferences but also validated the applicability of recommendation systems in the design process.
Although recommendation systems connect user preferences with product design, they face significant challenges in the context of innovative design, primarily due to data sparsity. Undoubtedly, insufficient data results in a significant decline in system performance [55]. In social networks, the challenge of data sparsity has attracted significant attention, leading to the development of novel recommendation models designed to mitigate the issue between sessions [56]. Current research in design has focused on products with high user interaction frequencies, creating a gap in developing recommendation systems for durable consumer products. These products are characterized by long lifecycles and substantial initial investments, with infrequent user interaction and sparse data.
In this context, automobiles are representative durable consumer products. With the rise of Industry 4.0, the automotive industry has adopted advanced technologies such as artificial intelligence (AI), the Internet of Things (IoT), and big data analytics, revolutionizing traditional processes across design, production, and service [57]. Recommender systems, key components of this industrial shift, are increasingly being used to assist designers and engineers in analyzing large datasets to identify patterns and generate innovative ideas. In particular, recommender systems in automotive design act as intermediaries between user feedback, historical data, and new market needs, delivering recommendations that enhance both aesthetic and functional aspects of vehicles [58]. Systems utilizing AI and machine learning offer insights derived from connected vehicles and user interaction data, significantly supporting early-stage design decisions critical for innovation [59]. Despite their potential, these systems have traditionally been more focused on later phases such as customer customization and manufacturing efficiency, leaving a gap in their application to the conceptual phase of automotive design [60].
As the manufacturing industry increasingly transitions from a production-centric model to a service-oriented one, durable goods companies, including those in the automotive sector, are recognizing the immense value of design insights gathered from user feedback. In automotive design, user preferences and real-world usage data captured through connected car technologies offer designers crucial insights to shape new inspirations and innovations [61]. However, the development of recommender systems tailored specifically for the early stages of automotive design remains a key challenge. Addressing issues like data sparsity in early-stage design processes can broaden the scope of these systems, turning them into indispensable tools that enrich the ideation phase and foster the creation of innovative, user-centered designs [62]. By seamlessly integrating recommender systems into the design process, the automotive industry can not only enhance creativity but also push the boundaries of what’s possible in vehicle development, leading to groundbreaking and customer-oriented innovations [63].

2.3. Acquiring Inspiration for Automotive Design

The automotive design process involves progressively expanding on an initial concept, refining details, and ultimately bringing the product to life [64]. Computational technology plays a crucial role in this process. It provides efficient collaborative design platforms [65]. Designers ensure that the transition of design inspiration from two-dimensional sketches to three-dimensional models and prototypes remains accurate. However, the application of computational technologies has predominantly focused on the conceptualization and realization phases, with less emphasis on the early stages of inspiration sourcing [66]. Inspiration sourcing refers to the intentional reference to past designs or other objects [67] to produce new creative sparks. It was not until the late 1980s that Cooper [68] first established the critical link between the early stages of new product development and product success. Then, Reinertsen [69] introduced the term “fuzzy front end (FFE)” to highlight the significance of early inspiration research, thus encouraging further studies in this area.
In contrast to stability and rigorous collaboration during the inspiration realization phase, the selection of inspiration sources in the early stages is characterized by variability and independence. On one hand, when designers focus on originality to satisfy users’ desire for novelty, they often seek inspiration from concepts with a high degree of freedom [70]. This is because greater categorical differences lead to stronger innovation. Mercedes-Benz adopted biomimicry, incorporating the shape of the boxfish into car design, providing excellent aerodynamics and capturing user attention [71]. On the other hand, iterative design emphasizes understanding and sustaining user preferences to maintain family-style designs that enhance brand recognition [72]. The primary reason is that brand recognition plays a critical role in purchase decisions [64]. Therefore, drawing design inspiration from users’ feedback is an effective method to ensure design success.
Today, the product design field, supported by computational technology, has access to vast image and text databases. However, extracting design inspiration that aligns with user needs and expectations remains a significant challenge. Content-based image retrieval has been developed for this purpose, but it struggles with the mismatch between its limited feature description capabilities and the rich semantics of user feedback [73]. With advancements in AI and knowledge base systems, semantic-based image retrieval methods have emerged, such as Kansei Engineering [74]. Kansei Engineering captures emotional responses in user semantics to stimulate inspiration and has been widely applied [75,76]. A semantic-based inspiration source retrieval model [77] was subsequently developed to bridge the “semantic gap” between design inspiration images and Kansei words. However, there is often a gap between user feedback and actual purchase behavior, resulting in a misalignment between design inspirations and true user preferences. Additionally, the limited amount of effective feedback data is insufficient to support the automotive industry with high confidence. Therefore, it is necessary to develop design inspiration source screening technology based on user purchase behavior. For instance, Xinggang proposed an optimization method for automobile modeling design based on key feature data distribution, addressing the issue of subjective noise commonly found in traditional design processes [78]. This method enhances the accuracy and objectivity of design by focusing on the precise distribution of critical features, reducing reliance on subjective judgment. The exploration should complement existing retrieval methods to create a comprehensive automotive design inspiration retrieval matrix. A dynamic channel established for inspiration exchange between designers and users will help iterative products sustain their market share.

3. Materials and Methods

3.1. Framework

Currently, many automotive firms adopt an iterative strategy, making selective modifications and refinements to established systems to maintain brand recognition. The focus is primarily on design elements that align with user demands and preferences, guiding the design direction. User preferences fundamentally influence the iterative vehicle design process. Therefore, this manuscript introduces an IR system built on the analysis of user behavioral data. Specifically, the system is designed for groups with similar preferences. The framework is depicted in Figure 1.
The IR system comprises two sequential data processing components: the automotive user portrait module and the preference recommendation module. The approach to the automotive user portrait module involves creating an information perception space and discovering the special association rule. Firstly, as many user behavior data of the target product are collected as possible, including users’ personal information, comments on products, preferred product types, browsing times, and other traces of Internet multimodal interaction, to establish the original dataset. Then, we preprocess this dataset to remove noise data and invalid samples. In particular, the dataset is structured into a high-dimensional vector matrix that can be recognized by a computer, where each vector contains all the behavior data of a user. User behavior data are then categorized into feature information, explicit feedback, and implicit feedback, each labeled with various tags. After categorizing the user behavior data in the perception space, the feature information is clustered using K-means. This approach clusters users with similar characteristic attributes to form initial user groups. However, the car styling preferences within these groups remain unclear, as they are not factored into the clustering process. The distance between users is significantly correlated with the looseness of the dataset, making it insufficient to establish groups with shared preferences at this stage. Further data mining is necessary to achieve the goal of group preference recommendations for the IR system. Hence, the fixed association rule was adopted to select a suitable cluster, through the filter of the highest support and the minimum confidence. The frequent item set is then used to further eliminate noise data within the clusters, aiming to identify user groups with consistent feature information and automobile modeling evaluations. From these sets, a specific association rule linking user features with scoring assessments of explicit feedback is extracted to form user portraits.
In the preference recommendation module, the NBPR recommendation processes the user portrait to output the collective preferences’ ISAD. The NBPR integrates a nonlinear classifier (SVM) with a Bayesian personalized ranking (BPR) to address the challenges of preference recommendation for user groups with diverse needs. The process begins by segmenting the nonlinear explicit feedback to further identify preference-aligned groups. A high-density scoring matrix is then constructed to enrich the preference set of the groups. Following this, a Bayesian personalized ranking is applied to the item preference matrix. Finally, the recommendations from all users are aggregated to compile a group preference recommendation list. In the case of the automotive home website, we achieved the recommendation of group-preferred models based on the user behavior data processed with the IR system.

3.2. Key Phrase Representation and Derivation

3.2.1. Clustering Feature Information and Determining the Special Association Rule

We start by clustering user behavior data. The complex social division of labor and expectation constraints lead to distinct car preferences among user groups. Meanwhile, within the “information cocoon” effect, user groups with similar feature information tend to have homogeneous aesthetics and design demands. K-means is a widely used unsupervised learning algorithm for clustering. It efficiently clusters high-dimensional vector data, making it a popular choice in various applications [79]. Its main strengths are simplicity, rapid convergence, and objectivity. It can iteratively divide data into K clusters, grouping similar samples. When the user behavior dataset is D, then,
D = { d 1 , d 2 , d n }
where d i ( 1 i n ) is a multi-dimensional vector encapsulating feature information sub-sectors d i f (f means feature information) and evaluation attribute sub-vectors d i e (e means evaluation attribute). Due to the workflow of the IR system progressively cleaning user behavior data to identify user groups with shared preferences, we first cluster users based on their characteristic attributes, forming initial groups with similar traits. The users’ feedback information is then processed in the subsequent stages of the algorithm. The purpose lies in the refinement of these clusters, moving from general similarities to uncovering more precise group preferences. Hence, d i f meets the conditions for applying K-means. We focus on the user behavior data subset D f for clustering, D f = { d 1 f , d 2 f , d n f } .
After segmenting and normalizing D f , the number of clusters K can be determined with reference to elbow silhouette score methods, which are widely adopted [80,81]. Then, we randomly select K users as initial cluster centers μ f c ( 1 f c k ), and we introduce a loss function S S E based on the sum of squared errors,
S S E = f c = 1 k d i f C f c d i f μ f c 2
where C f c is the cluster of d i f .
The process involves calculating the Euclidean distance between each user and the nearest cluster center to assign the user to the corresponding cluster. Then, the cluster centers are recalculated based on the average of user dimensions within each cluster. This iterative process repeats until the loss function reaches its minimum value, indicating that the cluster centers and the clustering have converged. The dataset D can be then divided into K-cluster D f c .
As an unsupervised learning algorithm, K-means does not consider attribute labels on data points, clustering solely based on the distance between them. As a result, the target group feature information that designers focus on is not identified in this process, necessitating further clustering with this feature as a constraint. Given that the goal of the IR system is to recommend shapes and inspire design, the stylings preferred by users in the target group cannot be ignored. To ensure the recommendation list inspires meaningful design insights, target group feature information and preferred stylings will be used as labels for supervised learning in subsequent data processing, allowing for further clustering among users within each cluster.
Hence, the fixed association rule between the target users and their scoring styling assessment data should be considered. The concept of association rules is often introduced within the context of the Apriori algorithm. Introduced by data science expert Agrawal in 1993, this algorithm was designed to uncover association rules among goods in transaction databases [82]. It operates by iteratively identifying frequent item sets and strong association rules that exceed the minimum support and confidence thresholds. This has led to its widespread use in business, mobile communications, and healthcare sectors. For instance, it helps college freshmen select suitable course combinations based on their interests [83]. It also supports convenience stores in devising sales strategies and product endorsements.
However, unlike typical association rule mining, which involves discovering unknown rules, the association rule used in the IR system is fixed and unique. It is referred to as a fixed association rule, because it is tailored to the automotive styling recommendation objective of the IR system, necessitating adjustments to the Apriori algorithm’s application.
Specifically, this recommendation goal prompts us to focus on specific user behavior feedback related to automobile styling. Thereby, the consequent y in the association rules can be stabilized and refined, focusing on explicit styling feedback data with high evaluation scores. This fixed criterion ensures that data processing aligns with the IR system’s objectives. For instance, if a user values the emotional impact of a car’s appearance and body lines, they will prioritize styling as a key purchase factor and rate relevant models highly. This preferred car design, which satisfies user preferences, should be identified and referenced for the recommendation list. By consistently concentrating on such high-scoring interactions, our recommendations will align more closely with users’ styling preferences in the corresponding group.
Different user segments often display unique preferences yet may have similar explicit feedback scores. For example, users who primarily browse luxury car models versus those interested in compact, economy vehicles exhibit distinct engagement profiles, but both groups tend to assign high evaluation scores to their selected stylings. Thus, simply considering a user’s preference score for design is insufficient; we must also identify and filter characteristic attribute information for each user. Combining these segments would likely reduce the recommendation accuracy of the IR system. This highlights the need to fix the antecedent x—representing the target group’s feature information—in our association rules, ensuring that subsequent data mining remains focused on the target user group.
Therefore, the target group’s feature information is fixed as antecedent x, and the high styling assessment score is fixed as consequent y. The fixed association rule ensures that automotive styling recommendations are tailored to meet the explicit needs of a clearly defined user segment.
Notably, the fixed association rule does not employ the Apriori algorithm, as there is no operation involving the mining of frequent item sets. Instead, the introduction of fixed association rules is primarily motivated by the need to utilize confidence and support—key metrics derived from the Apriori algorithm and association rules. These metrics serve as filters, effectively screening the clustering subsets generated by the K-means algorithm.
In this case, Support is considered as the ratio of users containing the fixed association rule to the total user population. This can be expressed with the following formula:
S u p p o r t ( x = > y ) = P ( x y ) = n u m ( x y ) n u m ( a l l )
where x = > y represents the association rule expression, n u m ( x y ) is the number of users conforming to the fixed association rule, and n u m ( a l l ) is the total number of users.
Confidence represents the frequency at which users containing both the antecedent x and the consequent y of the fixed association rule appear, given that the antecedent x is present. The relevant formula is as follows:
C o n f i d e n c e ( x = > y ) = P ( y | x ) = n u m ( x y ) n u m ( x )
where the n u m ( x ) is the number of users conforming to the antecedent x.
Subsequently, the confidence of the fixed association rule within each D f c can be calculated, and a minimum confidence threshold is established for filtering.
For the association rule x = > y , we select the cluster D f c with the highest support that also meets the minimum confidence level. This constraint enables rapid identification of the structural characteristics of the fixed association rule in clustering. Within the framework of the fixed association rule, we establish an acceptable minimum confidence level to evaluate the proportion of users with preferred car stylings in the target group across various clusters. An appropriate minimum confidence level can be set for selecting the representative subset after clustering. Meanwhile, a high support level indicates that the fixed association rule occurs more frequently within the clusters. Consequently, support and confidence are compared across multiple clusters to select the most suitable one for constructing the user portrait. In contrast, this approach does not aim to mine strong association rules within a cluster, as the fixed association rule has already been established.
In this context, the results of K-means clustering primarily serve two functions. One is optimizing the data scale and computational efficiency. Adequate user behavior data are essential to encompass enough target groups and diverse preferences, which is necessary for accurate recommendations from the IR system. This data demand creates significant computational pressure; however, K-means clustering can partition the dataset into multiple subsets. Subsequently, the subsets containing the fixed association rule can be selected, reducing resource consumption and enhancing the IR system’s operational efficiency.
The second function is filtering system noise in the fixed association rule. To recommend for a wider range of users, designers need to use small-dimensional feature information as antecedent x to include more user-preferred models. However, this tolerance threshold can introduce system noise, where behavior data from a unique group lower recommendation accuracy. Support and confidence indicators from K-means can act as filters, helping to assess whether the rule holds greater significance within a specific cluster. This approach ensures that the fixed association rule is more representative. The cluster represents broader preferences as the source dataset for the fixed association rule effectively removes the system noise, enhancing recommendation accuracy.
Next, we apply this filter to organize the frequent item sets, which is the collection of users that meet the fixed association rule, and the user portrait D u p ( D u p D f c ) is thus formed. The user portrait is represented as a multi-dimensional vector matrix, where only user behavior data that conform to the fixed association rule are considered. These can be described as feature information (with association antecedent x), explicit feedback data (with association consequent y), and preferred models.
The user portrait D u p is the initial output of the IR system, emphasizing the characteristics and behavioral patterns of the target user group. Since some dimensions of these characteristics, such as age, change over time and the recommendation model is future-oriented, we begin by clustering users based on their similar attributes. This approach filters out noisy data, ensuring that the consumer traits and intrinsic consumption tendencies of the identified user groups align closely with the design goals of the IR system’s group recommendations. The process also involves deriving the fixed association rule from highly rated explicit feedback data. As a result, automotive styling becomes the focal point of the IR system, with all subsequent analyses centered around the user portrait D u p developed from the fixed association rule.
In addition, the user portrait is constructed using a fixed association rule, while excluding explicit feedback data that may highlight significant differences and conflicting preferences within user groups. Although the variation in user portrait data is often dismissed as noise, it holds valuable insights for automotive products characterized by high sparsity. Economic constraints typically result in individual users owning only one or two cars over extended periods, despite their interest in a broader range of models. This discrepancy poses challenges in capturing user preferences through behavioral data, leading to persistent data sparsity in recommendations for consumer durables like cars. However, placing a single user within a group of users with similar consumption characteristics can reveal convergent consumption tendencies, thereby illuminating the user’s preferred vehicles from multiple perspectives. Recognizing this feature effectively mitigates data sparsity and enhances the potential for accurate recommendations. Consequently, we construct a user portrait with sufficiently diverse car shapes. To further ensure consistency in preferences and minimize noise, we process the user portraits using a nonlinear classifier during the subsequent NBPR construction (Section 3.2.2).

3.2.2. Development and Recommendation of NBPR

The pool of purchased models reflects the diverse directions of user preferences, but the user portrait fails to capture the collective preferences of the target user group. To address this issue, common preferences derived from multi-dimensional explicit feedback are incorporated as prerequisites in the final Top-N recommendation process. NBPR is employed to meet this requirement, and the recommendation process is illustrated in Figure 2.
The D u p is accepted as input of NBPR. Consequently, we refine D u p through nonlinear classification based on explicit feedback data. The objective is to constitute the preference-aligned groups, thereby preventing performance issues arising from diverse user preferences. A dense user–item preference matrix is constructed, allocating 80% of the users’ preferred models to the training set and 20% to the test set. In the training set, the user matrices W and H are determined using maximum a posteriori estimation. The test set is used to improve recommendation accuracy by comparing predictions with the ground truth, ensuring that the Top-N recommendations align more closely with user expectations. After the preference scores for all models are generated, they are aggregated to produce a Top-N recommendation list.
The specific steps are as follows. For linearly separable data M,
M = ( y 1 , z 1 ) , , ( y n , z n )
where y 1 is the characteristic vector, z i is category identification of user evaluation, and M is an ideal multi-dimensional linear separable dataset. z i = + 1 , y i ω 1 , z i = 1 , y i ω 2 .
Given the inseparable nature of multi-dimensional explicit feedback data, a non-negative ξ i is introduced, with the following constraints,
z i ω T y i + ω 0 1 ξ i
where ξ i 0 , i = 1 , , n . A constant C ( C > 0 ) is introduced, and the original optimization of the nonlinear sample set is as follows,
m i n ω , ω 0 Γ S V M ω , ω 0 = 1 2 ω 2 + C i = 1 n ξ i
We utilize the Lagrange function to optimize ω , ω 0 , ξ and α , β with the minimum and maximum, respectively.
ω = i = 1 n α i z i y i
i = 1 n α i z i y i ,       C α i β i = 0
Leading back to the Lagrange function,
L = 1 2 ω 2 + C i = 1 n ξ i i = 1 n α i z i ω T y i + ω 0 1 + ξ i i = 1 n β i ξ i
m a x α L α = i = 1 n α i 1 2 i = 1 n j = 1 n α i α j z i z j y i T y j
where β i 0 , α i C , C α i 0 , i = 1 , , n , i = 1 n α i z i = 0 .
With kernel function κ ( y i , z i ) , the nonlinear classification of D u p can be realized. Then, the data above the hyperplane are regarded as preference-aligned groups.
Each user in the training set has purchased only one or two models. These purchased models are established as the benchmark for reference, with styling design used as the basis for similarity screening. The screening focuses on car sets within the preference-aligned groups, with similarity assessments conducted through expert evaluations. The design elements of interest to the designers are quantified into multiple features, with responses reflecting different levels of similarity. Once the questionnaires are collected, the total similarity scores are used as a measure of user preferences. Filtering criteria are then applied to select models that align with user preferences, effectively addressing the issue of sparsity in preferred products. Subsequently, a dense training set T s is constructed,
T s = { u , i , j | i , j I : i j }
where u , i , j represents that compared with model j, and user u prefers model i, noted i > u j .
According to Bayes theorem,
P ( θ | > u ) P ( > u | θ ) P θ
where P ( W , H | > u ) is estimated based on maximum a posteriori.
For 3–13, the following formula can be further obtained,
P > u | θ = u , i , j T s P ( i > u j | θ )
where P i > u j θ = σ x ¯ u i j θ   σ x is the sigmoid function, x ¯ u i j θ is the arbitrary function of the parameter vector θ , and
x ¯ u i j θ = x ¯ u i θ x ¯ u j θ
The loss function is
O θ = u i j T s ln σ x ¯ u i x ¯ u j + λ θ 2
where λ is the penalty parameter, and θ 1 , 2 . Furthermore, W and H will be obtained. Each user’s score of preference for any model can be predicted once W and H have iterated to convergence.
Finally, the trained NBPR is used to recommend models preferred by the group. For each user, NBPR generates a TOP-N list of recommendations. Depending on the length n of the list, this output can include all models from the item set, excluding those the user has already purchased. The recommended models are arranged sequentially based on predicted preferences. These TOP-N lists, derived from the same item set, may appear across multiple user lists but in varying positions, depending on individual preferences.
To identify common preferences among users with similar tastes, we integrate these TOP-N lists. The rationale is that these users can be considered to have similar preferences, as determined through comprehensive similarity mining and noise reduction. Therefore, the influence of uncertainty and outliers can be reduced by integrating the recommendation lists based on these preferences, making the prediction results more robust. The specific process involves recording the preference score of each model from all users’ TOP-N lists, then averaging these scores and re-ranking the models to create a new TOP-N list. This list best reflects common preferences and serves as a recommendation list that represents the group’s preferences, forming the final output of the IR system.

3.3. IR System Evaluation

Since IR focuses on group preferences, no clear model exists as a reference for recommendations and evaluations. This discrepancy causes an inconsistency in the vector dimensions between the research subjects in the IR system and conventional baseline models. To address this, users with high-scoring styling assessments are selected as the target users for both the IR and baseline models. Once the recommendation lists are obtained, the models purchased by these users serve as the benchmark for evaluating preferred models. Additionally, this study employed a multi-dimensional set of evaluation metrics (Section 3.3.2) to assess model performance.

3.3.1. Baseline Models

While traditional recommendation systems focus on specific segments of user behavior data, the IR system takes into account all aspects of user behavior. Thus, a selection of classic and representative recommendation systems forms the baseline model group. The aim is to analyze the IR recommendation effects from various perspectives. The baseline model group includes the User-CF recommendation system, which emphasizes feature information; the HS-CF hybrid recommendation system, which considers both feature information and explicit feedback; and the BPR recommendation system, which focuses on implicit feedback. Together, these models account for all user behavior data, meeting the criteria for forming the baseline model group.
The User-CF model identifies users with similar feature information to the target user and recommends models they have purchased. The similarity between users, based on the multi-dimensionality of attribute information, is measured using cosine similarity.
cos i n e s i m i l a r i t y ( d 1 , d 2 ) = d 1 d 2 d 1 d 2
The HS-CF hybrid model combines User-CF and Item-CF collaborative models, representing a type of HS recommendation system. The HS-CF model integrates the recommendation results from both collaborative filters. The synergy between these two approaches enhances the accuracy and personalization of the recommendations.
In contrast to collaborative filtering that requires explicit feedback, BPR concentrates on implicit user feedback, specifically purchase behavior in this study. It generates a predictive sorting matrix for TOP-N recommendations by maximum a posteriori with a simplified Bayes formula.

3.3.2. Evaluation Metrics

Automobiles are classified as high-value consumer products with substantial prices. Consequently, the number of models purchased by each user is typically low, mostly ranging from one to two. In scenarios with a limited number of preferred items, using evaluation metrics that focus on unordered preferences may not be suitable. Therefore, this research utilizes ranking-based metrics such as F1-SCORE, HR, MRR, and MAP. These metrics are appropriate for evaluating the quality of recommendation lists with a limited number of preferred products.
The F1-SCORE [84] is a composite metric that considers both precision and recall, providing an overall measure of performance. Precision refers to the proportion of recommended items that the user has purchased from the list. The formula for calculating is,
p r e c i s o n @ n = 1 n i = 1 n r e l i
where r e l i represents the marking score of the i-th recommended item, and n represents the length of the recommendation list.
Recall refers to the proportion of all purchased models in the recommendation list. Through calculating the average of the precision and recall, the result of the F1-SCORE will be generated.
MRR [85] (Mean Reciprocal Rank) emphasizes the position of purchased models in the recommendation list. The formula is,
M R R = 1 Q i = 1 Q 1 r a n k i
where the Q is the number of purchased models, and r a n k i represents the ranking of i-th purchased models. MRR reflects the average ranking of preferred products in the recommendation list. A higher MRR indicates better system performance.
HR [86] (hit ratio) highlights the accuracy. It focuses on whether the recommendation list includes the user’s preferred products. The formula is,
H R = 1 A i = 1 A h i t s ( i )
where the A is the number of target users. HR involves the average hit rate of purchased models, with a higher value indicating better system performance.
MAP [87] (Mean Average Precision) evaluates whether the recommended items can accurately meet user preferences given a certain list length. For each user, the average precision within the list length is calculated, and the formula for this is,
A v e r a g e p r e c i s i o n = 1 n i = 1 n p r e c i s i o n @ i
After obtaining the average precision, the mean of all average precisions is taken to determine MAP. A higher MAP value indicates that the system is better at recommending products that meet the user’s preferences.

4. Case Study

In this section, we adopted a dataset of styling to illustrate the application of the IR system and assess the performance of the recommendation list.

4.1. Experimental Data Acquisition and Preprocessing

The experiments were conducted on a Windows 10.0 operating system, utilizing an Intel(R) Xeon(R) Silver 4210R CPU @2.40 GHz processor. Python 3.8 was employed for the construction of the IR system.
The first step was to gather sufficient and diverse user behavior data. Compared to offline questionnaire surveys, collecting online user behavior data is more convenient, representative, and unrestricted by geographic or demographic limitations. As a result, it has become widely adopted in research on recommendation systems [88,89]. Thus, we extracted public datasets from the Auto Home website (www.autohome.com) for the period from January to December 2023 using web crawlers. This active car forum attracts a substantial number of visitors, providing a wealth of diverse user behavior data as many users seek information about automobiles and share their experiences. In recent years, it has served as a valuable resource for numerous research groups [90,91]. In this case, we focused on the word-of-mouth channel within Auto Home to develop a user-centered IR model. The data collection process involved systematically expanding different models and crawling user information. We extracted characteristic attributes from the “Personal Data” and “Friends” sections, along with explicit and implicit feedback from the “His Word of Mouth” column, ensuring that at least one purchased model was included as valid data. Ultimately, we collected a total of 10,378 user behavior records.
Since providing characteristic attribute information is not mandatory, it was essential to preprocess the collected data to eliminate unusable noise for this recommendation system. We employed Pandas (Python) to remove missing values and duplicates, standardizing the data to ensure compatibility with the IR model. Additionally, we excluded subjective evaluation texts—such as “most satisfied” and “most dissatisfied” comments—from our analysis to enhance the density of the effective information.
After cleaning the data to remove redundancies and standardize formats, we obtained 8057 valid user data entries and 10,186 valid records of purchased models. As illustrated in Figure 3, the data encompass a range of user feature information such as user ID, gender, age, location, fan count, purpose of car purchase, purchase amount, and mileage. They also include explicit feedback like space, power, handling, energy consumption, interior, styling, cost effect, and comfort. Meanwhile, purchasing behavior is treated as implicit feedback.

4.2. IR System Process

By segmenting and encoding elements, the seven attribute labels from the features are converted into categorical data, as shown in Table 1. The attributes are coded from A to G sequentially. Each attribute’s feature elements are numerically encoded according to the sequence shown in Figure 3. This method aims to represent user groups through various element combinations, facilitating the identification of similar groups.
The K-means is employed for clustering the encoded data. A key function of K-means is to group similar data points into clusters, increasing internal consistency within each group after clustering. The significance of clustering lies in the dynamic nature of certain attributes over time, such as age or follower count, which tend to increase. After clustering, the internal distance within subgroups becomes smaller [92], making it more likely for members to exhibit a consistent evolution in preferences. Since recommendation systems are forward-looking [88], clustering helps align the recommendations with the preferences of the group. Moreover, the IR system leverages the fixed association rule to build user portraits, and K-means clustering reduces data complexity. By improving the similarity of data features within each cluster and eliminating irrelevant or noisy data that could hinder rule extraction, the weight of the target association rule is enhanced—for example, the support tends to increase significantly after clustering.
The first step is ascertaining the optimal K value. Rather than assigning K based on industry experience, we employ the silhouette coefficient method to determine a K value that best matches the inherent clustering pattern. Figure 4a displays the curve of the elbow method. It indicates that when K reaches 4, SSE drops significantly as K increases. Hence, the optimal number of clusters has not yet been reached. Beyond K = 4, SSE decline slows, indicating that further increasing K provides diminishing returns and risks over-clustering. Therefore, the data divided into four clusters are considered appropriate. Figure 4b shows that the silhouette coefficient peaks at K = 4 with a value of 0.58, indicating the best ratio of intra-cluster to inter-cluster distance and the optimal clustering effect. Considering both SSE and the silhouette coefficient, the data are divided into four clusters.
By calculating the Euclidean distance of each user to the cluster centroid and performing clustering, the process converges when the loss function SSE reaches its minimum. The demographic attributes of each cluster are shown in Table 2. The number of users decreases from clusters 1 to 4, with 3623, 2410, 1465, and 559 users, respectively. Each cluster exhibits distinct feature information. For instance, female users are primarily found in clusters 1 and 2. Hence, further analysis of female preferences can be conducted by mining data from these clusters. It is worth noting that the four clusters in Table 2 are organized according to the attribute dimensions of user feature information, allowing for discrete elements within the same dimension. The feature information of individual users in the cluster is mainly composed of these elements selectively, and only the multi-dimensional distance between users needs to be minimized to optimize the loss function.
The clustering analysis reveals significant differences in the feature information across the various clusters, highlighting the clear segmentation of user groups. In terms of gender, Clusters 1 and 2 include both male and female users, indicating a balanced distribution of gender across user groups. Regarding age, users in Cluster 1 and Cluster 2 are primarily younger, between 18 and 38 years old, whereas Cluster 3 and Cluster 4 consist of older individuals, mainly between 29 and 58 years old, with Cluster 4 concentrated in the 39–48 age group. This suggests that users of automotive distribution shift with age. The distribution of location shows that users in Cluster 1 and Cluster 2 predominantly reside in first- and second-tier cities, while those in Cluster 3 and Cluster 4 are mostly from third- and fourth-tier cities, reflecting the influence of geographical differences. In terms of followers, Cluster 1 and Cluster 4 tend to have a moderate following (501–10,000), whereas users in Cluster 2 and Cluster 3 exhibit polarization, having either fewer (<500) or significantly more (>10,000) followers. Concerning purpose, Cluster 1 and Cluster 2 users primarily focus on commuting and appointments, while Cluster 3 users lean towards child transportation, and Cluster 4 emphasizes business and transportation purposes. In price preferences, Cluster 1 favors lower-priced vehicles (below 100,000 RMB), whereas Cluster 2 and Cluster 4 prefer higher-priced options (200,000–300,000 RMB or above), and Cluster 3 opts for mid-range prices. In terms of mileage, Cluster 1 and Cluster 4 hold lower-mileage driving experience (<5000 km), while Cluster 3 has been driving at mid to high mileage (5000–15,000 km).
In conclusion, the analysis demonstrates that users in different clusters exhibit distinct characteristics in terms of gender, age, location, fans, purpose, price, and mileage, indicating clear segmentation into different user groups. These feature disparities are representative and reflect the diverse demands and market stratification of automobile users.
Clustering not only reveals the composition of the selected dataset but also applies clustering rules to evaluation attributes. Our team randomly selected two users from the dataset who gave a styling rating of 5, to generate user portraits. Each user’s gender, age, residence, and car purchase purpose were considered as the antecedents of the association rule x, with the 5-point styling score as the consequent y representing explicit feedback. In this context, x represents the antecedent of the association rule, indicating the characteristic attributes of target users, while y represents the consequence, reflecting the user’s satisfaction modeling evaluation. This rule is employed to identify user groups that simultaneously satisfy both the premise and the outcome of the association rule, facilitating the creation of user portraits. Since the fixed association rule is applied simultaneously across three clusters, we set a filtering condition, “The Highest Support with the Minimum Confidence”, to maximize the number of users represented in the user portraits. In the IR system, we designed a low confidence threshold of 0.3. Unlike its typical use in other studies, this minimum confidence threshold primarily serves to filter suitable clusters. Since the association rule is fixed, it is unnecessary to identify association rules with higher confidence within the same cluster. Instead, we compare support among clusters meeting the confidence threshold, selecting the cluster with the highest support as the source for the user portrait. The specific results are detailed in Table 3. With a confidence threshold of 0.3, user portrait 1 is derived from a frequent item set of cluster 1, with 387 users, a support of 0.11, and a confidence of 0.42. User portrait 2 is derived from cluster 2, with 94 users, a support of 0.04, and a confidence of 0.31. The low support corresponds to the characteristics of multi-dimensional user groups: longer antecedents result in more extensive data filtering, leading to a rapid decline in support. However, confidence is significantly higher than support. This is because styling creates an immediate first impression, allowing preferences to be determined without long-term driving experience, which leads many users to quickly choose and highly rate their preferred styling.
In the second phase, the IR system concentrates on the dataset of purchased models. NBPR uses explicit feedback from target users to identify sets of purchased models that match their scoring patterns. Explicit feedback is encoded as H-G, as shown in Table 4. Target User 1, derived from user portrait 1, has a scoring vector of [3, 5, 4, 3, 5, 4, 4]. Target User 2, from user portrait 2, has a vector of [4, 4, 5, 4, 3, 5, 3]. Both users share the same requirement: their scores must be greater than or equal to the existing scores across all evaluation attribute labels. Using a support vector machine, we perform binary classification on the multi-dimensional data, selecting only the positive examples above the hyperplane. After mining, the purchased models from user portrait 1 account for 22% of the total, amounting to 85, while those from user portrait 2 account for 30%, or 28 models. After NBPR processing, the market evaluations of the purchased model sets are no longer diverse but exhibit consistency. These preference-aligned groups provide a foundation for generating a high-quality recommendation list.
The issue of data sparsity, caused by the limited number of purchased models per user, is not addressed when forming item preference triplets u , i , j . Hence, we construct a set of preferred models to enhance the density of meaningful data. Figure 5 illustrates the process of constructing the item preference matrix. In Figure 5a, for a specific user u, only the data in columns i2 (blue) and j2 (green) of the matrix can be used for biased recommendations. The large yellow area indicates missing values, highlighting data sparsity.
To address the missing values, as shown in Figure 5b, the purchased models are broken down into four components: main color, body line, head, and tail. Expert scoring is employed to assess the similarity of these components between non-purchased and purchased models, using a scoring system where 1 indicates “Similar”, 0.5 signifies “Partially Similar”, and 0 represents “Dissimilar”. Specifically, for each user, images of the purchased models are presented at the top of the questionnaire as evaluation criteria for four components. The leftmost column of the table is filled with images of all unsold models in the collection, while the right columns are titled “Main Color”, “Body Line”, “Head”, and “Tail”. A total of 10 designers are invited to score the similarity of each part based on the purchased models. We then calculate the total score for each model and derive the average score from the 10 designers as the similarity score for that model. All models are sorted based on these scores to form a set of preferred models. For the user, the top of the sequence represents the models most similar to their purchased ones, with similarity decreasing as the sequence progresses, reflecting a gradient in user preference.
Figure 5c shows the distribution of the matrix after constructing the preferred model set. A significant reduction in the yellow area indicates that most models are now likely to be included in pairs u , i , j . To mitigate potential overfitting in the recommendation results, models with a similarity score below 3 are considered unpurchased, while 50% of vehicles scoring 3 or above are randomly selected as preferred, and the remaining 50% are classified as unpurchased. This adjustment aims to reasonably increase the number of positive samples in the item preference matrix. The purpose is to diversify the positive samples to enhance the ability to generalize to new data while ensuring sufficient negative samples. Ultimately, the preferred model set is constructed to effectively alleviate the data sparsity, thereby improving the quality of the recommendation sequence.

5. Results and Discussion

The performance of recommendation systems is typically assessed using evaluation metrics that assume users have identifiable preferred products. The quality of a recommendation list is determined by how these preferred products are presented. Although this assumption may not be explicitly mentioned, it is implied in the formulas of these metrics. Unlike this approach, the IR is compiled from multiple lists, indicating that no single preference is shared by the collective group, making traditional evaluation metrics unsuitable for this system. However, the IR system utilizes advanced user portrait and data-processing techniques that incorporate common demand constraints. As a result, a high degree of similarity in features and needs is exhibited by users within this dataset, due to sharing a special association rule and explicit feedback filters. A significantly higher likelihood of similar preferences among these users provides a feasible approach for assessing IR capabilities.
This section presents the results of a multi-user joint evaluation method used to assess the quality of group preference recommendations. During the NBPR recommendation phase, users from preference-aligned groups are selected, and the metrics from Section 3.3.2 are used to assess the TOP-N lists of IR and baseline models. The final group preference recommendations of IR are visually represented through styling images. Given the high degree of similarity within the group and the intuitive recommendation lists, we hypothesized that consistent and robust outcomes would confirm the effectiveness of IR for group preferences.
In the case study, during the stage when the NBPR generates the recommendation list for target user 1, a group of five users was randomly selected to serve as the targets for the baseline models. The evaluation of metrics F1-SCORE and MRR is illustrated in Figure 6. As the recommendation list length increases, the F1-SCORE for both the IR system and the baseline models tends to decline. However, the IR system peaks before dipping, reaching its highest score of 0.49 at a list length of six, surpassing the baseline models, as shown in Figure 6a. Additionally, the F1-SCORE indicates a balance between accuracy and recall. At a list length of three, both User-CF and BPR models score zero, indicating that no purchased models are present. Within the top six recommendations, the IR consistently includes the purchased models and outperforms the baseline models in evaluation scores. This result aligns with the design’s goal of quickly identifying user preferences within a brief list, minimizing the intrusion of non-preferred products. For the MRR metric, the baseline models show similar average rankings for purchased models in their recommendation lists: User-CF averages 0.10, BPR 0.14, and HS-CF 0.12. The IR system scores higher than the baseline for every user, with an average of 0.39, placing the purchased models significantly higher in rank, as shown in Figure 6b.
Due to the inherent scarcity of purchased car models, both precision and recall are likely to decrease as the list length grows, resulting in a gradual decline in MRR for all models. For example, the baseline models fail to include any purchased car models in their TOP-3 recommendations. This indicates a subpar performance when user purchase behavior dictates the preference. In contrast, the IR system mitigates data sparsity caused by limited purchased models by using a preference scoring approach, which enhances the positioning of preferred models and improves F1-SCORE and MRR evaluation outcomes. Additionally, the MRR for multiple users is markedly higher than the baseline, demonstrating the robustness of the IR. As a result, the IR system prioritizes preferred models, creating a high-quality ISAD recommendation list. Preferred models represent user interests, and a higher recommendation priority signifies the ability to accurately capture user preferences and suggest similar items. These strengths are crucial for identifying ISAD that aligns with user group preferences.
The results of the HR and MAP evaluation metrics are illustrated in Figure 7. At a recommendation list length of three, both the User-CF and BPR models scored zero, while the HS-CF model achieved a score of 0.2. At a list length of fifteen, the User-CF model scored 0.8, with the BPR and HS-CF models reaching a perfect score of 1. This indicates that as the list length increases, the baseline models begin to include the purchased models in their recommendations. In contrast, the IR system maintained a consistent score of 1 from a list length of three to fifteen, indicating that it successfully identifies the purchased models even at shorter list lengths, as depicted in Figure 7a–d. Regarding the distribution of average precision across multiple recommendation lists, the IR system shows a decreasing trend, in contrast to the increasing trend of the baseline model. Moreover, MAP scores of the IR consistently outperform those of the baseline models. Thus, the IR system is more efficient at identifying purchased models in shorter recommendation lists, as shown in Figure 7e–h.
The results from Figure 7 facilitate the discussion on how the positioning of purchased models within recommendation lists affects ISAD quality. The baseline models rely on user feature information (CF), implicit feedback (BPR), and a combination of feature and explicit feedback (HS-CF) for generating recommendations. Each model only considers a segment of user behavior data, limiting both the depth of preference mining and the likelihood of purchased car models appearing higher in the list. In contrast, the IR system is constructed on the principle of synergistic mining of user features, explicit feedback, and implicit feedback, using a tiered approach to recommend preferred car models. This method maximizes the utilization of user behavior data and provides a deep understanding of user preferences, leading to a higher placement of purchased models in recommendations. The MAP, which is sensitive to the positioning of purchased models, clearly demonstrates this effect. Due to the scarcity of purchased models, their ranking significantly affects the peak of the MAP curve. Hence, an inverse trend also evidences the performance of IR with high-quality ISAD positioned more prominently.
The previous study focused on comparing the IR system with the baseline model, highlighting performance differences between group and individual recommendations. Next, we turn our attention to analyzing how the characteristics of user behavioral data influence the IR system itself. Figure 8 illustrates the effect of recommendation performance based on different preference-aligned group sizes. User portraits 1 and 2, derived from the case study, form preference-aligned groups of 85 and 28, respectively, after applying common demand constraints, as detailed in Table 4. Combined with the previously studied sample users, we extracted five sample users from user portrait 2 and evaluated the recommendation performance for a total of 10 users.
As depicted in Figure 8a, the F1-SCORE for both user portraits 1 and 2 decreases as the length of the recommendation list increases. However, user portrait 1 consistently exhibits significantly higher scores than user portrait 2 up to TOP-6. This indicates that larger user groups deliver better precision and recall within the first six recommendations, whereas smaller user groups substantially diminish recommendation performance, resulting in a zero score for user portrait 2 at TOP-3. Interestingly, by TOP-15, the F1-SCORE performances of both portraits become comparable, with the smaller group outperforming the larger one at recommendation list lengths of 9 and 15. This phenomenon can be attributed to the sparsity of individual user purchase data. Figure 8b shows that the case user from user profile 2 has a total of nine purchased models, exceeding the number for user profile 1. As the recommendation list expands to include all purchased models, the proportion of purchased models for user profile 2 changes more gradually, leading to a slower decline in this metric and ultimately resulting in comparable metrics. Therefore, the F1-SCORE alone does not adequately capture the influence of group size, highlighting the need for additional metrics to achieve a more comprehensive analysis.
Figure 8b illustrates the MRR values for different sample users, with the numbers indicating the total purchased car models owned by each user. User portrait 1 shows MRR values of 0.426, 0.500, 0.333, 0.500, and 0.261, yielding a mean of 0.404. In contrast, user portrait 2 presents values of 0.196, 0.200, 0.125, 0.183, and 0.121, resulting in a mean of 0.165. The significantly higher MRR values for user portrait 1 indicate that the data for this portrait are better distributed within higher positions of the recommendation list.
Figure 8c examines the hit rates of both user portraits. As the length of the recommendation list increases, the hit rate for the smaller preference-aligned group gradually approaches 1, while the hit rate for the larger group remains consistently at 1. Figure 8d presents MAP metrics for both groups, revealing opposite trends that converge as the recommendation lists become sufficiently long. Overall, user portrait 1 consistently outperforms user portrait 2 in terms of MAP.
Figure 8c,d demonstrate that significant differences in recommendation performance metrics arise with shorter recommendation lists, converging as the list length increases. The results across the four metrics indicate that larger preference-aligned groups exhibit superior recommendation capabilities. This improvement is attributed to the increased learning capacity of the IR system, which enhances the weights of common preference features and mitigates the negative effects of data sparsity [93], thereby enabling more accurate identification of user preferences and generating higher-value recommendations.
Moreover, the volume of user behavioral data similarly influences performance. The IR system effectively filters out noisy data and refines user preferences, creating a strong positive correlation between the number of preference-aligned groups and the volume of user behavioral data. A sufficiently large dataset facilitates the formation of numerous preference-aligned groups, which enhances the accuracy of recommendations. To prevent cold-start issues in the target user group and ensure accurate recommendation results, the IR system must be applied during the data collection phase to gather adequate effective data.
Ultimately, the IR system synthesizes all user recommendations, generating a TOP-N list catered to group preferences. This list comprises models with a hierarchy of priority, representing the ISAD. To demonstrate how recommendation lists inspire designers, the group preference recommendations are rendered into a TOP-5 list. The left column displays the purchased models of two target users in the case study, while the right column presents the IR recommendations, as detailed in Table 5. It is worth noting that this list presents a complete car shape rather than specific design elements. This aligns with the design purpose of the IR system, which is to provide a reliable and effective source of inspiration for designers, ensuring that new designs reflect the common preferences of the target group.
The TOP-5 list includes highly popular models from the user matrix, reflecting the consensus on car styling among users with similar preferences. If a designer wishes to discover the preferred models of a particular user group, they can input the relevant feature information into the trained IR system as the antecedent X of the association rule, generating a TOP-N recommendation list. The models in this list are those that the group has actually purchased and positively evaluated for their design, having gained widespread recognition. This method streamlines the otherwise tedious task of designers searching the web for inspiration, offering a reliable and efficient source.
Moreover, as the list evolves, the diversity of recommended car designs increases, helping designers explore potential design evolution paths aligned with user preferences. It also enables them to assess how well new designs match these preferences and even predict potential market reactions to new models. The length of the suggestion list can be customized based on the scenario, ensuring that the list remains neither too long nor overly unilateral. Overall, the IR system allows designers to quickly grasp the preferences of user groups and the specific design features they favor, inspiring valuable insights and innovative solutions for automobile design through the recommended visual references.
Using a multi-user joint evaluation approach, this study scrutinized the capabilities of the IR system, with users serving as subjects. The baseline models and evaluation metrics demonstrate that the IR system provides a robust performance across multiple users and prioritizes recommending ISAD that aligns with user preferences in the TOP-N lists. When considering the group to which these users belong, the group demonstrates a high degree of structural similarity and a strong capacity to inspire through the recommendation list. The objective results from the evaluation metrics, combined with the subjective assessment of the recommendation list, suggest that the IR system effectively recommends preferences for groups, providing designers with high-quality ISAD.
It is worth noting that this manuscript focuses on automobile styling design; therefore, we only consider users with high scores in this area when determining association rules, forming a recommendation list under this constraint. However, this method can be extended to various aspects of automobile design. By modifying the association rule, we can identify automotive users who prioritize specific characteristics (e.g., fuel consumption or power), thus informing users’ preferred design solution and generating the inspiration for related design elements. From this perspective, the method has broad application potential beyond automobile modeling and merits further exploration.

6. Conclusions

This manuscript introduces an inspiration recommendation system aimed at group preference-driven styling innovation, addressing the challenges of aimlessness and the scarcity of high-quality ISAD. The system utilizes car user behavior datasets, employing a tiered mining strategy to recommend preferred car models. It starts with clustering based on user feature similarity and explicit feedback to create a user portrait, establishing a cohort of users with high styling scores. The system then identifies preference-aligned groups, constructs a set of preferred models using expert scoring, and forms an ISAD list that reflects group preferences based on the item preference matrix. The IR system has been validated by our team, with its performance compared against baseline models—User-CF, HS-CF, and BPR—using the same target user information. Evaluation metrics such as F1-SCORE, MRR, HR, and MAP confirm that the IR system can prioritize ISAD that aligns with user preferences in higher positions on the list. Furthermore, it demonstrates a robust recommendation performance for group preferences. This research addresses the challenges of data sparsity in car user purchases and the difficulty of identifying common needs within groups through human-computer interaction.
The contributions of this study are as follows:
(1)
We construct an IR system tailored for automobile modeling design, addressing the gap in user group preference recommendation models for durable goods. By analyzing user behavior data, this approach uncovers high-density preference clusters of cars among similar groups, mitigating the sparsity of car purchase behavior. This offers a novel avenue for designers to draw inspiration.
(2)
This manuscript proposes a method for identifying users with similar preferences. By systematically cleaning user characteristic attribute information and explicit feedback data, we achieve the personalized identification of users with shared preferences. This lays the groundwork for the broader application of group preference recommendation systems.
(3)
We propose a multi-user joint evaluation method to assess the quality of a single user’s TOP-N list using multiple dimensions of indicators. This approach allows us to verify the recommendation accuracy of the IR system, providing an effective evaluation framework for group preference recommendations.
The IR system presents several opportunities for optimization. To reduce the subjective biases related to expert scoring, we can incorporate Convolutional Neural Networks (CNNs) into local feature recognition, enhancing clustering and scoring precision. The recommendation accuracy can be significantly improved by integrating deep learning techniques like CNNs. We focus solely on user behavior data from the automotive home in this manuscript. However, it will be essential to explore the most suitable clustering methods for multimodal user behavior data in future research, as different data modalities may require more specialized approaches to effectively capture complex user preferences. Additionally, the preferences of experienced designers, which hold significant weight, could be incorporated into the IR system to synthesize all inspiration factors into novel ISAD. Our team has initiated related explorations to enhance the performance of the IR system. In brief, this research unlocks the potential of group preferences in the human element and expands the application scope of recommendation systems in decision support systems. This research bridges interactive computing between user behavior data and product design, offering valuable insights for applying modern computer systems in automotive design.

Author Contributions

Conceptualization, L.L.; formal analysis, W.C.; methodology, W.C.; supervision, M.Y. and L.L.; writing—original draft, W.C. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by the Guizhou Science and Technology Plan Project, grant number [ZK[2021]055]; the National Natural Science Foundation of China, grant number [52465024]; the Guizhou provincial Basic Research Program (Natural Science), grant number[ZK[2023]081]; the Guizhou Provincial Key Technology R&D Program, grant number [[2024]131].

Data Availability Statement

The datasets used and analyzed during the current study are available from the corresponding author on reasonable request.

Conflicts of Interest

The authors declare no conflicts of interest.

References

  1. Li, M.D.; Lou, S.H.; Zheng, H.; Feng, Y.X.; Gao, Y.C.; Zeng, S.Y.; Tan, J.R. A cognitive analysis-based key concepts derivation approach for product design. Expert Syst. Appl. 2024, 236, 121289. [Google Scholar] [CrossRef]
  2. Yang, K.; Li, Y.; Xiong, Y.; Yan, J.-Y.; Na, H.-Z. A model for computer-aided creative design based on cognition and iteration. Proc. Inst. Mech. Eng. Part C J. Mech. Eng. Sci. 2016, 230, 3470–3487. [Google Scholar] [CrossRef]
  3. Lou, S.H.; Feng, Y.X.; Gao, Y.C.; Zheng, H.; Peng, T.; Tan, J.R. A function-behavior mapping approach for product conceptual design inspired by memory mechanism. Adv. Eng. Inf. 2023, 58, 102236. [Google Scholar] [CrossRef]
  4. Setchi, R.; Tang, Q.; Stankov, I. Semantic-based information retrieval in support of concept design. Adv. Eng. Inf. 2011, 25, 131–146. [Google Scholar] [CrossRef]
  5. Robinson, M.A. An empirical analysis of engineers’ information behaviors. J. Am. Soc. Inf. Sci. Technol. 2010, 61, 640–658. [Google Scholar] [CrossRef]
  6. Ma, M.Y.; Chen, C.W.; Chang, Y.M. Using Kano model to differentiate between future vehicle-driving services. Int. J. Ind. Ergon. 2019, 69, 142–152. [Google Scholar] [CrossRef]
  7. Li, Z.; Tian, Z.G.; Wang, J.W.; Wang, W.M.; Huang, G.Q. Dynamic mapping of design elements and affective responses: A machine learning based method for affective design. J. Eng. Des. 2018, 29, 358–380. [Google Scholar] [CrossRef]
  8. Wu, X.Y.; Hong, Z.X.; Feng, Y.X.; Li, M.D.; Lou, S.H.; Tan, J.R. A semantic analysis-driven customer requirements mining method for product conceptual design. Sci. Rep. 2022, 12, 10139. [Google Scholar] [CrossRef]
  9. Li, Y.; Wang, J.; Li, X.; Zhao, W. Design creativity in product innovation. Int. J. Adv. Manuf. Technol. 2007, 33, 213–222. [Google Scholar] [CrossRef]
  10. Hou, X.; Gou, B.; Chen, D.; Chu, J. A semantic data-driven knowledge base construction method to assist designers in design inspiration based on traditional motifs. Adv. Eng. Inf. 2023, 56, 101987. [Google Scholar] [CrossRef]
  11. Saniuk, S.; Grabowska, S.; Gajdzik, B. Personalization of products in the industry 4.0 concept and its impact on achieving a higher level of sustainable consumption. Energies 2020, 13, 5895. [Google Scholar] [CrossRef]
  12. Cintia Ganesha Putri, D.; Leu, J.S.; Seda, P. Design of an unsupervised machine learning-based movie recommender system. Symmetry 2020, 12, 185. [Google Scholar] [CrossRef]
  13. Wang, C.D.; Deng, Z.H.; Lai, J.H.; Philip, S.Y. Serendipitous recommendation in e-commerce using innovator-based collaborative filtering. IEEE Trans. Cybern. 2018, 49, 2678–2692. [Google Scholar] [CrossRef] [PubMed]
  14. Wang, C.; Zheng, Y.F.; Jiang, J.H.; Ren, K. Toward privacy-preserving personalized recommendation services. Engineering 2018, 4, 21–28. [Google Scholar] [CrossRef]
  15. Wang, C.H. Incorporating user preferences and performance ratings into multi-functional tablet design and recommendation. Int. J. Ind. Eng. Theory Appl. Pract. 2020, 27, 308–320. [Google Scholar]
  16. Chen, B.Y.; Hu, X.B.; Huo, Y.L.; Deng, X. Research on Recommendation Method of Product Design Scheme Based on Multi-Way Tree and Learning-to-Rank. Machines 2020, 8, 30. [Google Scholar] [CrossRef]
  17. Zhang, Z.N.; Liu, L.; Wei, W.; Tao, F.; Li, T.M.; Liu, A. A systematic function recommendation process for data-driven product and service design. J. Mech. Des. 2017, 139, 111404. [Google Scholar] [CrossRef]
  18. Ali, S.; Wang, G.j.; Riaz, S. Aspect based sentiment analysis of ridesharing platform reviews for kansei engineering. IEEE Access 2020, 8, 173186–173196. [Google Scholar] [CrossRef]
  19. Jia, R.; Li, R.; Yu, M.; Wang, S. E-commerce purchase prediction approach by user behavior data. In Proceedings of the 2017 International Conference on Computer, Information and Telecommunication Systems (CITS), Dalian, China, 21–23 July 2017; pp. 1–5. [Google Scholar]
  20. Liu, L.F.; Li, Y.; Xiong, Y.; Cao, J.; Yuan, P. An EEG study of the relationship between design problem statements and cognitive behaviors during conceptual design. AI EDAM 2018, 32, 351–362. [Google Scholar] [CrossRef]
  21. Su, L.W.; Yang, Q.Y.; Chen, H.Y.; Zeng, X.F.; Hu, R.L. Design of Marketing Recommendation System for Power Grid Enterprises Based on User Behavior data. Tech. Autom. Appl. 2023, 42, 153–156. [Google Scholar]
  22. Lazer, D.; Brewer, D.; Christakis, N.; Fowler, J.; King, G. Life in the network: The coming age of computational social. Science 2009, 323, 721–723. [Google Scholar] [CrossRef] [PubMed]
  23. Thamaraiselvi, G.; Kaliammal, A. Data mining: Concepts and techniques. SRELS J. Inf. Manag. 2004, 41, 339–348. [Google Scholar]
  24. Rodden, K.; Leggett, M. Best of both worlds: Improving gmail labels with the affordances of folders. In CHI’10 Extended Abstracts on Human Factors in Computing Systems; ACM: New York, NY, USA, 2010; pp. 4587–4596. [Google Scholar]
  25. Adar, E.; Teevan, J.; Dumais, S.T. Large scale analysis of web revisitation patterns. In Proceedings of the SIGCHI Conference on Human Factors in Computing Systems, Florence, Italy, 5–10 April 2008; pp. 1197–1206. [Google Scholar]
  26. Jung, S.; Herlocker, J.L.; Webster, J. Click data as implicit relevance feedback in web search. Inf. Process. Manag. 2007, 43, 791–807. [Google Scholar] [CrossRef]
  27. Jawaheer, G.; Szomszor, M.; Kostkova, P. Comparison of implicit and explicit feedback from an online music recommendation service. In Proceedings of the 1st International Workshop on Information Heterogeneity and Fusion in Recommender Systems, Barcelona, Spain, 26 September 2010; pp. 47–51. [Google Scholar]
  28. Claypool, M.; Brown, D.; Le, P.; Waseda, M. Inferring user interest. IEEE Internet Comput. 2001, 5, 32–39. [Google Scholar] [CrossRef]
  29. Buscher, G.; White, R.W.; Dumais, S.; Huang, J. Large-scale analysis of individual and task differences in search result page examination strategies. In Proceedings of the Fifth ACM International Conference on Web Search and Data Mining, Seattle, WA, USA, 8–12 February 2012; pp. 373–382. [Google Scholar]
  30. Balakrishnan, V.; Ahmadi, K.; Ravana, S.D. Improving retrieval relevance using users’ explicit feedback. Aslib J. Inf. Manag. 2016, 68, 76–98. [Google Scholar] [CrossRef]
  31. Moon, H.; Park, J.; Kim, S. The importance of an innovative product design on customer behavior: Development and validation of a scale. J. Prod. Innov. Manag. 2015, 32, 224–232. [Google Scholar] [CrossRef]
  32. Sun, H.; Guo, W.; Shao, H.Y.; Rong, B. Dynamical mining of ever-changing user requirements: A product design and improvement perspective. Adv. Eng. Inform. 2020, 46, 101174. [Google Scholar] [CrossRef]
  33. Chien, C.F.; Kerh, R.; Lin, K.Y.; Yu, A.P.I. Data-driven innovation to capture user-experience product design: An empirical study for notebook visual aesthetics design. Comput. Ind. Eng. 2016, 99, 162–173. [Google Scholar] [CrossRef]
  34. Gao, J.; Yao, Y.l.; Zhu, V.C.; Sun, L.Y.; Lin, L. Service-oriented manufacturing: A new product pattern and manufacturing paradigm. J. Intell. Manuf. 2011, 22, 435–446. [Google Scholar] [CrossRef]
  35. Zhao, X. Research on the Application of Interconnection Thinking in Product Design Innovation. Design 2023, 8, 79–84. [Google Scholar] [CrossRef]
  36. Alfarhood, S.; Alfarhood, M. CAML: A Context-Aware Metric Learning approach for improved recommender systems. Alex. Eng. J. 2024, 100, 53–60. [Google Scholar] [CrossRef]
  37. Van Meteren, R.; Van Someren, M. Using content-based filtering for recommendation. In Proceedings of the Machine Learning in the New Information Age: MLnet/ECML2000 Workshop, Barcelona, Spain, 30 May 2000; pp. 47–56. [Google Scholar]
  38. Iyengar, S.S.; Lepper, M.R. When choice is demotivating: Can one desire too much of a good thing? J. Personal. Soc. Psychol. 2000, 79, 995. [Google Scholar] [CrossRef] [PubMed]
  39. Loeb, S.; Terry, D. Information filtering. Commun. ACM 1992, 35, 26–28. [Google Scholar] [CrossRef]
  40. Cantador, I.; Fernández, M.; Vallet, D.; Castells, P.; Picault, J.; Ribiere, M. A multi-purpose ontology-based approach for personalised content filtering and retrieval. Adv. Semant. Media Adapt. Pers. 2008, 93, 25–51. [Google Scholar]
  41. Salter, J.; Antonopoulos, N. CinemaScreen recommender agent: Combining collaborative and content-based filtering. IEEE Intell. Syst. 2006, 21, 35–41. [Google Scholar] [CrossRef]
  42. Iwahama, K.; Hijikata, Y.; Nishida, S. Content-based filtering system for music data. In Proceedings of the 2004 International Symposium on Applications and the Internet Workshops, 2004 Workshops, Tokyo, Japan, 26–30 January 2004; pp. 480–487. [Google Scholar]
  43. Im, I.; Hars, A. Does a one-size recommendation system fit all? the effectiveness of collaborative filtering based recommendation systems across different domains and search modes. ACM Trans. Inf. Syst. 2007, 26, 4. [Google Scholar] [CrossRef]
  44. Cai, Y.; Leung, H.F.; Li, Q.; Min, H.; Tang, J.; Li, J. Typicality-based collaborative filtering recommendation. IEEE Trans. Knowl. Data Eng. 2013, 26, 766–779. [Google Scholar] [CrossRef]
  45. Liu, L.w.; Lecue, F.; Mehandjiev, N. Semantic content-based recommendation of software services using context. ACM Trans. Web 2013, 7, 1–20. [Google Scholar] [CrossRef]
  46. He, J.; Li, X.; Liao, L.j. Next point-of-interest recommendation via a category-aware Listwise Bayesian Personalized Ranking. J. Comput. Sci. 2018, 28, 206–216. [Google Scholar] [CrossRef]
  47. De Campos, L.M.; Fernández-Luna, J.M.; Huete, J.F.; Rueda-Morales, M.A. Combining content-based and collaborative recommendations: A hybrid approach based on Bayesian networks. Int. J. Approx. Reason. 2010, 51, 785–799. [Google Scholar] [CrossRef]
  48. Basilico, J.; Hofmann, T. Unifying collaborative and content-based filtering. In Proceedings of the Twenty-First International Conference on Machine Learning, Banff, AB, Canada, 4–8 July 2004; p. 9. [Google Scholar]
  49. Esteban, A.; Zafra, A.; Romero, C. Helping university students to choose elective courses by using a hybrid multi-criteria recommendation system with genetic optimization. Knowl.-Based Syst. 2020, 194, 105385. [Google Scholar] [CrossRef]
  50. Walek, B.; Fojtik, V. A hybrid recommender system for recommending relevant movies using an expert system. Expert Syst. Appl. 2020, 158, 113452. [Google Scholar] [CrossRef]
  51. Zhang, X.F.; Liu, H.J.; Chen, X.Y.; Zhong, J.B.; Wang, D. A novel hybrid deep recommendation system to differentiate user’s preference and item’s attractiveness. Inf. Sci. 2020, 519, 306–316. [Google Scholar] [CrossRef]
  52. Amato, F.; Moscato, V.; Picariello, A.; Piccialli, F. SOS: A multimedia recommender system for online social networks. Future Gener. Comput. Syst. 2019, 93, 914–923. [Google Scholar] [CrossRef]
  53. Sharma, S.; Koehl, L.; Bruniaux, P.; Zeng, X.; Wang, Z. Development of an intelligent data-driven system to recommend personalized fashion design solutions. Sensors 2021, 21, 4239. [Google Scholar] [CrossRef]
  54. Chen, D.l.; Cheng, P.P. Development of design system for product pattern design based on Kansei engineering and BP neural network. Int. J. Cloth. Sci. Technol. 2022, 34, 335–346. [Google Scholar] [CrossRef]
  55. Papagelis, M.; Plexousakis, D.; Kutsuras, T. Alleviating the sparsity problem of collaborative filtering using trust inferences. In Proceedings of the International Conference on Trust Management, Paris, France, 23–26 May 2005; pp. 224–239. [Google Scholar]
  56. Wang, D.; Du, R.; Yang, Q.; Yu, D.; Wan, F.; Gong, X.; Xu, G.; Deng, S. Category-aware self-supervised graph neural network for session-based recommendation. World Wide Web 2024, 27, 61. [Google Scholar] [CrossRef]
  57. Suryadi, F.M.; Baizal, Z. Ontology-based Car Recommender System Using Functional Requirements Interaction. In Proceedings of the 2023 International Conference on Advancement in Data Science, E-Learning and Information System (ICADEIS), Bali, Indonesia, 2–3 August 2023; pp. 1–6. [Google Scholar]
  58. Sawalkar, M.M.; Kumbhar, R.; Jamkar, K.; Mandlik, M.; Patil, H. Vehicle Insurance Recommendation System. Int. J. Sci. Res. Comput. Sci. Eng. Inf. Technol. 2022, 8, 584–589. [Google Scholar] [CrossRef]
  59. Alabduljabbar, R.; Alghamdi, M.; Alshamlan, H. Personalized Car Recommendations Using Knowledge-Based Methods. In Proceedings of the 2023 Intelligent Methods, Systems, and Applications (IMSA), Giza, Egypt, 15–16 July 2023; pp. 539–544. [Google Scholar]
  60. Khalid, H.; Dangelmaier, M.; Lim, T. The CATER approach to vehicle mass customization. In Proceedings of the 2007 IEEE International Conference on Industrial Engineering and Engineering Management, Singapore, 2–5 December 2007; pp. 1273–1276. [Google Scholar]
  61. Tan, H.; Zhu, Y.; Zhao, J. Development of an automotive user interface design knowledge system. In Proceedings of the 4th International Conference on Automotive User Interfaces and Interactive Vehicular Applications, Seoul, Republic of Korea, 17–20 September 2022; pp. 201–208. [Google Scholar]
  62. Sivaramakrishnan, A.; Krishnamachari, M.; Balasubramanian, V. Recommending customizable products: A multiple choice knapsack solution. In Proceedings of the 5th International Conference on Web Intelligence, Mining and Semantics, Larnaca, Cyprus, 13–15 July 2015; pp. 1–10. [Google Scholar]
  63. Fedullo, T.; Morato, A.; Tramarin, F.; Cattini, S.; Rovati, L. Artificial intelligence-based measurement systems for automotive: A comprehensive review. In Proceedings of the 2022 IEEE International Workshop on Metrology for Automotive (Metroautomotive), Modena, Italy, 4–6 July 2022; pp. 122–127. [Google Scholar]
  64. Burnap, A.; Hartley, J.; Pan, Y.; Gonzalez, R.; Papalambros, P.Y. Balancing design freedom and brand recognition in the evolution of automotive brand styling. Des. Sci. 2016, 2, e9. [Google Scholar] [CrossRef]
  65. Buxton, W.; Fitzmaurice, G.; Balakrishnan, R.; Kurtenbach, G. Large displays in automotive design. IEEE Comput. Graph. Appl. 2000, 20, 68–75. [Google Scholar] [CrossRef]
  66. Mougenot, C.; Bouchard, C.; Aoussat, A.; Westerman, S. Inspiration, images and design: An investigation of designers’ information gathering strategies. J. Des. Res. 2008, 7, 331–351. [Google Scholar] [CrossRef]
  67. Eckert, C.; Stacey, M.; Clarkson, P. Algorithms and inspirations: Creative reuse of design experience. In Proceedings of the Greenwich 2000 International Symposium: Digital Creativity; University of Greenwich: London, UK; pp. 1–10.
  68. Cooper, R.G.; Kleinschmidt, E.J. New products: What separates winners from losers? J. Prod. Innov. Manag. 1987, 4, 169–184. [Google Scholar] [CrossRef]
  69. Reinertsen, D.G.; Smith, P.G. The strategist’s role in shortening product development. J. Bus. Strategy 1991, 12, 18–22. [Google Scholar] [CrossRef] [PubMed]
  70. Gick, M.L.; Holyoak, K.J. Analogical problem solving. Cogn. Psychol. 1980, 12, 306–355. [Google Scholar] [CrossRef]
  71. Chowdhury, H.; Islam, R.; Hussein, M.; Zaid, M.; Loganathan, B.; Alam, F. Design of an energy efficient car by biomimicry of a boxfish. Energy Procedia 2019, 160, 40–44. [Google Scholar] [CrossRef]
  72. Moulson, T.; Sproles, G. Styling strategy. Bus. Horiz. 2000, 43, 45–52. [Google Scholar] [CrossRef]
  73. Smeulders, A.W.; Worring, M.; Santini, S.; Gupta, A.; Jain, R. Content-based image retrieval at the end of the early years. IEEE Trans. Pattern Anal. Mach. Intell. 2000, 22, 1349–1380. [Google Scholar] [CrossRef]
  74. Nagamachi, M. Kansei engineering: A new ergonomic consumer-oriented technology for product development. Int. J. Ind. Ergon. 1995, 15, 3–11. [Google Scholar] [CrossRef]
  75. Schütte, S.T.; Eklund, J.; Axelsson, J.R.; Nagamachi, M. Concepts, methods and tools in Kansei engineering. Theor. Issues Ergon. Sci. 2004, 5, 214–231. [Google Scholar] [CrossRef]
  76. Chang, H.C.; Lai, H.H.; Chang, Y.M. Expression modes used by consumers in conveying desire for product form: A case study of a car. Int. J. Ind. Ergon. 2006, 36, 3–10. [Google Scholar] [CrossRef]
  77. Du, J.; Li, Y.; Ma, J.l.; Xiong, Y.; Li, W. Retrieval of Semantic-Based Inspirational Sources for Emotional Design. Comput. Intell. Neurosci. 2018, 2018, 17. [Google Scholar] [CrossRef] [PubMed]
  78. Hou, X.; Gou, B.; Chen, D.; Chu, J.; Ding, N.; Ma, L. A method to assist designers in optimizing the exterior styling of vehicles based on key features. Expert Syst. Appl. 2024, 254, 124485. [Google Scholar] [CrossRef]
  79. Zhao, Y. Research on the application of university teaching management evaluation system based on Apriori algorithm. Proc. J. Phys. Conf. Ser. 2021, 1883, 012033. [Google Scholar] [CrossRef]
  80. Et-Taleby, A.; Boussetta, M.; Benslimane, M. Faults Detection for Photovoltaic Field Based on K-Means, Elbow, and Average Silhouette Techniques through the Segmentation of a Thermal Image. Int. J. Photoenergy 2020, 2020, 6617597. [Google Scholar] [CrossRef]
  81. Kodinariya, T.M.; Makwana, P.R. Review on determining number of Cluster in K-Means Clustering. Int. J. Adv. Res. Comput. Sci. Manag. Stud. 2013, 1, 90–95. [Google Scholar]
  82. Agrawal, R.; Imieliński, T.; Swami, A. Mining association rules between sets of items in large databases. In Proceedings of the 1993 ACM SIGMOD International Conference on Management of Data, Washington, DC, USA, 26–28 May 1993; pp. 207–216. [Google Scholar]
  83. Aher, S.B.; Lobo, L. Combination of machine learning algorithms for recommendation of courses in E-Learning System based on historical data. Knowl.-Based Syst. 2013, 51, 1–14. [Google Scholar] [CrossRef]
  84. Yacouby, R.; Axman, D. Probabilistic Extension of Precision, Recall, and F1 Score for More Thorough Evaluation of Classification Models. In Proceedings of the First Workshop on Evaluation Comparison of NLP Systems 2020, Online, 20 November 2020. [Google Scholar]
  85. Shi, Y.; Karatzoglou, A.; Baltrunas, L.; Larson, M.; Oliver, N.; Hanjalic, A. CLiMF: Learning to maximize reciprocal rank with collaborative less-is-more filtering. In Proceedings of the Sixth ACM Conference on Recommender Systems, Dublin, Ireland, 9–13 September 2012; pp. 139–146. [Google Scholar]
  86. He, X.N.; Chen, T.; Kan, M.Y.; Chen, X. TriRank: Review-aware Explainable Recommendation by Modeling Aspects. In Proceedings of the 24th ACM International on Conference on Information and Knowledge Management, Melbourne, Australia, 18–23 October 2015; pp. 1661–1670. [Google Scholar]
  87. Liu, T.Y. Learning to rank for information retrieval. Found. Trends Inf. Retr. 2009, 3, 225–331. [Google Scholar] [CrossRef]
  88. Jalali, M.; Mustapha, N.; Sulaiman, M.N.; Mamat, A. WebPUM: A Web-based recommendation system to predict user future movements. Expert Syst. Appl. 2010, 37, 6201–6212. [Google Scholar] [CrossRef]
  89. Ko, H.; Lee, S.; Park, Y.; Choi, A. A survey of recommendation systems: Recommendation models, techniques, and application fields. Electronics 2022, 11, 141. [Google Scholar] [CrossRef]
  90. Li, D.; Li, M.; Han, G.; Li, T. A combined deep learning method for internet car evaluation. Neural Comput. Appl. 2021, 33, 4623–4637. [Google Scholar] [CrossRef]
  91. Wu, P.; Tang, T.; Zhou, L.; Martínez, L. A decision-support model through online reviews: Consumer preference analysis and product ranking. Inf. Process. Manag. 2024, 61, 103728. [Google Scholar] [CrossRef]
  92. Rahmani, M.K.I.; Pal, N.; Arora, K. Clustering of image data using K-means and fuzzy K-means. Int. J. Adv. Comput. Sci. Appl. 2014, 5, 160–163. [Google Scholar]
  93. Hu, Y.; Shi, W.; Li, H.; Hu, X. Mitigating data sparsity using similarity reinforcement-enhanced collaborative filtering. ACM Trans. Internet Technol. 2017, 17, 1–20. [Google Scholar] [CrossRef]
Figure 1. Overview of IR system framework.
Figure 1. Overview of IR system framework.
Systems 12 00491 g001
Figure 2. The NBPR recommendation process.
Figure 2. The NBPR recommendation process.
Systems 12 00491 g002
Figure 3. Attributes and attribute elements contained in user behavior data.
Figure 3. Attributes and attribute elements contained in user behavior data.
Systems 12 00491 g003
Figure 4. The best choice of clustering k. (a) The plot of the sum of the squared errors about the value of K. (b) The plot of the silhouette coefficient about the value of K.
Figure 4. The best choice of clustering k. (a) The plot of the sum of the squared errors about the value of K. (b) The plot of the silhouette coefficient about the value of K.
Systems 12 00491 g004
Figure 5. Schematic diagram of the results of constructing the item preference matrix: (a) Item preference matrix (u, i, j) when there is only one purchased model. (b) Process for quantifying preferences and constructing preferred model set through expert scoring. (c) Item preference matrix (u, i, j) with the preferred model set.
Figure 5. Schematic diagram of the results of constructing the item preference matrix: (a) Item preference matrix (u, i, j) when there is only one purchased model. (b) Process for quantifying preferences and constructing preferred model set through expert scoring. (c) Item preference matrix (u, i, j) with the preferred model set.
Systems 12 00491 g005
Figure 6. The results of evaluation metrics F1-SCORE and MRR: (a) Histogram of F1-SCORE on the recommendation list length. (b) Line graph of MRR on the IR system and baseline models.
Figure 6. The results of evaluation metrics F1-SCORE and MRR: (a) Histogram of F1-SCORE on the recommendation list length. (b) Line graph of MRR on the IR system and baseline models.
Systems 12 00491 g006
Figure 7. The results of evaluation metrics HR and MAR. (ad) Line graphs of HR of IR system and baseline model on the recommendation list length. (eh) Line graphs of MAP of IR system and baseline model on the recommendation list length.
Figure 7. The results of evaluation metrics HR and MAR. (ad) Line graphs of HR of IR system and baseline model on the recommendation list length. (eh) Line graphs of MAP of IR system and baseline model on the recommendation list length.
Systems 12 00491 g007
Figure 8. The performance of the IR system on different preference-aligned group sizes. (a) Histogram of F1-SCORE on the recommendation list length under user portraits 1 and 2. (b) Line graph of MRR of user portraits 1 and 2 on different sampling users. (c,d) Lines graph of HR and MAP of user portraits 1 and 2 on the recommendation list length.
Figure 8. The performance of the IR system on different preference-aligned group sizes. (a) Histogram of F1-SCORE on the recommendation list length under user portraits 1 and 2. (b) Line graph of MRR of user portraits 1 and 2 on different sampling users. (c,d) Lines graph of HR and MAP of user portraits 1 and 2 on the recommendation list length.
Systems 12 00491 g008
Table 1. Element encoding of user feature information set.
Table 1. Element encoding of user feature information set.
UserFeature Information Label
ABCDEFG
11222121
21211353
31122122
42124542
80551311133
80562123122
80571231235
Table 2. Representative results of data clustering by applying K-means.
Table 2. Representative results of data clustering by applying K-means.
Attribute LabelCluster 1 (3623)Cluster 2 (2410)Cluster 3 (1465)Cluster 4 (559)
GenderMale, FemaleMale, FemaleMaleMale
Age18–28, 29–3818–28, 39–4829–38, 49–5839–48
LocationFirst-tier, Second-tierFirst-tier, Fourth-tierThird-tier, ElseThird-tier, Fourth-tier
Fans501–2000, 5001–10,000<500, >10,000<5002001–5000
PurposeCommute, AppointmentCommute, BusinessPick up childTransport, Business
Price<10 W, 10–20 W, 30–40 W20–30 W, >40 W10–20 W, 20–30 W,20–30 W
Mileage<2000 km, 2000–5000 km5000–10,000 km, 10,000 km–15,000 km2000–5000 km, >15,000 km>15,000 km
Table 3. User portrait composed of special association rule.
Table 3. User portrait composed of special association rule.
User PortraitAntecedent xConsequent yConfidence ThresholdSupportConfidenceClusterNumber of Users
Systems 12 00491 i001[1]A1, B1, C1, E1,M50.30.110.421387
Systems 12 00491 i001[2]A1, B3, C4, E3M50.30.040.31294
Table 4. The results of user portrait mining under common needs.
Table 4. The results of user portrait mining under common needs.
Target UsersUser PortraitExplicit Feedback Common NeedsKernel FunctionPercentageNumber of Model
Systems 12 00491 i002[1]1H3, I5, J4, K3,
L5, N4, O4
Being greater than or equal to the existing score in all evaluation attribute labels Linear 0.2285
Systems 12 00491 i002[2]2H4, I4, J5, K4,
L3, N5, O3
Linear 0.3028
Table 5. Recommendation list of target users in the case study.
Table 5. Recommendation list of target users in the case study.
Target UsersPurchased ModelRecommendation List (TOP-5)
12345
Systems 12 00491 i003[1]Mazda3AxelawhiteBeijing-U7 whiteMalibu xl whiteAudiA3
white
Mondeo whiteHongqi H6 white
Systems 12 00491 i003[2]Wey VV7 greyVolvo XC60 silverHongqi HS5 greyWey VV7
grey
Tiguan L blackAudi Q3 grey
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Cai, W.; Yang, M.; Lin, L. An Inspiration Recommendation System for Automotive Styling Design Based on User Behavior Data and Group Preferences. Systems 2024, 12, 491. https://doi.org/10.3390/systems12110491

AMA Style

Cai W, Yang M, Lin L. An Inspiration Recommendation System for Automotive Styling Design Based on User Behavior Data and Group Preferences. Systems. 2024; 12(11):491. https://doi.org/10.3390/systems12110491

Chicago/Turabian Style

Cai, Wanxin, Mingqing Yang, and Li Lin. 2024. "An Inspiration Recommendation System for Automotive Styling Design Based on User Behavior Data and Group Preferences" Systems 12, no. 11: 491. https://doi.org/10.3390/systems12110491

APA Style

Cai, W., Yang, M., & Lin, L. (2024). An Inspiration Recommendation System for Automotive Styling Design Based on User Behavior Data and Group Preferences. Systems, 12(11), 491. https://doi.org/10.3390/systems12110491

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop