Abstract
Recommendation systems help users make personalized choices among an overwhelming number of options. They draw on user data and preferences to generate recommendations tailored to individual tastes and behaviors. This paper presents the development and implementation of Pranik Movies, a recommendation system for personalized movie suggestions. The system combines collaborative and content-based filtering, using machine learning algorithms to analyze user behavior, ratings, and viewing histories. The paper gives an overview of the research framework, covering system architecture, data pre-processing, feature engineering, and model selection and design. Textual movie data is processed with stemming, bag-of-words (BoW), and TF-IDF (term frequency-inverse document frequency). Recommendation accuracy is improved by measuring the similarity between films with metrics such as cosine similarity and Euclidean distance. The paper concludes by outlining future directions: more advanced machine learning techniques, social media integration, expanded content support, and refinement of the evaluation framework. Pranik Movies is presented as an advance in recommendation systems, enabling personalized and precise movie recommendations within a vast and diverse cinematic landscape.
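The content-based part of the pipeline summarized above (TF-IDF weighting followed by a similarity measure) can be illustrated with a minimal sketch. This is not the authors' implementation: the toy catalogue `MOVIES`, the helper names (`tokenize`, `tfidf_vectors`, `recommend`), and the specific TF-IDF variant are assumptions made for illustration, and stemming is omitted for brevity.

```python
import math
from collections import Counter

# Hypothetical toy catalogue; titles and descriptions are illustrative only.
MOVIES = {
    "Space Quest": "astronaut crew explores distant planet space mission",
    "Star Voyage": "space crew mission distant star astronaut",
    "Love in Paris": "romance couple paris love story",
}


def tokenize(text):
    """Lowercase and split on whitespace (no stemming in this sketch)."""
    return text.lower().split()


def tfidf_vectors(docs):
    """Return {title: {term: weight}} with weight = tf * (ln(N / df) + 1).

    This is one common TF-IDF variant, chosen here as an assumption.
    """
    tokens = {title: tokenize(text) for title, text in docs.items()}
    n_docs = len(tokens)
    # Document frequency: number of descriptions containing each term.
    df = Counter(term for words in tokens.values() for term in set(words))
    vectors = {}
    for title, words in tokens.items():
        counts = Counter(words)
        vectors[title] = {
            term: (count / len(words)) * (math.log(n_docs / df[term]) + 1.0)
            for term, count in counts.items()
        }
    return vectors


def cosine_similarity(a, b):
    """Cosine of the angle between two sparse vectors stored as dicts."""
    dot = sum(weight * b.get(term, 0.0) for term, weight in a.items())
    norm_a = math.sqrt(sum(w * w for w in a.values()))
    norm_b = math.sqrt(sum(w * w for w in b.values()))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return dot / (norm_a * norm_b)


def euclidean_distance(a, b):
    """Euclidean distance between two sparse vectors stored as dicts."""
    terms = set(a) | set(b)
    return math.sqrt(sum((a.get(t, 0.0) - b.get(t, 0.0)) ** 2 for t in terms))


def recommend(title, docs, top_n=2):
    """Rank the other titles by cosine similarity to `title`, best first."""
    vectors = tfidf_vectors(docs)
    query = vectors[title]
    scored = [
        (other, cosine_similarity(query, vec))
        for other, vec in vectors.items()
        if other != title
    ]
    return sorted(scored, key=lambda pair: pair[1], reverse=True)[:top_n]


if __name__ == "__main__":
    for other, score in recommend("Space Quest", MOVIES):
        print(f"{other}: {score:.3f}")
```

In this sketch, cosine similarity ranks by the angle between vectors and so ignores description length, while Euclidean distance is sensitive to vector magnitude. A real system would more likely rely on a library vectorizer and a stemmer than on hand-rolled helpers like these.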
Availability of data and materials
Data sharing not applicable to this article as no datasets were generated or analysed during the current study.
Funding
No funding has been received for this work.
Ethics declarations
Competing interests
The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.
Rights and permissions
Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Kumar, V., Gupta, A.K., Garg, R.R. et al. The ultimate recommendation system: proposed Pranik System. Multimed Tools Appl 83, 43177–43198 (2024). https://doi.org/10.1007/s11042-023-17370-x
DOI: https://doi.org/10.1007/s11042-023-17370-x