Multi-View Concept Learning for Data Representation

Published: 01 November 2015

Abstract

Real-world datasets often involve multiple views of data items. For example, a Web page can be described by both its content and the anchor texts of hyperlinks leading to it, and photos on Flickr can be characterized by visual features as well as user-contributed tags. Different views provide complementary information. Synthesizing multi-view features can yield a comprehensive description of the data items, which can benefit many data analytics applications. Unfortunately, simply concatenating the feature vectors of different views ignores the statistical properties of each view and usually incurs the “curse of dimensionality.” We propose Multi-view Concept Learning (MCL), a novel nonnegative latent representation learning algorithm for capturing conceptual factors from multi-view data. MCL exploits both multi-view information and label information. The key idea is to learn a common latent space across views that (1) captures the semantic relationships between data items through graph embedding regularization on labeled items, and (2) allows each latent factor to be associated with a subset of views via sparseness constraints. In this way, MCL can capture flexible conceptual patterns hidden in multi-view features. Experiments on a toy problem and three real-world datasets show that MCL outperforms baseline methods.
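
To make the idea of a common nonnegative latent space concrete, the following is a minimal, dependency-free sketch of plain joint nonnegative matrix factorization with standard multiplicative updates. It is not the MCL algorithm: it omits the graph embedding regularization on labeled items and the sparseness constraints described above. The function name `joint_nmf`, its parameters, and the toy matrices are assumptions made for illustration only. Each view is a features-by-items matrix, every view gets its own basis, and all views share one coefficient matrix `h`.

```python
import random

def matmul(a, b):
    """Multiply two matrices stored as lists of rows."""
    cols = list(zip(*b))
    return [[sum(x * y for x, y in zip(row, col)) for col in cols] for row in a]

def transpose(a):
    """Return the transpose of a matrix stored as a list of rows."""
    return [list(col) for col in zip(*a)]

def loss(views, bases, h):
    """Sum of squared reconstruction errors over all views."""
    total = 0.0
    for x, w in zip(views, bases):
        approx = matmul(w, h)
        total += sum((xi - ai) ** 2
                     for xr, ar in zip(x, approx) for xi, ai in zip(xr, ar))
    return total

def joint_nmf(views, k, iters=200, eps=1e-9, seed=0):
    """Approximate each view X_v (features x items) as W_v H with a shared H."""
    rng = random.Random(seed)
    n = len(views[0][0])  # number of items, identical across views
    bases = [[[rng.random() + 0.1 for _ in range(k)] for _ in x] for x in views]
    h = [[rng.random() + 0.1 for _ in range(n)] for _ in range(k)]
    history = [loss(views, bases, h)]
    for _ in range(iters):
        # Update each view-specific basis W_v while H is held fixed.
        ht = transpose(h)
        hht = matmul(h, ht)
        for v, x in enumerate(views):
            num = matmul(x, ht)
            den = matmul(bases[v], hht)
            bases[v] = [[w * a / (b + eps) for w, a, b in zip(wr, nr, dr)]
                        for wr, nr, dr in zip(bases[v], num, den)]
        # Update the shared H using terms summed over all views.
        num = [[0.0] * n for _ in range(k)]
        den = [[0.0] * n for _ in range(k)]
        for x, w in zip(views, bases):
            wt = transpose(w)
            nv = matmul(wt, x)
            dv = matmul(matmul(wt, w), h)
            num = [[a + b for a, b in zip(r1, r2)] for r1, r2 in zip(num, nv)]
            den = [[a + b for a, b in zip(r1, r2)] for r1, r2 in zip(den, dv)]
        h = [[hv * a / (b + eps) for hv, a, b in zip(hr, nr, dr)]
             for hr, nr, dr in zip(h, num, den)]
        history.append(loss(views, bases, h))
    return bases, h, history

# Toy data (made up): two views of the same four items.
view_a = [[1.0, 1.0, 0.0, 0.0],
          [1.0, 1.0, 0.0, 0.0],
          [0.0, 0.0, 1.0, 1.0]]
view_b = [[2.0, 2.0, 0.0, 0.0],
          [0.0, 0.0, 3.0, 3.0]]
bases, h, history = joint_nmf([view_a, view_b], k=2)
```

Because `h` is updated from all views at once, its columns serve as a single low-dimensional description of each item, in contrast to concatenating the raw feature vectors. The multiplicative form keeps every entry nonnegative as long as the initial values are positive.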

Published In

IEEE Transactions on Knowledge and Data Engineering, Volume 27, Issue 11
Nov. 2015
287 pages

Publisher

IEEE Educational Activities Department
United States

Author Tags

1. structured sparsity
2. multi-view learning
3. nonnegative matrix factorization
4. graph embedding
