Data guy. Deep into engineering, analytics and sciences. PhD in computer science and engineering from IIT Madras. Spent nearly 20 years across different aspects of data, from storage to processing to analytics to sciences. Heading the data sciences team at SapientRazorfish. Also looking at modernizing the engineering team. Heading all next-generation work in the data space. Exploring the deficiencies of deep learning and consequently dabbling with capsule networks and other next-gen AI algorithms.
Abstract: The power and flexibility of a distributed object system increase manyfold if it allows objects to migrate across machines. CORBA, which is emerging as a standard for distributed object computing, currently lacks a proper mechanism for object migration. In ...
Master alternative Big Data technologies that can do what Hadoop can't: real-time analytics and iterative machine learning. When most technical professionals think of Big Data analytics today, they think of Hadoop. But there are many cutting-edge applications that Hadoop isn't well suited for, especially real-time analytics and contexts requiring the use of iterative machine learning algorithms. Fortunately, several powerful new technologies have been developed specifically for use cases such as these. Big Data Analytics Beyond Hadoop is the first guide specifically designed to help you take the next steps beyond Hadoop. Dr. Vijay Srinivas Agneeswaran introduces the breakthrough Berkeley Data Analysis Stack (BDAS) in detail, including its motivation, design, architecture, Mesos cluster management, performance, and more. He presents realistic use cases and up-to-date example code for: Spark, the next-generation in-memory computing technology from UC Berkeley; Storm, the parall...
Reference LSIR-CONF-2007-039 URL: http://www.collaboratecom.org/2006/program.php Record created on 2007-11-09, modified on 2017-05-12
Complex systems such as those in evolution, growth and depinning models do not evolve slowly and gradually, but exhibit avalanche dynamics or punctuated equilibria. Self-Organized Criticality (SOC) and Highly Optimized Tolerance (HOT) are two theoretical models that explain such avalanche dynamics. We have studied avalanche dynamics in two vastly different grid computing systems: Optimal Grid and Vishva. Failures in Optimal Grid cause an avalanche effect with respect to the overall computation. Vishva does not exhibit failure avalanches. Interestingly, Vishva exhibits load avalanche effects at critical load density, wherein a small load disturbance in one node can cause load disturbances in several other nodes. The avalanche dynamics of grid computing systems imply that grids can be viewed as SOC systems or as HOT systems. An SOC perspective suggests that grids may be sub-optimal in performance, but may be robust to unanticipated uncertainties. A HOT perspective suggests that grid...
This paper takes a step toward identifying the state of the art in semantic P2P systems. On the one hand, a lot of research in the P2P systems community has focused on fault tolerance and scalability, resulting in numerous algorithms and systems such as Chord, Pastry and P-Grid. These systems, however, have no notion of semantics and consequently have difficulty with knowledge sharing. On the other hand, research in the semantic web community has focused on knowledge sharing among different nodes with possibly different schemas. These efforts have tended to use centralized repositories. The obvious benefit of combining P2P and semantic systems would be a large-scale collection of structured data. Several recent efforts have focused on this combination. However, there has been no attempt to group these efforts in one place for easy assimilation and for finding interesting future directions; this paper fills the gap.
Note: Short paper Reference LSIR-CONF-2008-042 URL: http://www.aknowledge.org/ Record created on 2008-02-01, modified on 2017-05-12
It is time for the healthcare industry to move from the era of "analyzing our health history" to the age of "managing the future of our health." In this article, we illustrate the importance of real-time analytics across the healthcare industry by providing a generic mechanism to reengineer traditional analytics expressed in the R programming language into Storm-based real-time analytics code. This is a powerful abstraction, since most data scientists use R to write the analytics and are not clear on how to make the analytics work in real time and on high-velocity data. Our paper focuses on the applications necessary to a healthcare analytics scenario, specifically on the importance of electrocardiogram (ECG) monitoring. A physician can use our framework to compare ECG reports by categorization and consequently detect arrhythmia. The framework reads the ECG signals and uses a machine learning-based categorizer that runs within a Storm environment to compare different ECG signals. The paper also presents some performance studies of the framework to illustrate the throughput and accuracy trade-off in real-time analytics.
Sriram Aananthakrishnan Sherif Abdelwahed Vijay Srinivas Agneeswaran Muhammad Aleem Nawab Ali Athanasios Antoniou Anne Auger Jim Basney Leonardo Bautista Josep Ll. Berral Laurent Bobelin Ryan Braithwaite John Bresnahan Rodrigo N. Calheiros Miguel Camelo Ghislain Charrier Qian Chen Wei-Fan Chiang Nitin Chiluka Pierre-Nicolas Clauss Xabriel J. Collazo-Mojica Minh Quan Dang Thomas De Ruiter Simon Delamare Javier Delgado Benjamin Depardon Marcos Dias De Assuncao James Dinan Mohammed El Mehdi Diouri Bruno Donassolo Abhishek ...
CiteSeerX - Document Details (Isaac Councill, Lee Giles): www.elsevier.com/locate/jpdc Existing models for parallel programming over Common Object Request Broker Architecture (CORBA) do not address issues specific to parallel programming over a Network of Workstations ...
Papers by Dr. Vijay Srinivas Agneeswaran