Raul Castro Fernandez
2020 – today
- 2024
- [j16] Yue Gong, Sainyam Galhotra, Raul Castro Fernandez: Nexus: Correlation Discovery over Collections of Spatio-Temporal Tabular Data. Proc. ACM Manag. Data 2(3): 154 (2024)
- [j15] Tapan Srivastava, Raul Castro Fernandez: Saving Money for Analytical Workloads in the Cloud. Proc. VLDB Endow. 17(11): 3524-3537 (2024)
- [c31] Kevin Dharmawan, Chirag A. Kawediya, Yue Gong, Zaki Indra Yudhistira, Zhiru Zhu, Sainyam Galhotra, Adila Alfa Krisnadhi, Raul Castro Fernandez: Demonstration of Ver: View Discovery in the Wild. SIGMOD Conference Companion 2024: 428-431
- [c30] Yue Gong, Raul Castro Fernandez: Demonstrating Nexus for Correlation Discovery over Collections of Spatio-Temporal Tabular Data. SIGMOD Conference Companion 2024: 524-527
- [c29] Raul Castro Fernandez, Arnab Nandi: Responsible Sharing of Spatiotemporal Data. SIGMOD Conference Companion 2024: 580-584
- [i22] Tapan Srivastava, Raul Castro Fernandez: Saving Money for Analytical Workloads in the Cloud. CoRR abs/2408.00253 (2024)
- [i21] Zhiru Zhu, Raul Castro Fernandez: Controlling Dataflows with a Bolt-on Data Escrow. CoRR abs/2408.01580 (2024)
- [i20] Siyuan Xia, Chris Zhu, Tapan Srivastava, Bridget Fahey, Raul Castro Fernandez: Programmable Dataflows: Abstraction and Programming Model for Data Sharing. CoRR abs/2408.04092 (2024)
- [i19] Qiming Wang, Raul Castro Fernandez: FabricQA-Extractor: A Question Answering System to Extract Information from Documents using Natural Language Questions. CoRR abs/2408.09226 (2024)
- 2023
- [j14] Raul Castro Fernandez: Data-Sharing Markets: Model, Protocol, and Algorithms to Incentivize the Formation of Data-Sharing Consortia. Proc. ACM Manag. Data 1(2): 172:1-172:25 (2023)
- [j13] Matthew Perron, Raul Castro Fernandez, David J. DeWitt, Michael J. Cafarella, Samuel Madden: Cackle: Analytical Workload Cost and Performance Stability With Elastic Pools. Proc. ACM Manag. Data 1(4): 233:1-233:25 (2023)
- [j12] Qiming Wang, Raul Castro Fernandez: Solo: Data Discovery Using Natural Language Questions Via A Self-Supervised Approach. Proc. ACM Manag. Data 1(4): 262:1-262:27 (2023)
- [j11] Zezhou Huang, Jiaxiang Liu, Daniel Alabi, Raul Castro Fernandez, Eugene Wu: Saibot: A Differentially Private Data Search Platform. Proc. VLDB Endow. 16(11): 3057-3070 (2023)
- [j10] Raul Castro Fernandez, Aaron J. Elmore, Michael J. Franklin, Sanjay Krishnan, Chenhao Tan: How Large Language Models Will Disrupt Data Management. Proc. VLDB Endow. 16(11): 3302-3309 (2023)
- [j9] Jian Pei, Raul Castro Fernandez, Xiaohui Yu: Data and AI Model Markets: Opportunities for Data and Model Sharing, Discovery, and Integration. Proc. VLDB Endow. 16(12): 3872-3873 (2023)
- [c28] Yue Gong, Zhiru Zhu, Sainyam Galhotra, Raul Castro Fernandez: Ver: View Discovery in the Wild. ICDE 2023: 503-516
- [c27] Sainyam Galhotra, Yue Gong, Raul Castro Fernandez: Metam: Goal-Oriented Data Discovery. ICDE 2023: 2780-2793
- [c26] Boxin Zhao, Boxiang Lyu, Raul Castro Fernandez, Mladen Kolar: Addressing Budget Allocation and Revenue Allocation in Data Market Environments Using an Adaptive Sampling Algorithm. ICML 2023: 42081-42097
- [i18] Qiming Wang, Raul Castro Fernandez: Data Discovery using Natural Language Questions via a Self-Supervised Approach. CoRR abs/2301.03560 (2023)
- [i17] Sainyam Galhotra, Yue Gong, Raul Castro Fernandez: METAM: Goal-Oriented Data Discovery. CoRR abs/2304.09068 (2023)
- [i16] Siyuan Xia, Zhiru Zhu, Chris Zhu, Jinjin Zhao, Kyle Chard, Aaron J. Elmore, Ian T. Foster, Michael J. Franklin, Sanjay Krishnan, Raul Castro Fernandez: Data Station: Delegated, Trustworthy, and Auditable Computation to Enable Data-Sharing Consortia with a Data Escrow. CoRR abs/2305.03842 (2023)
- [i15] Zezhou Huang, Pranav Subramaniam, Raul Castro Fernandez, Eugene Wu: Kitana: Efficient Data Augmentation Search for AutoML. CoRR abs/2305.10419 (2023)
- [i14] Boxin Zhao, Boxiang Lyu, Raul Castro Fernandez, Mladen Kolar: Addressing Budget Allocation and Revenue Allocation in Data Market Environments Using an Adaptive Sampling Algorithm. CoRR abs/2306.02543 (2023)
- [i13] Zezhou Huang, Jiaxiang Liu, Daniel Alabi, Raul Castro Fernandez, Eugene Wu: Saibot: A Differentially Private Data Search Platform. CoRR abs/2307.00432 (2023)
- [i12] Zhiru Zhu, Raul Castro Fernandez: Making Differential Privacy Easier to Use for Data Controllers and Data Analysts using a Privacy Risk Indicator and an Escrow-Based Platform. CoRR abs/2310.13104 (2023)
- [i11] Minbiao Han, Jonathan Light, Steven Xia, Sainyam Galhotra, Raul Castro Fernandez, Haifeng Xu: A Data-Centric Online Market for Machine Learning: From Discovery to Pricing. CoRR abs/2310.17843 (2023)
- 2022
- [j8] Chaitanya K. Baru, Michael Pozmantier, Ilkay Altintas, Stephen Baek, Jonathan Cohen, Laura E. Condon, Giulia Fanti, Raul Castro Fernandez, Ethan Jackson, Upmanu Lall, Bennett A. Landman, Hai Li, Claudia Marin, Beatriz Martínez-López, Dimitris N. Metaxas, Bradley D. Olsen, Grier P. Page, Yelda Turkan, Jingbo Zhang, Peng Zhang: Enabling AI Innovation via Data and Model Sharing: An Overview of the NSF Convergence Accelerator Track D. AI Mag. 43(1): 93-104 (2022)
- [j7] Siyuan Xia, Zhiru Zhu, Chris Zhu, Jinjin Zhao, Kyle Chard, Aaron J. Elmore, Ian T. Foster, Michael J. Franklin, Sanjay Krishnan, Raul Castro Fernandez: Data Station: Delegated, Trustworthy, and Auditable Computation to Enable Data-Sharing Consortia with a Data Escrow. Proc. VLDB Endow. 15(11): 3172-3185 (2022)
- [j6] Javen Kennedy, Pranav Subramaniam, Sainyam Galhotra, Raul Castro Fernandez: Revisiting Online Data Markets in 2022: A Seller and Buyer Perspective. SIGMOD Rec. 51(3): 30-37 (2022)
- [c25] Zixuan Zhao, Raul Castro Fernandez: Leva: Boosting Machine Learning Performance with Relational Embedding Data Augmentation. SIGMOD Conference 2022: 1504-1517
- [c24] Raul Castro Fernandez: Protecting Data Markets from Strategic Buyers. SIGMOD Conference 2022: 1755-1769
- 2021
- [i10] Pranav Subramaniam, Yintong Ma, Chi Li, Ipsita Mohanty, Raul Castro Fernandez: Comprehensive and Comprehensible Data Catalogs: The What, Who, Where, When, Why, and How of Metadata Management. CoRR abs/2103.07532 (2021)
- [i9] Yue Gong, Zhiru Zhu, Sainyam Galhotra, Raul Castro Fernandez: Niffler: A Reference Architecture and System Implementation for View Discovery over Pathless Table Collections by Example. CoRR abs/2106.01543 (2021)
- 2020
- [j5] Nadiia Chepurko, Ryan Marcus, Emanuel Zgraggen, Raul Castro Fernandez, Tim Kraska, David R. Karger: ARDA: Automatic Relational Data Augmentation for Machine Learning. Proc. VLDB Endow. 13(9): 1373-1387 (2020)
- [j4] Raul Castro Fernandez, Pranav Subramaniam, Michael J. Franklin: Data Market Platforms: Trading Data Assets to Solve Data Problems. Proc. VLDB Endow. 13(11): 1933-1947 (2020)
- [c23] Raul Castro Fernandez: A System for Studying Deep Network Training. CIDR 2020
- [c22] Matthew Perron, Raul Castro Fernandez, David J. DeWitt, Samuel Madden: Starling: A Scalable Query Engine on Cloud Functions. SIGMOD Conference 2020: 131-141
- [i8] Raul Castro Fernandez, Pranav Subramaniam, Michael J. Franklin: Data Market Platforms: Trading Data Assets to Solve Data Problems [Vision Paper]. CoRR abs/2002.01047 (2020)
- [i7] Nadiia Chepurko, Ryan Marcus, Emanuel Zgraggen, Raul Castro Fernandez, Tim Kraska, David R. Karger: ARDA: Automatic Relational Data Augmentation for Machine Learning. CoRR abs/2003.09758 (2020)
- [i6] Raul Castro Fernandez, Kyle Chard, Ben Blaiszik, Sanjay Krishnan, Aaron J. Elmore, Ziad Obermeyer, Josh Risley, Sendhil Mullainathan, Michael J. Franklin, Ian T. Foster: The Data Station: Combining Data, Compute, and Market Forces. CoRR abs/2009.00035 (2020)
2010 – 2019
- 2019
- [c21] Raul Castro Fernandez, Jisoo Min, Demitri Nava, Samuel Madden: Lazo: A Cardinality-Based Method for Coupled Estimation of Jaccard Similarity and Containment. ICDE 2019: 1190-1201
- [c20] Raul Castro Fernandez, Samuel Madden: Termite: a system for tunneling through heterogeneous data. aiDM@SIGMOD 2019: 7:1-7:8
- [c19] Mohammad Mahdavi, Ziawasch Abedjan, Raul Castro Fernandez, Samuel Madden, Mourad Ouzzani, Michael Stonebraker, Nan Tang: Raha: A Configuration-Free Error Detection System. SIGMOD Conference 2019: 865-882
- [p2] Mourad Ouzzani, Nan Tang, Raul Castro Fernandez: Data civilizer: end-to-end support for data discovery, integration, and cleaning. Making Databases Work 2019: 291-300
- [p1] Raul Castro Fernandez: Aurum: a story about research taste. Making Databases Work 2019: 387-391
- [i5] Raul Castro Fernandez, Samuel Madden: Termite: A System for Tunneling Through Heterogeneous Data. CoRR abs/1903.05008 (2019)
- [i4] Matthew Perron, Raul Castro Fernandez, David J. DeWitt, Samuel Madden: Starling: A Scalable Query Engine on Cloud Function Services. CoRR abs/1911.11727 (2019)
- [i3] Raul Castro Fernandez, Nan Tang, Mourad Ouzzani, Michael Stonebraker, Samuel Madden: Dataset-On-Demand: Automatic View Search and Presentation for Data Discovery. CoRR abs/1911.11876 (2019)
- 2018
- [c18] Andrew Ilyas, Joana M. F. da Trindade, Raul Castro Fernandez, Samuel Madden: Extracting Syntactical Patterns from Databases. ICDE 2018: 41-52
- [c17] Raul Castro Fernandez, Essam Mansour, Abdulhakim Ali Qahtan, Ahmed K. Elmagarmid, Ihab F. Ilyas, Samuel Madden, Mourad Ouzzani, Michael Stonebraker, Nan Tang: Seeping Semantics: Linking Datasets Using Word Embeddings for Data Discovery. ICDE 2018: 989-1000
- [c16] Raul Castro Fernandez, Ziawasch Abedjan, Famien Koko, Gina Yuan, Samuel Madden, Michael Stonebraker: Aurum: A Data Discovery System. ICDE 2018: 1001-1012
- [c15] Essam Mansour, Dong Deng, Raul Castro Fernandez, Abdulhakim Ali Qahtan, Wenbo Tao, Ziawasch Abedjan, Ahmed K. Elmagarmid, Ihab F. Ilyas, Samuel Madden, Mourad Ouzzani, Michael Stonebraker, Nan Tang: Building Data Civilizer Pipelines with an Advanced Workflow Engine. ICDE 2018: 1593-1596
- [c14] Abdulhakim Ali Qahtan, Ahmed K. Elmagarmid, Raul Castro Fernandez, Mourad Ouzzani, Nan Tang: FAHES: A Robust Disguised Missing Values Detector. KDD 2018: 2100-2109
- [c13] Raul Castro Fernandez, William Culhane, Pijika Watcharapichat, Matthias Weidlich, Victoria Lopez Morales, Peter R. Pietzuch: Meta-Dataflows: Efficient Exploratory Dataflow Jobs. SIGMOD Conference 2018: 1157-1172
- [i2] Guillaume Leclerc, Manasi Vartak, Raul Castro Fernandez, Tim Kraska, Samuel Madden: Smallify: Learning Network Size while Training. CoRR abs/1806.03723 (2018)
- 2017
- [j3] Michael Stonebraker, Raul Castro Fernandez, Dong Deng, Michael L. Brodie: What to do about database decay. Commun. ACM 60(1): 11 (2017)
- [c12] Dong Deng, Raul Castro Fernandez, Ziawasch Abedjan, Sibo Wang, Michael Stonebraker, Ahmed K. Elmagarmid, Ihab F. Ilyas, Samuel Madden, Mourad Ouzzani, Nan Tang: The Data Civilizer System. CIDR 2017
- [c11] Raul Castro Fernandez, Dong Deng, Essam Mansour, Abdulhakim Ali Qahtan, Wenbo Tao, Ziawasch Abedjan, Ahmed K. Elmagarmid, Ihab F. Ilyas, Samuel Madden, Mourad Ouzzani, Michael Stonebraker, Nan Tang: A Demo of the Data Civilizer System. SIGMOD Conference 2017: 1639-1642
- [i1] Andrew Ilyas, Joana M. F. da Trindade, Raul Castro Fernandez, Samuel Madden: Extracting Syntactic Patterns from Databases. CoRR abs/1710.11528 (2017)
- 2016
- [b1] Raul Castro Fernandez: Stateful data-parallel processing. Imperial College London, UK, 2016
- [j2] Ziawasch Abedjan, Xu Chu, Dong Deng, Raul Castro Fernandez, Ihab F. Ilyas, Mourad Ouzzani, Paolo Papotti, Michael Stonebraker, Nan Tang: Detecting Data Errors: Where are we and what needs to be done? Proc. VLDB Endow. 9(12): 993-1004 (2016)
- [j1] Badrish Chandramouli, Raul Castro Fernandez, Jonathan Goldstein, Ahmed Eldawy, Abdul Quamar: Quill: Efficient, Transferable, and Rich Analytics at Scale. Proc. VLDB Endow. 9(14): 1623-1634 (2016)
- [c10] Pijika Watcharapichat, Victoria Lopez Morales, Raul Castro Fernandez, Peter R. Pietzuch: Ako: Decentralised Deep Learning with Partial Gradient Exchange. SoCC 2016: 84-97
- [c9] Alexandros Koliousis, Matthias Weidlich, Raul Castro Fernandez, Alexander L. Wolf, Paolo Costa, Peter R. Pietzuch: The SABER system for window-based hybrid stream processing with GPGPUs: demo. DEBS 2016: 354-357
- [c8] Raul Castro Fernandez, Panagiotis Garefalakis, Peter R. Pietzuch: Java2SDG: Stateful big data processing for the masses. ICDE 2016: 1390-1393
- [c7] Raul Castro Fernandez, Ziawasch Abedjan, Samuel Madden, Michael Stonebraker: Towards large-scale data discovery: position paper. ExploreDB@SIGMOD/PODS 2016: 3-5
- [c6] Alexandros Koliousis, Matthias Weidlich, Raul Castro Fernandez, Alexander L. Wolf, Paolo Costa, Peter R. Pietzuch: SABER: Window-Based Hybrid Stream Processing for Heterogeneous Architectures. SIGMOD Conference 2016: 555-569
- 2015
- [c5] Raul Castro Fernandez, Peter R. Pietzuch, Jay Kreps, Neha Narkhede, Jun Rao, Joel Koshy, Dong Lin, Chris Riccomini, Guozhang Wang: Liquid: Unifying Nearline and Offline Big Data Integration. CIDR 2015
- 2014
- [c4] Raul Castro Fernandez, Matthias Weidlich, Peter R. Pietzuch, Avigdor Gal: Scalable stateful stream processing for smart grids. DEBS 2014: 276-281
- [c3] Raul Castro Fernandez, Matteo Migliavacca, Evangelia Kalyvianaki, Peter R. Pietzuch: Making State Explicit for Imperative Big Data Processing. USENIX ATC 2014: 49-60
- 2013
- [c2] Raul Castro Fernandez, Matteo Migliavacca, Evangelia Kalyvianaki, Peter R. Pietzuch: Scalable and Fault-tolerant Stateful Stream Processing. ICCSW 2013: 11-18
- [c1] Raul Castro Fernandez, Matteo Migliavacca, Evangelia Kalyvianaki, Peter R. Pietzuch: Integrating scale out and fault tolerance in stream processing using operator state management. SIGMOD Conference 2013: 725-736
last updated on 2024-12-22 18:58 CET by the dblp team
all metadata released as open data under CC0 1.0 license