Inventors
Marcelo Arenas, Gonzalo Diaz, Achille Fokoue, Anastasios Kementsietsidis, Kavitha Srinivas
Publication date
2017/7/18
Patent office
US
Patent number
9710496
Application number
14151768
Description
A schema for a dataset is identified by identifying a dataset comprising data and relationships between data pairs. An original schema is identified for the dataset. This original schema comprises an organizational structure. An initial fit between the dataset and the original schema is determined. The initial fit quantifying a conformity of the data in the dataset to the organizational structure of the original schema. A plurality of additional schemas are identified. Each additional schema is a distinct organizational schema. The dataset is partitioned into a plurality of subsets. Each subset comprises a modified fit quantifying a modified conformity of subset data in each subset to one of the original schema and the additional schemas. The modified fit is greater than the original fit.
Total citations
202020212022202311
Scholar articles
M Arenas, G Diaz, A Fokoue, A Kementsietsidis… - US Patent 9,710,496, 2017
M Arenas, G Diaz, A Fokoue, A Kementsietsidis… - US Patent 11,573,935, 2023