• JoomlaWorks Simple Image Rotator
  • JoomlaWorks Simple Image Rotator
  • JoomlaWorks Simple Image Rotator
  • JoomlaWorks Simple Image Rotator
  • JoomlaWorks Simple Image Rotator
  • JoomlaWorks Simple Image Rotator
  • JoomlaWorks Simple Image Rotator
  • JoomlaWorks Simple Image Rotator
  • JoomlaWorks Simple Image Rotator
  • JoomlaWorks Simple Image Rotator
 
  Bookmark and Share
 
 
Doctoral Thesis
DOI
https://doi.org/10.11606/T.55.2007.tde-07052007-155746
Document
Author
Full name
Gisele Busichia Baioco
Institute/School/College
Knowledge Area
Date of Defense
Published
São Carlos, 2007
Supervisor
Committee
Traina, Agma Juci Machado (President)
Carvalho, André Carlos Ponce de Leon Ferreira de
Fortes, Renata Pontin de Mattos
Souza, Fernando da Fonseca de
Souza, Jano Moreira de
Title in Portuguese
Modelo de custo para consultas por similaridade em espaços métricos
Keywords in Portuguese
Consultas por similaridade
Estimativa de seletividade
Modelo de custo
Abstract in Portuguese
Esta tese apresenta um modelo de custo para estimar o número de acessos a disco (custo de I/O) e o número de cálculos de distância (custo de CPU) para consultas por similaridade executadas sobre métodos de acesso métricos dinâmicos. O objetivo da criação do modelo é a otimização de consultas por similaridade em Sistemas de Gerenciamento de Bases de Dados relacionais e objeto-relacionais. Foram considerados dois tipos de consultas por similaridade: consulta por abrangência e consulta aos k-vizinhos mais próximos. Como base para a criação do modelo de custo foi utilizado o método de acesso métrico dinâmico Slim-Tree. O modelo estima a dimensão intrínseca do conjunto de dados pela sua dimensão de correlação fractal. A validação do modelo é confirmada por experimentos com conjuntos de dados sintéticos e reais, de variados tamanhos e dimensões, que mostram que as estimativas obtidas em geral estão dentro da faixa de variação medida em consultas reais
Title in English
Cost model for similarity queries in metric spaces
Keywords in English
Cost model
Selectivity estimation
Similarity queries
Abstract in English
This thesis presents a cost model to estimate the number of disk accesses (I/O costs) and the number of distance calculations (CPU costs) to process similarity queries over data indexed by dynamic metric access methods. The goal of the model is to optimize similarity queries on relational and object-relational Database Management Systems. Two types of similarity queries were taken into consideration: range queries and k-nearest neighbor queries. The dynamic metric access method Slim-Tree was used as the basis for the creation of the cost model. The model takes advantage of the intrinsic dimension of the data set, estimated by its correlation fractal dimension. Experiments were performed on real and synthetic data sets, with different sizes and dimensions, in order to validate the proposed model. They confirmed that the estimations are accurate, being always within the range achieved executing real queries
 
WARNING - Viewing this document is conditioned on your acceptance of the following terms of use:
This document is only for private use for research and teaching activities. Reproduction for commercial use is forbidden. This rights cover the whole data about this document as well as its contents. Any uses or copies of this document in whole or in part must include the author's name.
tese_final.pdf (1.17 Mbytes)
means that the file can only be accessed inside the Unversity of São Paulo.
Publishing Date
2007-05-08
 
WARNING: The material described below relates to works resulting from this thesis or dissertation. The contents of these works are the author's responsibility.
  • BAIOCO, G. B., TRAINA, A. J. M., and TRAINA JR, Caetano. An Effective Cost Model for Similarity Queries in Metric Spaces. In 22nd ACM Symposium on Applied Computing (SAC), Seul, 2007. Proceedings of the 22nd ACM SAC 2007.New York : ACM Press, 2007. Abstract.
  • BAIOCO, G. B., TRAINA, A. J. M., and TRAINA JR, Caetano. MAMCost: Global and Local Estimates leading to Robust Cost Estimation of Similarity Queries [doi:10.1109/ssdbm.2007.17]. In 19th International Conference on Scientific and Statistical Database Management (SSDBM 2007), Banff, 2007. Proceedings of the SSDBM 2007.Los Alamitos : IEEE Computer Society, 2007.
All rights of the thesis/dissertation are from the authors
CeTI-SC/STI
Digital Library of Theses and Dissertations of USP. Copyright © 2001-2024. All rights reserved.