Sandeep Tata
Sandeep Tata
Verified email at google.com - Homepage
Title
Cited by
Cited by
Year
Estimating the selectivity of tf-idf based cosine similarity predicates
S Tata, JM Patel
ACM Sigmod Record 36 (2), 7-12, 2007
2232007
Using paxos to build a scalable, consistent, and highly available datastore
J Rao, EJ Shekita, S Tata
arXiv preprint arXiv:1103.2408, 2011
2072011
Column-oriented storage techniques for MapReduce
A Floratou, J Patel, E Shekita, S Tata
arXiv preprint arXiv:1105.4252, 2011
1992011
SQAK: doing more with keywords
S Tata, GM Lohman
Proceedings of the 2008 ACM SIGMOD international conference on Management of …, 2008
1532008
Practical suffix tree construction
S Tata, RA Hankins, JM Patel
VLDB 4, 36-47, 2004
1222004
Practical methods for constructing suffix trees
Y Tian, S Tata, RA Hankins, JM Patel
The VLDB Journal 14 (3), 281-299, 2005
1182005
Efficient and accurate discovery of patterns in sequence data sets
A Floratou, S Tata, JM Patel
IEEE Transactions on Knowledge and Data Engineering 23 (8), 1154-1168, 2011
772011
Clydesdale: structured data processing on MapReduce
T Kaldewey, EJ Shekita, S Tata
Proceedings of the 15th international conference on extending database …, 2012
732012
Sparkler: Supporting large-scale matrix factorization
B Li, S Tata, Y Sismanis
Proceedings of the 16th international conference on extending database …, 2013
472013
Declarative querying for biological sequences
S Tata, JS Friedman, A Swaroop
22nd International Conference on Data Engineering (ICDE'06), 87-87, 2006
472006
Differentiated secondary index maintenance in log structured NoSQL data stores
W Tan, S Tata
US Patent 9,218,383, 2015
412015
Diff-Index: Differentiated Index in Distributed Log-Structured Data Stores.
W Tan, S Tata, Y Tang, LL Fong
EDBT, 700-711, 2014
412014
BlueSNP: R package for highly scalable genome-wide association studies using Hadoop clusters
H Huang, S Tata, RJ Prill
Bioinformatics 29 (1), 135-136, 2013
402013
Scalable row-store with consensus-based replication
J Rao, EJ Shekita, S Tata
US Patent 9,047,331, 2015
392015
Quick access: building a smart experience for Google drive
S Tata, A Popescul, M Najork, M Colagrosso, J Gibbons, A Green, A Mah, ...
Proceedings of the 23rd ACM SIGKDD International Conference on Knowledge …, 2017
322017
Representation learning for information extraction from form-like documents
BP Majumder, N Potti, S Tata, JB Wendt, Q Zhao, M Najork
proceedings of the 58th annual meeting of the Association for Computational …, 2020
312020
Towards a scalable enterprise content analytics platform
K Beyer
Bulletin of the IEEE Computer Society Technical Committee on Data …, 2009
302009
Efficient join with one or more large dimension tables
RJ Barber, NK Chainani, GM Lohman, MH Pirahesh, V Raman, RS Sidle, ...
US Patent 9,141,667, 2015
262015
Clydesdale: structured data processing on Hadoop
A Balmin, T Kaldewey, S Tata
Proceedings of the 2012 ACM SIGMOD International Conference on Management of …, 2012
262012
Leveraging a scalable row store to build a distributed text index
N Li, J Rao, E Shekita, S Tata
Proceedings of the first international workshop on Cloud data management, 29-36, 2009
262009
The system can't perform the operation now. Try again later.
Articles 1–20