Washington DC-Baltimore Area
I consider myself lucky to be in a profession I am passionate about, and I am constantly looking for ways to learn. A few things about me:
• Problem solver with a strong grasp of backend technologies and distributed systems concepts
• Experienced in handling data at scale and building processing pipelines in Apache Solr that scale for analytics and search operations
• Deep understanding of text-based search, with significant experience working with the most popular open-source search engines (Apache Solr and Elasticsearch) and the Apache Lucene search library (which forms the foundation of many search engines)
• Able to quickly grasp new frameworks and platforms and understand them in sufficient depth to know their strengths and limitations
Programming Languages: Java, Python
Frameworks/Libraries/Tools: Apache Solr, Elasticsearch, Apache Spark
Databases: MySQL, Oracle, MS SQL Server
Other skills: Linux shell scripting, AWS, SQL
Commvault is a recognized global leader in enterprise backup, recovery, and data management. I work on the search infrastructure team at Commvault to implement and enhance features used by thousands of enterprise customers. Here is what I do and have done so far:
• Enhance Commvault's distributed indexing infrastructure for backed-up data in standalone Solr and SolrCloud, and make it robust enough to scale across multi-terabyte indexes.
• Leverage and extend (through custom plugins) Apache Solr to build scalable data pipelines and rich search features (morphological search, advanced faceting based on keywords, etc.).
• Improved the relevance of search results for the technical documentation website by 90% (as measured by nDCG).
Active contributor to the Apache Solr project. Contributions include, but are not limited to, code patches and reviews.
(As of Nov 1, 2017, Search Technologies was acquired by Accenture and became part of Accenture Analytics)
• Second place in the annual coding competition.
• Trained a class of 30+ engineers, architects and project managers on building a recommendation type-ahead service using Elasticsearch and Spark.
• Built a recommender system to make relevant document suggestions based on past user search and view patterns using Apache Spark (with Scala for processing) and Elasticsearch (as the search engine).
• Implemented the query suggestion (auto-complete) feature for searches, helping the user pick from the most relevant historic search queries. The implementation used Elasticsearch, Logstash and Filebeat for indexing data, and Python scripts for data ingestion.
• Implemented the end-to-end application flow to capture the user query through the UI, query the Solr search engine using SolrJ, and render results and facets in a J2EE application.
• Implemented a mechanism for entity extraction and automatic tagging of documents indexed in Solr using Java, Python and Groovy, resulting in more relevant search results.
• Developed the audit logging system for security monitoring using Elasticsearch, Logstash, Java, Python and PHP to alert the user about any suspicious account activity.
• Mentored and managed a team of two to develop a business rules evaluation framework in Rhino JavaScript and Drools, and an auto deployment framework using Linux shell scripts.
• Designed a configurable framework using JSON configurations to enable on-the-fly transfer of data elements from an Excel-based business document into Entity Service (modelled in the form of relational database tables). This eliminated the need to populate the data (used to generate reports) in these tables through SQL scripts.
• Designed and developed a configurable rule evaluation framework to validate customer-specific business rules and evaluate the corresponding business metrics using JSON and Rhino JavaScript (used for server-side applications). This eliminated the complex Excel formulae in the UI, as evaluation of rules was moved to the server side.
• Developed the workflow module responsible for tracking tasks for each actor during the lifecycle of a pricing document.
• Created an auto deployment toolkit using Linux shell scripts for deployment of server-side solution artifacts.
• Developed a Linux shell script toolkit containing commands to start, stop, configure and maintain different services in the product.