- Expertise in managing distributed storage for large datasets across multiple nodes in Solr hosted on AWS.
- Understanding of how data is ingested into Solr and the storage requirements for optimal performance and availability.
- Building and maintaining Solr search indexes, ensuring they are optimized for performance and scalability.
- Handling Solr operations such as managing replicas, handling failure scenarios, and restoring replicas when they go down.
- Managing ElasticSearch as part of the search infrastructure when necessary.
- Designing, building, and managing Solr clusters on AWS.
- Handling cluster operations such as bringing clusters back online when they go down and monitoring their health to ensure uptime and reliability.
- Automating the deployment and management of Solr infrastructure using Python.
- Ensuring smooth operation, scaling, and maintenance through automation tools and techniques.
Technical Expertise Required:
Core Skills - Apache Solr (50%)
- Strong knowledge and hands-on experience with Apache Solr, especially in distributed environments.
- Cluster management, replica handling, troubleshooting, and optimizing search performance.
- Knowledge of Solr operations, including dataset ingestion, failure handling, and restoration of replicas.
AWS, Python, Kubernetes (50%)
- Experience with AWS services for hosting and managing Solr infrastructure.
- Proficiency in Python scripting to automate Solr deployments, scaling, and monitoring.
- Hands-on experience with Kubernetes for containerizing and orchestrating Solr deployments.
Weightage:
- Apache Solr (50%): Focused on search index management, cluster operations, and Solr performance.
- AWS + Python + Kubernetes (50%): Cloud infrastructure, automation, and container management.