KEY RESPONSIBILITIES:
- Deploy and operate shared multi-tenant infrastructure for a SaaS microservice-based containerized systems in addition to customer-centric application environments
- Administer and automate the team’s growing presence in AWS
- Operational support for Guidewire’s flagship highly integrated Java-based Insurance Suite web applications
- Implement and operate a complex single sign-on SAML/OAuth-based central authentication platform
- Develop and deploy tooling to aid in driving 24x7x365 service operations of critical world-wide systems
- Automate deployment tasks for core product and infrastructure tools and maintain automation infrastructure
- Build system documentation and training materials to empower and educate our own and other CCO teams
REQUIREMENTS
- Bachelor’s Degree in Computer Science or related field
- Familiarity with the Agile software development lifecycle
- Background with Linux systems administration and strong scripting skills in Bash, Python, Go, etc.
- Experience supporting web applications running on Java / Apache / Tomcat in a live production environment
- Proven experience with automating systems and infrastructure with Terraform
- Production-At-Scale support background in a heavily microservice-based world
- Working with Kubernetes hands-on in a “Been there, Done that” way
- Strong understanding of Single-Sign On, SAML, oAuth (Bonus points if hands-on experience with Okta)
- Background utilizing and supporting log analytical tools such as DataDog (Logging and APM)
- Great understanding of DevOps tools, CI/CD and hands-on experience with git, Bitbucket and TeamCity
- Seasoned expertise around x.509 certificate technology and basic concepts of encryption
- Solid understanding of concepts surrounding containerized networking and all things IP
- Experience working with Relational Databases such as Aurora Postgres and/or Oracle RDS
- Exposure to broad technical skills such as application development, web UI (design and development), JSON, application architecture
- Ability to read and interpret application server thread dumps, Catalina outputs, CloudTrail, and other critical logging outputs.