Job Title: DevOps Engineer (Senior Level)
We are looking for a motivated and talented site reliability engineer (SRE). You’ll work alongside the best and brightest engineering talent in the industry to build a world-class, full-lifecycle machine learning platform.
•Run our machine learning platform on Kubernetes
•Perform daily system operations and automate them with scripts as much as possible
•Maintain and improve our Jenkins-based CI/CD system to make the development-to-deployment process smoother and more automated
•Set up monitoring with Prometheus and alert on symptoms rather than only on outages
•Work with developers to investigate and support production issues
-Proven track record of 3+ years of software operations experience
-Strong knowledge of the Linux OS and familiarity with Linux shell programming
-Experience with Docker and Kubernetes
-Accountable and responsible attitude in daily work
-Able to handle tasks and resolve problems independently
-Good oral and written communication skills in English
-Strong interpersonal and networking skills, with the ability to relate to executives and other team members at all organizational levels
-Experience with GPUs is a big plus
-Familiarity with building CI/CD processes, especially Jenkins-based systems, is a plus
-Agile project experience is a plus