Kaidee ซื้อ-ขายของออนไลน์ได้หลากหลายที่สุด เรามุ่งมั่นพัฒนาแพลตฟอร์ม ให้เป็นแหล่งซื้อ-ขายสินค้าออนไลน์ ทั้งมือหนึ่งและมือสองที่ใช้ง่าย และมีสินค้าที่หลากหลายที่สุดสำหรับคนไทย ไม่ว่าจะเป็น สินค้าแฟชั่น เครื่องใช้ไฟฟ้า มือถือ เฟอร์นิเจอร์ อุปกรณ์กีฬาไปจนถึงรถยนต์ มอเตอร์ไซค์ บ้าน ที่ดิน พระเครื่อง ก็จบการซื้อ-ขายได้ในที่เดียว

เป้าหมายของเราคือการเป็นส่วนหนึ่งในการช่วยยกระดับคุณภาพชีวิต ด้วยการช่วยให้ผู้ซื้อและผู้ขายมาพบปะกัน เพื่อแลกเปลี่ยนสินค้าในราคาที่ทั้งผู้ซื้อและผู้ขายพึงพอใจที่สุด

Hope is not a strategy. Engineering solutions to design, build, and maintain efficient large-scale systems is a true strategy, and a good one.

Site Reliability Engineering (SRE) is an engineering discipline that combines software and systems engineering to build and run large-scale, massively distributed, fault-tolerant systems by implementing DevOps to develop tools and automation for toil reduction.

SRE ensures that Kaidee’s services—both our internally critical and our externally-visible systems—have reliability and uptime appropriate to users’ needs and a fast rate of improvement while keeping an ever-watchful eye on capacity and performance.

Our mission is to protect and provide a flexible, fast and stable infrastructure platform to achieve business goals in an effective way for customer satisfaction and our believed culture.

To learn more: check out Site Reliability Engineering, written by Google SREs.

Responsibilities

  • Deploy, upgrade, operate/maintain, and scale our suite of mission-critical products and services
  • Closely collaborate with Software Engineers to create highly operable and maintainable products
  • Manage the underlying infrastructure
  • Maintain services once they are live by measuring and monitoring availability, latency and overall system health.
  • Engage in and improve the whole lifecycle of services — from inception and design, through deployment, operation, and refinement
  • Practice sustainable incident response and blameless postmortems
  • Provide end-user support to engineering for products

Basic Qualifications

  • Understanding of Ansible, or other automation frameworks
  • Automation skills in shell bash, Python, and/or other languages
  • Experience with source code and version control tools such as Subversion or Git
  • This individual will also provide “On Call” support on a scheduled rotation or may be required to work a shift that provides operational support on Saturday and Sunday.

Preferred Skills and Experience

  • Strong understanding of Docker, Kubernetes or similar technologies
  • Strong understanding of cloud platform technologies
  • Strong understanding of networking knowledge of TCP/IP
  • Understanding of Continuous Integration and Continuous Delivery with measurement.
  • Understanding of databases and data modeling
  • Understanding of monitoring, logging and tracing for helping our team to find the root cause
  • Ability to automate routine tasks
  • Ability to working as a team and be a leader in the transformation process work better
  • Experience with Unix/Linux operating systems internals (e.g., filesystems, system calls) and administration or networking (e.g., routing).
  • Experience with automatically managing dozens or hundreds of servers
  • Experience with workflow and issue management tools such as JIRA, Markdown
  • Systematic problem-solving approach, coupled with effective communication skills and a sense of urgency appropriate to the responsibilities
  • Be growth mindset and comfort to learn, coach or teach others
  • (Bonus) Experience in Python-based software development
  • (Bonus) Understand and able to test basic security

Welfare & Benefits

  • Flexible hours
  • Town Hall & Happy Friday (Food & Drink provided)
  • Free Lunch every day
  • Training in-house & Abroad
  • Provident Fund (5 or 10%)
  • Life / Accident / Disability / Health Insurance
  • Dental
  • Vacation leave & Birthday leave

Apply Here