ABOUT US (AND OUR EXCITING FUTURE):
We pioneered cloud logistics with our on-demand fulfillment center and shipping tools, Web services, innovative developer tools and integration with today's top e-commerce platforms. Our award-winning platform provides on-demand e-commerce fulfillment and supply chain management from a network of global pick-pack and ship warehouses in the U.S., Canada, Europe and Asia to more than 1,000 emerging brands and web retailers.
It's an incredibly exciting time for our organization. In December 2013 we were acquired by Ingram Micro Inc. (NYSE: IM), a Fortune 100 company who is also the world's largest wholesale technology distributor and a global leader in supply-chain and mobile device lifecycle services. This will allow us to accelerate our platform (much more quickly than we could have as a stand-alone company) in a $40 billion market for e-commerce fulfillment services that is estimated to be growing by double-digits through 2015. The full details are publicly available here: http://goo.gl/mSHCjc and on our blog
As a stand-alone subsidiary of Ingram Micro in the Silicon Valley we will be continuing our start-up culture and maintaining our focus on revolutionizing the supply chain in favor of emerging brands. Through Ingram we have access to global facilities and distribution strength that will accelerate our innovation. Our charter is to be visionary and define the future of this market.
Exciting? We think so.
ABOUT THE OPPORTUNITY:
As the Senior Site Reliability Engineer (SRE) at ShipWire you will be working to improve the reliability and performance of our platform and family of services. You will work shoulder-to-shoulder with our engineering teams to design and build the next generation of web applications and systems infrastructure, focusing on automation, availability, scalability and performance. A thorough understanding of System administration is a must (we have a 30 server deployment that we'll double again next year), and specific experience with Linux is required.
YOUR RESPONSIBILITIES COULD CONSIST OF:
- Work closely with engineering team helping to build, maintain and extend a platform family of services that can serve millions of orders a day.
- Be the Senior Representative of the Operations Team for contributions in new and ongoing technology projects; Performance, High Availability and Scalability including partitioning, sharding (Mongodb, Redis, etc.), dynamic provisioning and de-provisioning of systems for current load, etc.
- Review entire environment and execute initiatives to reduce failures, defects and improving overall performance.
- Design, develop and execute automated tests to validate solutions and environments.
- Troubleshoot issues across the entire stack - hardware, software, application and network.
- Document current and future configuration processes and policies.
- Perform troubleshooting analysis and implement fixes to ensure availability SLAs are met.
- Take part in a 24x7 on-call rotation.
OUR CURRENT STACK:
We run single-page backbone.js-based apps, on top of a set of RESTful APIs while the back end is built in Modern PHP (clean, object-oriented, namespaced) and adjoining components like PHPnginx/php-fpm, Percona server, redis and gearman. All performing in a cloud/auto-scaling environment.
REQUIRED QUALIFICATIONS FOR THE POSITION:
- Experience with web server configuration, monitoring, trending, network design and high availability.
- Command of your favorite scripting language: Python, Perl, Ruby, Bash, Java, C++, Powershell, etc. to automate tasks and gather data.
- At least 3 years of experience with Linux systems administration (we use CentOS)
- Excellent verbal and written communication skills; including documentation.
- 3+ years of hands on operational experience in a high-volume or critical production service environment.
- Familiarity with systems management tools (Puppet, Chef, Capistrano, etc).
- Require limited supervision and direction; drive results and set priorities independently.
- Ability to handle multiple complex tasks, with tight deadlines concurrently.
- Hands on operational experience in a high-volume or critical production service environment.
- Experience with any enterprise monitoring systems like Nagios or Systems Center is highly desired as well as working with Vendors who assist us in this area.
DESIRED QUALIFICATIONS:
- Jenkins for deployment automation.
- Virtualizaiton (Vmware, etc.) is a plus.
- Some familiarity with mysql, mongodb, etc.