GoodUnited is a fundraising software that helps nonprofits harness the power of Social Networks, offering a solution that simplifies lead generation, automates supporter engagement, and maximizes fundraising revenue.
Location: Medellin, Colombia (Remote)
Position Type: Full time
GoodUnited is at the forefront of digital innovation for the nonprofit sector, and we're looking for a Site Reliability Engineer (SRE) who is also skilled in DevOps tasks to join our team. This role is an exceptional opportunity for someone who is passionate about maintaining and improving systems reliability while also embracing the challenges of DevOps in a fast-growing company that is committed to making a significant impact in the nonprofit world.
Our SRE will be central to ensuring both the reliability and operational efficiency of our platform. As an SRE with DevOps responsibilities, you'll focus on maintaining a high level of system availability and performance, while also playing a crucial role in developing and implementing CI/CD pipelines, automating processes, and collaborating with the software development team.
What does a Site Reliability Engineer (SRE) at GoodUnited do?
- Develop and maintain a robust monitoring and alerting system to ensure high availability and performance of our services.
- Lead efforts in automating and streamlining our operational processes to improve deployment cycles and system stability.
- Work closely with development teams to integrate infrastructure management with the software development lifecycle.
- Tackle system issues related to scalability and performance, ensuring optimal functioning of our applications.
- Continuously assess and incorporate new tools and technologies to enhance our operational capabilities.
OUTCOMES:
- A highly reliable and scalable platform, directly contributing to the success of our nonprofit clients.
- Efficient and effective deployment processes that support rapid and frequent software releases.
- Enhanced system stability and reduced downtime, ensuring a seamless user experience for our clients.
What experience and skills does a Site Reliability Engineer (SRE) need in order to be successful?
- 3-5 years of experience in an SRE role with additional experience in DevOps.
- Proficiency in cloud infrastructure (AWS, Azure, Google Cloud), containerization (Docker, Kubernetes), and infrastructure as code (Terraform, CloudFormation).
- Strong experience with monitoring and alerting tools (e.g., Prometheus, Grafana, New Relic).
- In-depth knowledge of network systems, databases, and scripting languages.
- Bachelor’s degree in Computer Science, Information Technology, or a related field is preferred.
- Technical Expertise: You have a solid background in system administration, network engineering, and software development, with a particular focus on reliability and scalability.
- DevOps Proponent: You understand the importance of collaboration between development and operations, and you're skilled in CI/CD practices and tools.
- Innovative Problem-Solver: You're not only adept at identifying and solving complex technical issues but also proactive in preventing them.
- Efficiency-Focused: You're driven to automate and optimize processes to enhance system performance and reliability.
- Adaptive and Resilient: You thrive in fast-paced environments and are capable of handling multiple priorities with resilience and flexibility.
What We Offer:
- A challenging and rewarding role in a dynamic, international environment.
- Competitive salary and benefits package.
- Opportunities for professional growth and development.
- A supportive and collaborative team culture.
Application Process:
Interested candidates are invited to submit a resume and a cover letter outlining their qualifications and experience.
We are an equal-opportunity employer and value diversity in our team. We encourage applications from all qualified individuals, regardless of their background.