Title : Site Reliability Engineer (SRE) (Automation & Scheduling)
Location : Fully Remote (CST hours) - open to tier 2 / 3 markets (e.g., Omaha, Kansas, etc.)
Duration : 6 Months Contract to Hire
Interview Process
3 rounds total : Hiring Manager
- Director of Back Office Systems
- Team Member
Seeking a Site Reliability Engineer Automation & Scheduling to lead efforts in automation, observability, and platform reliability across our enterprise job orchestration and data systems. This hands-on role is ideal for a technically skilled individual with strong scripting abilities, a drive for continuous improvement, and direct experience with modern automation technologies.
You will take ownership of complex scheduling workflows, improve system resiliency, reduce operational overhead through scripting and intelligent automation, and support the reliability of critical platforms like AppWorx and Power BI.
Key Responsibilities :
Job Scheduling & Automation :
Build and maintain scripts (PowerShell, Python, or Bash) to manage over 2,000 scheduled jobs, improve efficiency, and reduce manual interventionEnhance monitoring, alerting, and observability to detect issues early and maintain high system availabilityLead root cause analysis and implement preventative measures for job failures and outagesOperational Innovation :
Prototype and test automation and orchestration tools in isolated or lab environments, including the use of agent-based systems or orchestration frameworksApply Agentic AI or RPA solutions to operational use cases to drive down toil and increase responsivenessCollaborate with IT teams to optimize system capacity, improve resiliency, and modernize legacy scheduling patternsData Platform Reliability (Power BI / Microsoft Fabric) :
Support and administer data platform services such as Power BI Gateway and automated data refresh pipelines, with an emphasis on platform reliability and operational efficiencyTroubleshoot and resolve data refresh failures, optimize refresh cycles, and contribute to system observabilityIdentify and implement automation opportunities across evolving Microsoft Fabric components (e.g., Data Pipelines, Lakehouse, Real-Time Analytics), adapting responsibilities as platform capabilities expandContribute to monitoring and deployment improvements across the data ecosystem using scripting and automation toolsDocumentation & Collaboration :
Document job dependencies, workflows, and operational runbooks with clarity and rigorUse tools like ServiceNow, LeanIX, Jira, and Asana to ensure job metadata and support documentation remain currentPartner with application owners and infrastructure teams to align job execution with business needsQualifications :
Proficiency in PowerShell, Python, and Bash, with a focus on systems automation and scripting best practicesExperience managing enterprise job scheduling systems (AppWorx or similar) with attention to reliability and maintainabilityHands-on experience experimenting with Agentic AI or RPA platforms to automate operational workflows is strongly preferred. Candidates should be able to describe proof-of-concept efforts or prototype use cases they've built, even if in non-production environmentsFamiliarity with system monitoring and alerting tools; ability to build custom checks and observability dashboardsStrong documentation habits and ability to model complex dependencies and recovery stepsPower BI administration experience, including gateway and refresh managementBachelor's degree in IT, Computer Science, or related field3 5 years of experience in systems engineering, site reliability, or automation operationsITSM / ITIL process understanding preferred; ServiceNow experience a plusKey Attributes for Success :
Strong ownership mindset with a proactive approach to solving reliability issuesEagerness to learn and experiment with new tools, frameworks, and techniquesAbility to thrive in a fast-paced environment with shifting prioritiesEffective communicator who can translate technical findings into actionable plansFocused on outcomes, not effort; continuously looking to simplify and improveTechnical Must-Haves :
PowerShell, Python, Bash (scripting experience)
AppWorx (Broadcom) strongly preferred - niche skill, especially in education verticals for scheduling & automationSQL (working knowledge)Power BIAutomation & scheduling backgroundExposure to Microsoft-heavy environments, agentic AI / Power Automate a plusSoft Skills :Strong communication skills (must be very clear)
Curiosity, drive, adaptabilityAble to thrive in fast-paced, growing environmentThanks & Regards
Kartik Sharma
Recruitment Lead
Email : Kartik@kanakits.com
LinkedIn : Karthik Sharma | LinkedIn
Kanak IT is an equal opportunity employer. We consider all applicants for employment without regard to citizenship, immigration status, race, gender, disability, or any other protected category.
We respect your Online Privacy. This is not an unsolicited mail, If you are not interested in receiving our e-mails then please reply with a "REMOVE" in the subject to support@kanakelite.com and mention all the e-mail addresses to be removed with any e-mail addresses, which might be diverting the e-mail to you.