Principal Cloud Operations Architect

Service Management Group
Remote, CA
Remote
Full-time

As a Principal Cloud Operations Architect, you will provide expert-level support and architecture guidance for Azure cloud solutions, ensuring stability, security, and efficiency.

You will lead troubleshooting, performance tuning, and network / data infrastructure optimizations, with a focus on system health, security, and compliance.

Why work at SMG?

SMG is a leading experience management (XM) provider, serving restaurants, retailers, and other location-centric consumer businesses by changing how brands act on customer + employee insights.

With a rich 30-year history, SMG is uniquely pairing an enterprise software platform with professional services to help brands generate new revenue, grow existing revenue, reduce detractors, and drive operational efficiencies.

We offer our talent -

Work hard, have fun environment - We work hard to deliver a fulfilling, exciting workplace environment for each SMG employee.

Our teams are composed of smart, talented, curious people who love a good challenge.

  • Ample opportunities to learn and grow.
  • Fully Remote, Contract or Fulltime position
  • Unlimited PTO
  • Diverse, experienced, friendly team which will welcome you, support you and challenge you.
  • We are proud to be an equal opportunity employer. We celebrate diversity and create an inclusive work environment in which all our colleagues experience belonging, have their unique needs respected and met, have equal access to opportunities and resources, and feel fully engaged to contribute to the company’s success.

As a Principal Cloud Operations Architect, this is what you will do :

  • Cloud Platform Support : Provide expert-level support for the Azure cloud platform, focusing on the stability, security, and optimization of network and data solutions.
  • Troubleshooting and Incident Resolution : Lead the identification, troubleshooting, and resolution of cloud infrastructure and service issues, minimizing downtime and operational impact.
  • Performance Monitoring & Analysis : Implement and manage robust monitoring solutions (Azure Monitor, Log Analytics) to proactively identify and resolve potential performance bottlenecks or security issues.
  • Optimization of Cloud Resources : Regularly review and optimize cloud infrastructure and services, focusing on performance tuning, cost-efficiency, and reliability in network and data environments.
  • Root Cause Analysis (RCA) : Conduct detailed post-incident analysis to determine root causes and prevent future occurrences through architectural adjustments or new processes.
  • System Health Management : Continuously monitor system health and performance, providing solutions to maintain uptime, optimize data workflows, and improve network reliability.
  • Network Infrastructure Support : Support the design and maintenance of the full stack of Azure network infrastructure resources and the existing F5 implementation.
  • Data Infrastructure Support : Work closely with Engineering to support the design and maintenance of legacy and new data platforms to ensure seamless and secure operations.
  • Security & Compliance Oversight : Working closely with the Security Prime - ensure adherence to security and compliance best practices, responding to security incidents, and performing ongoing audits to maintain regulatory compliance (GDPR, HIPAA).
  • Automation of Troubleshooting and Optimization Processes : Develop and implement automation scripts and tools to streamline repetitive support tasks and optimize cloud performance.
  • Collaboration with Cross-Functional Teams : Work closely with Engineering, DevOps, and Operations teams to evolve operational processes and improve cloud solutions.

You are a perfect match for the role if you have :

  • Bachelor’s or Master’s Degree in Computer Science, Information Technology, or a related field.
  • Azure Cloud Experience : 7+ years of experience in designing, supporting, and optimizing Azure cloud environments, with a specific focus on troubleshooting and system performance.
  • Networking Specialization : In-depth experience with Azure networking components such as VWan, VHub, VNet, Subnets, VPNs, ExpressRoute, Traffic Manager, Firewalls, NSG’s, Private Endpoints, Load Balancers, Application Gateways etc.

Exposure to or F5 BigIP LTM and Shape.

  • Data Platform Expertise : Strong working knowledge of legacy (Hadoop, Mongo, SQL) and (Managed SQL Instances, Azure SQL, Snowflake) with hands-on experience in troubleshooting data workflows.
  • Cloud Support & Troubleshooting : Proven experience in managing and resolving operational issues in Azure environments, with strong troubleshooting skills across networking, compute, and data services.
  • Root Cause Analysis Expertise : Experience in conducting root cause analysis and implementing system optimizations to prevent future incidents.
  • Cloud Automation & Scripting : Hands-on experience with automation tools (PowerShell, ARM templates, Terraform, Bicep) to streamline cloud operations and improve system efficiency.

Technical Skills :

  • Cloud Support : Expert-level troubleshooting and support for Azure cloud environments, ensuring minimal downtime and operational disruptions.
  • Network & Data Infrastructure : Proficient in supporting and optimizing both networking and data platforms within Azure.
  • Performance Monitoring Tools : Familiarity with Azure-native monitoring tools (Azure Monitor, Log Analytics) to support system health and optimization efforts.
  • Security and Compliance : Knowledge of compliance and security practices in cloud environments, including GDPR, HIPAA, and other relevant regulations.
  • Automation of Cloud Tasks : Proficiency in automating support and optimization tasks using PowerShell, ARM templates, Terraform and Bicep.
  • 9 days ago
Related jobs
Promoted
VirtualVocations
Santa Clara, California

A company is looking for a Principal Cloud Operations Architect to provide expert-level support and architecture guidance for Azure cloud solutions. ...

Promoted
Apple
Cupertino, California

We are seeking an extraordinary Principal Cloud Security Architect who is passionate about cloud security and can thrive in a fast- paced environment where both individual drive and team collaboration are the keys to success. Deep understanding of the security models of cloud providers and cloud nat...

Promoted
VirtualVocations
Norwalk, California

A company is looking for a Master Principal GPU/HPC Cloud Architect. ...

Promoted
MKS Instruments
Irvine, California

You will be responsible for developing and implementing cloud security strategies, architecting secure cloud solutions, and providing guidance to ensure that our cloud environments are protected against emerging cyber threats. Design secure cloud architectures and solutions for various cloud platfor...

Promoted
VirtualVocations
Norwalk, California

A company is looking for a Principal AI/ML/HPC Cloud Architect. ...

Service Management Group
Remote, CA
Remote

As a Principal Cloud Operations Architect, you will provide expert-level support and architecture guidance for Azure cloud solutions, ensuring stability, security, and efficiency. Principal Cloud Operations Architect,. Work closely with Engineering, DevOps, and Operations teams to evolve operational...

arm limited
San Jose, California

Cloud Solutions Architect will define requirements and strategy to drive growth and success of Arm for key cloud and data-center applications. Architect cloud solutions that bring together Arm’s products, technologies, software, and ecosystem and provide best-in-class performance, power efficiency a...

eBay
San Jose, California

Work closely with network and systems engineers to build the next generation of eBay’s private cloud infrastructure and provide technical mentorship and leadership in the adoption of containerization and cloud-native technologies within the private cloud and optimize resource utilization and cost ef...

Monster Energy
Corona, California

This is an experienced supply chain and operations professional with a deep understanding of best business practices along with major system solutions implementations. Identify, evaluate, select, and implement AI applications and solutions across operations organizations. Actively engage with operat...

Tencent
Palo Alto, California

With over 20 years of research and experience in audio and video technology, Tencent Cloud launched Tencent Cloud Media Service, a new international audio and video brand, in 2022. Support the regional architecture team in analyzing customers' media business technology architecture and identifying t...