from Hacker News

Ask HN: If Kubernetes is the solution, why are there so many DevOps jobs?

by picozeta on 6/1/22, 12:19 PM with 416 comments

Arguable the goals of DevOps align partly with the goals of system administrators in former days: Provide reliable compute infrastructure for

  1) internal users: mainly developers by providing CI/CD
  2) external users: end users

Nowadays we call people that do 1) DevOps and people that do 2) SREs (so one could argue that the role of sys admins just got more specialized).

The platform of choice is mostly Kubernetes these days, which promises among other things stuff like

  - load balancing
  - self-healing
  - rollbacks/rollouts
  - config management

Before the cloud days, this stuff has been implemented using a conglomerate of different software and shell scripts, issued at dedicated "pet" servers.

In particular, a main critic is "state" and the possibility to change that state by e.g. messing with config files via SSH, which makes running and maintaining these servers more error-prone.

However, my main question is:

"If this old way of doing things is so error-prone, and it's easier to use declarative solutions like Kubernetes, why does the solution seem to need sooo much work that the role of DevOps seems to dominate IT related job boards? Shouldn't Kubernetes reduce the workload and need less men power?"

Don't get me wrong, the old way does indeed look messy, I am just wondering why there is a need for so much dev ops nowadays ...

Thanks for your answers.

by jmillikin on 6/1/22, 12:39 PM
```
  >   1) internal users: mainly developers by providing CI/CD
  >   2) external users: end users
  >
  > Nowadays we call people that do 1) DevOps and people that do
  > 2) SREs (so one could argue that the role of sys admins just
  > got more specialized).
```
Both are called sysadmins.
SRE is a specialized software engineering role -- you'd hire SREs if you wanted to create something like Kubernetes in-house, or do extensive customization of an existing solution. If you hire an SRE to do sysadmin work, they'll be bored and you'll be drastically overpaying.
DevOps is the idea that there shouldn't be separate "dev" and "ops" organizations, but instead that operational load of running in-house software should be borne primarily by the developers of that software. DevOps can be considered in the same category as Scrum or Agile, a way of organizing the distribution and prioritization of tasks between members of an engineering org.
---
With this in mind, the question could be reframed as: if projects such as Kubernetes are changing the nature of sysadmin work, why has that caused more sysadmin jobs to exist?
I think a general answer is that it's reduced the cost associated with running distributed software, so there are more niches where hiring someone to babysit a few hundred VMs is profitable compared to a team of mainframe operators.
by binarymax on 6/1/22, 12:58 PM
Kubernetes is a Google scale solution. Lots of teams said “hey if Google does it then it must be good!”…but forgot that they didn’t have the scale. It caught on so much that for whatever reason it’s now the horrendous default. I’ve worked on at least 3 consulting projects that incorporated K8s and it slowed everything down and took way too much time, and we got nothing in return - because those projects only needed several instances, and not dozens or hundreds.
If you need less than 8 instances to do host your product, run far away anytime anyone mentions k8s
by mancerayder on 6/1/22, 2:58 PM
Threads like the below are why DevOps jobs exist and why Kubernetes infrastructure skills pay so much and why there's such a large demand.
Yes, it's quite complicated.
No, an API to control a managed EKS/GCK cluster + terraform + Jenkins/Azure DevOps/etc. does not mean that magically the developer can 'just deploy' and infrastructure jobs are obsoleted. That's old AWS marketing nonsense predating Kubernetes.
There's a whole maintenance of the CI/CD factory and its ever demanding new requirements around performance, around Infosec requirements, around scale, and around whatever unique business requirements throw a wrench in the operation.
Sticking to ECS I guess is a valid point. What Kubernetes gives you is a more sophisticated highly available environment built for integration (Helm charts and operators and setups that when they work give you more levers to control resources allocations, separations of app environments, etc.)
And as an aside, I've been doing this for 20 years and long before Kubernetes, before Docker, hell, before VMs were used widely in production, I observed the developer mindset: Oh but it's so easy, just do X. Here, let me do it. Fast forward a year of complexity later, you start hiring staff to manage the mess, the insane tech debt the developers made unwittingly, and you realize managing infrastructure is an art and a full time job.
A story that is visible with many startups that suddenly need to make their first DevOps hire, who in turn inherit a vast amount of tech debt and security nightmares.
Get out of here with, it's just API calls. DevOps jobs aren't going away. It's just the DevOps folks doing those API calls now.
by mynameisash on 6/1/22, 11:15 PM
Reading the comments here validates my experience. When K8s was pitched as a way to make this all run smoothly, I thought, "Great! I'll write my code, specify what gets deployed and how many times, and it'll Just Work(tm)." I built a service which had one driver node and three workers. Nothing big. It deployed Dask to parallelize some compute. The workload was typically ~30 seconds of burst compute with some pretty minor data transfer between pods. Really straightforward, IMO.
Holy smokes, did that thing blow up. A pod would go down, get stuck in some weird state (I don't recall what anymore), and K8s would spin a new one up. Okay, so it was running, but with ever-increasing zombie pods. Whatever. Then one pod would get in such a bad state that I had to nuke all pods. Fortunately, K8s was always able to re-create them once I deleted them. But I was literally deleting all my pods maybe six or seven times per day in order to keep the service up.
Ultimately, I rewrote the whole thing with a simplified architecture, and I vowed to keep clear of K8s for as long as possible. What a mess.
by jeffwask on 6/1/22, 1:39 PM
First. DevOps is a culture not a job most places have so many DevOps roles because they are doing it wrong.
In the olden days of 10 years ago, most operations teams worked around the clock to service the application. Like every day there would be someone on my team doing something after hours usually multiple. Tools like Kubernettes, Cloud (AWS, GCP, Azure) have added significant complexity but moved operations to more of a 9 to 5 gig. Less and less do I see after hours deployments, weekend migrations, etc. Even alert fatigue goes way down because things are self healing. This is on top of being able to move faster and safer, scale instantly, and everything else.
Operations side used to be a lot of generalists admin types and DBA's. With today's environment, you need a lot more experts. AWS alone has 1 trillion services and 2.4 billion of those are just different ways to deploy containers. So you see a lot more back end roles because it's no longer automate spinning up a couple servers, install some software, deploy, monitor and update. It's a myriad of complex services working together in an ephemeral environment that no one person understands anymore.
by etruong42 on 6/1/22, 12:52 PM
New technology sometimes creates more work even though it makes the previous work easier. When the electronic spreadsheet was introduced in the 1980s, even though it made accountants more productive, the number of accountants GREW after the electronic spreadsheet was introduced. Sure, one accountant with an electronic spreadsheet could probably do the work of 10 or 100 accountants who didn't have the electronic spreadsheet, but accounting become so efficient that so many more firms wanted accountants.
"since 1980, right around the time the electronic spreadsheet came out, 400,000 bookkeeping and accounting clerk jobs have gone away. But 600,000 accounting jobs have been added." Planet Money, May 17, 2017, Episode 606: Spreadsheets!
by devonkim on 6/1/22, 8:26 PM
Kubernetes in a sense is very similar to Linux back in the 2000s - it was nascent technology in a hot market that was still absolutely evolving. The difference now is that everyone knows the battle for the next tier of the platform is where people will be able to sell their value (look at RedHat selling to IBM for the saddled legacy of maintaining an OS as a tough growth proposition). For a while people thought that Hadoop would be the platform but it never grew to serve a big enough group's needs back in 2013-ish and coupled with the headaches of configuration management containerization hit and it's now combined at the intersection of OS, virtualization, CI, and every other thing people run applications on in general. It may be the most disruptive thing to our industry overall since the advent of Linux in this respect (people thought virtualization was it for a while and it's shown to have been minor comparatively).
A lot of this stuff really is trying to address the core problem we've had for a long time that probably won't ever end - "works fine on my computer."
by majewsky on 6/1/22, 12:38 PM
In my opinion, the main benefit of Kubernetes for large companies is that it allows for a cleaner separation of roles. It's easier to have a network team that's fully separate from a storage team that's fully separate from a compute team that's fully separate from an application development team because they all work around the API boundaries that Kubernetes defines.
That's valuable because, on the scale of large companies, it's much easier to hire "a network expert" or "a storage expert" or even "a Gatekeeper policy writing expert" than to hire a jack of all trades that can do all of these things reasonably well.
The corollary from this observation is that Kubernetes makes much less sense when you're operating at a start-up scale where you need jacks of all trades anyway. If you have a team of, say, 5 people doing everything from OS level to database to web application at once, you won't gain much from the abstractions that Kubernetes introduces, and the little that you gain will probably be outweighed by the cost of the complexities that lurk behind these abstractions.
by moshloop on 6/1/22, 2:02 PM
High Availability, Scalability, Deployments, etc are NOT the goal of Kubernetes, they are features that are not exclusive to Kubernetes, nor is Kubernetes necessarily better at them then others.
The goal of Kubernetes is to improve the portability of people by introducing abstraction layers at the infrastructure layer - These abstractions can seem overly complex, but they are essential to meet the needs of all users (developers, operators, cloud providers, etc)
Before kubernetes in order for a developer to deploy an application they would need to (send email, create terraform/cloudformation, run some commands, create ticket for loadbalancer team, etc) - these steps would rarely be same between companies or even between different teams in the same company.
After kubernetes you write a Deployment spec, and knowing how to write a deployment spec is portable to the next job. Sure there are many tools that introduce opinionated workflows over the essentially verbose configuration of base Kubernetes objects, and yes your next job may not use them, but understanding the building blocks, still make it faster than if every new company / team did everything completely differently.
If you only have a single team/application with limited employee churn - then the benefits may not outweigh the increased complexity.
by habitue on 6/1/22, 6:03 PM
The thing you're noticing is the usual thing that happens when new labor saving technology is invented:
1. What people expect: less work needs to be done to get what you had before.
2. What people don't expect: more is expected because what used to be hard is now simple
So while it may have taken a few weeks to set up a pet server before and as a stretch goal you may have made your app resilient to failures with backoff retry loops etc. Now that's a trivial feature of the infrastructure, and you get monitoring with a quick helm deploy. The problems haven't disappeared, you're just operating on a different level of problems now. Now you have to worry about cascading failures, optimizing autoscaling to save money. You are optimizing your node groups to ensure your workloads have enough slack per machine to handle bursts of activity, but not so much slack that most of your capacity is wasted idling.
Meanwhile, your developers are building applications that are more complex because the capabilities are greater. They have worker queues that are designed to run on cheap spot instances. Your CI pipelines now do automatic rollouts, whereas before you used to hold back releases for 3 months because deploying was such a pain.
Fundamentally, what happens when your tools get better is you realize how badly things were being done before and your ambition increases.
by rconti on 6/1/22, 11:58 PM
Because everything has gotten bigger and more complicated.
It's like asking "if the computer saves us all so much work, why do we have more people building computers than we ever had building typewriters"?
Something can "save labor" and still consume more labor in aggregate due to growth.
by jljljl on 6/1/22, 4:38 PM
I think this Kelsey Hightower quote has summarized my experience working with Kubernetes:
> Kubernetes is a platform for building platforms. It's a better place to start; not the endgame.
https://twitter.com/kelseyhightower/status/93525292372179353...
Everywhere I've worked, having developers use and develop Kubernetes directly has been really challenging -- there's a lot of extra concepts, config files, and infrastructure you have to manage to do something basic, so Infra teams spend a lot of resources developing frameworks to reduce developer workloads.
The benefits of Kubernetes for scalability and fault tolerance are definitely worth the cost for growing companies, but it requires a lot of effort, and it's easy to get wrong.
Shameless plug: I recently cofounded https://www.jetpack.io/ to try and build a better platform on Kubernetes. If you're interested in trying it out, you can sign up on our website or email us at `demo [at] jetpack.io`.
by zelphirkalt on 6/1/22, 12:58 PM
The short answer is: Because of Kubernetes.
The longer answer is: When you switch to Kubernetes, you are introducing _a lot_ of complexity, which, depending on your actual project, might not be inherent complexity. Yes, you get a shiny tool, but you also get a lot of more things to think about and to manage, to run that cluster, which in turn will require, that you get more devops on board.
Sure, there might be projects out there, where Kubernetes is the right solution, but before you switch to it, have a real long hard thinking about that and definitely explore simpler alternatives. It is not like Kubernetes is the only game in town. It is also not like Google invents any wheels with Kubernetes.
Not everyone is Google or Facebook or whatever. We need to stop adopting solutions just because they get hyped and used at big company. We need to look more at our real needs and avoid introducing unnecessary complexity.
by caymanjim on 6/1/22, 12:48 PM
The premise of your question is invalid. Have you ever tried setting up a Kubernetes cluster and deploying apps in it? Kubernetes doesn't save work, it adds work. In return, you get a lot of benefits, but it wasn't designed to reduce human work, nor was it designed to eliminate devops jobs. It was designed for scalability and availability more than anything. Most people using Kubernetes should be using something simpler, but that's a separate problem.
by bombcar on 6/1/22, 12:31 PM
Because many many companies herd pets using kubernets.
The number of single-server setups with kubernetes thrown in for added complexity and buzzwords I’ve found is way too dang high.
by MrBuddyCasino on 6/1/22, 12:36 PM
> "If this old way of doing things is so error-prone, and it's easier to use declarative solutions like Kubernetes, why does the solution seem to need sooo much work that the role of DevOps seems to dominate IT related job boards? Shouldn't Kubernetes reduce the workload and need less men power?"
Because we're living in the stone age of DevOps. Feedback cycles take ages, languages are not typed and error prone, pipelines cannot be tested locally, and the field is evolving rapidly like FE javascript did for many years. Also I have a suspicion that the mindset of the average DevOps person has some resistance to actually using code, instead of yaml monstrosities.
There is light at the tunnel though:
- Pulumi (Terraform but with Code)
- dagger.io (modern CI/CD pipelines)
Or maybe the future is something like ReplIt, where you don't have to care about any of that stuff (AWS Lambdas suck btw).
by rahen on 6/1/22, 1:21 PM
Kubernetes can really help bringing more scalability.
All you need is to rewrite your application (think microservices), reduce cold latency (get rid of anything VM based such as Java, or rewrite in Spring or Quarkus), use asynchronous RPC, and decouple compute and storage.
Then you need an elastic platform, for instance Kubernetes, with all the glue around such as Istio, and Prometheus, and Fluentd, and Grafana, Jaeger, Harbor, Jenkins, maybe Vault and Spinnaker.
Then you can finally have your production finely elastic, which 90% of companies do not need. Microservices are less performant, costlier, and harder to develop than n-tiers applications and monoliths, and way harder to debug. They're just better at handling surges and fast scaling.
If what you want is:
- automated, predictable deployments
- stateless, declarative workloads
- something easy to scale
Then Docker Compose and Terraform is all you need.
If you also need orchestration and containers are your goal, then first try Docker Swarm. If you need to orchestrate various loads and containers are a mean and not a goal, then try Nomad.
Finally, if you will need most resources Kubernetes has to offer (kubectl api-resources), then yes, opt for it. Few companies actually have a need for the whole package, yet they have to support its full operational cost.
Most companies just pile up layers, then add yet a few more (Java VMs on top of containers on top of an orchestrator on top of x86 VMs on top of(...)), and barely notice the miserable efficiency of the whole stack. Well it's using Kubernetes, it's now "modernized".
by tapoxi on 6/1/22, 12:48 PM
From my experience, Kubernetes drastically reduces the number of DevOps people required. My current place has a team of 5, compared to a similarly sized, vmware-centric place I worked at a decade ago with a team of 14.
But DevOps means many things because it's not clearly defined, which also makes it difficult to hire for. It's a "jack-of-all-trades" role that people somehow fell into and decided to do instead of more traditional software engineering.
Also, from what I've experienced from our internship program, CS programs are really bad at covering these fundamentals. Students aren't learning such basics as version control, ci/cd, cloud platforms, linux, etc.
by oxplot on 6/1/22, 2:18 PM
Someone put it nicely when they said Kubernetes is like an operating system for containers. If you take linux as an analogy, it's clearly a non-trivial investment to learn linux and learn enough to be effective and efficient in it. Further time perhaps needed to achieve the productivity, functionality and performance of what you were used to on Mac or Windows.
Kubernetes definitely achieves this goal well, and in a relatively portable way. But just like any other engineering decision, you should evaluate the trade offs of learning a completely new OS just to get a simple web site up, versus running a nginx instance with bunch of cgi scripts.
by jedberg on 6/1/22, 10:04 PM
DevOps is a philosophy, not a job role. It's the idea that developers deploy and operate their own code. An SRE is often someone who helps make that happen, by building the tools necessary for developers to operate their own code.
In a small organization, you can get away with a sysadmin running a Kubernetes cluster to enable that. In a larger org you'll need SREs as well as Operations Engineers to build and maintain the tools you need to enable the engineers.
by whalesalad on 6/1/22, 1:24 PM
Kubernetes is raw material, like concrete and lumber. It needs to be massaged/crafted/assembled into something that fits the use case. A 'devops' engineer would leverage Kube to build a system, the same way a builder/contractor would leverage raw materials, subcontractors, off the shelf components, etc to build a home or office.
by codegeek on 6/1/22, 2:10 PM
Few reasons :
- Kubernetes is very complex to setup
- It is not needed for many use cases
- It is (hopefully) not the defacto and standard for devops
- Load Balancing is already a solved problem way before Kubernetes. For many use cases, you don't need the complexity. Even things like Self Healing are kinda solved by AWS Auto Scaling for example.
- NOt every use case needs Kubernetes and its additional overhead/complexity
- Most importantly, devops is not "one size fits all" magic wand that Kubernetes or any other tool can solve. Various nuances to consider and hence you need DevOps as a role.
by techthumb on 6/1/22, 4:37 PM
K8S is not easy.
```
  It helps standardize:
    - deployments of containers
    - health checks
    - cron jobs
    - load balancing
```
What is the "old way" of doing things?
Is it same/similar across teams within and outside your organization.
If not, what would it cost to build consensus and shared understanding?
How would you build this consensus outside your organization?
For small organizations, one should do whatever makes them productive.
However, as soon as you need to standardize across teams and projects, you can either build your own standards and tooling or use something like K8S.
```
  Once you have K8S, the extensibility feature kicks in to address issues such as:
   - Encrypted comms between pods
   - Rotating short lived certificates
```
I don't love K8S.
However, if not K8S then, what alternative should we consider to build consensus and a shared understanding?
by Seriomino on 6/1/22, 3:03 PM
As one person with kubernetes I can build and operate quite a big platform more secure and better than ever before.
Our current platform is much more stable, has more features, and bigger than what the prev team did.
There are plenty of things you can't see like security or backup or scalability.
Backup were done app by app basis. Now you can do snapshots in k8s.
Security still is a mess. But now you can at least isolate stuff.
Scalability meant installing your application x times manually and configuring load balancer etc. Now you set it up per cluster.
Additional features you get with k8s: Auto scaling, high availability, health checks, self healing, standardization.
A lot of things got invented which lead to k8s like container or yaml.
Now with the operator pattern you can also replace admin and embed operational knowledge I to code.
Infrastructure was not ready to be controlled by code like this ever before.
by viraptor on 6/1/22, 12:34 PM
It sure does reduce the amount of work. But there's a lot that remains or just gets shifted to another area/technology. Who's setting up the image build pipeline? Who's handling the scaling and capacity planning? Who's planning how the deployments actually happen? Who's setting up the system for monitoring everything? And tonnes of other things...
Kubernetes helps with: networking setup, consistent deployments, task distribution. That's about it. It's more standardised than plain VMs, but you still have to deal with the other 90% of work.
by togaen on 6/1/22, 12:55 PM
Because software engineers can’t help but make things more complicated than they need to be.
by edanm on 6/1/22, 4:09 PM
The easiest answer to your post is that you are looking at evidence which doesn't necessarily mean what you think it means.
If k8s is as amazing and time saving as you would imagine, you'd expect many companies to want to adopt it, so you'd expect there to be lots of job postings!
It's like saying "if computers are such time savers, why do so many companies hire people that have knowledge in computers". It's because this is a good tool that companies want to hire people with knowledge in that tool!
by samsk on 6/1/22, 12:51 PM
Simple, because they before the company had 2-3 beefy servers running some binaries and it handled all the load without problems.
Now because of new possibilities, and new development they want to switch to Kubernetes to have that new possibilities everyone is talking about, and now you have to build many new containers, configure k8s, autoscalling etc... and developers don't know it (yet) and don't have time to learn it.
So lets hire a DevOps (me) that will do it ;-)
by wvh on 6/1/22, 2:11 PM
There's a shift away from programs that run on actual computers to software that runs on clusters, a large conceptual computer. Whether ultimately Kubernetes-the-software is the answer or not I don't know, but I don't think we're going back to installing packages on individual machines, as the benefits of the conceptual large computer are too great and the most logical way to solve challenges with scale and availability.
A lot of what is called DevOps goes into adapting software to this new mindset. A lot of that software is not written with best practices in mind, and likewise lots of tools are still in their infancy and have rough edges. I think it's fair to say some time and resources go into learning new ways of doing things, and it might not be the best choice for everybody at this stage to spend those resources unless there's an obvious need.
by jstream67 on 6/1/22, 1:15 PM
As far as I can tell from doing years of contracting for non-FANG mid to large size companies -- they basically do their best to copy whatever trends they see coming out of FANG. There is no thought behind it.
Kubernetes and 'DEVOPS' are the new hotness at non-FANG companies as they are always 5-10 years behind the trends. Expect to see more of it before it goes out of fashion.
Also DevOps is just a title. Nobody read the book, nobody is trying to create what the original guy at Google or whatever had in mind. It is just a all encompassing job doing the same activities that the sysadmin used to do. HR tells companies that they should rename their sysadmin departments to DevOps departments and everything else continues as normal.
by JohnHaugeland on 6/1/22, 4:07 PM
Starting from "if Kubernetes is the solution," you aren't going to be able to get to the answer, because:
1. Kubernetes isn't the solution 2. Kubernetes is expensive and extremely maintenance prone 3. Most of the companies I've seen switch to Kube I've seen switch away afterwards
Every time I've seen someone bring up Kubernetes as a solution, everyone at the table with first hand experience has immediately said no, loudly
Remember, there was a time at which someone wouldn't have been laughed out of the room for suggesting Meteor stack, and right now people are taking GraphQL seriously
Kube doesn't make sense until you have hundreds of servers, and devops makes sense at server #2
by bashinator on 6/1/22, 4:41 PM
Like cloud APIs, K8s replaces a ton of by-hand work that formerly made up the profession of systems administration (among others). I see my career progressions from sysadmin to devops as being a pretty natural development of "automate your (old) job away". So today with cloud and k8s, I can be as productive as ten of my old selves fifteen years ago. Back then, it would have been almost unimaginable for a company like the one I'm with to be able to thrive and grow during its first 18-24 months with only a single "IT" staff who can still maintain a great work/life balance.
TL; DR - K8s and cloud let me do the work of ten of my old selves.
by durnygbur on 6/1/22, 1:41 PM
Kubernetes and Angular are how Google burns resources to prevent any significant competition or innovation being created outside of their sphere of influence. Engineers who wank at sophistication usually receive the most attention and decisive power, btw that's what keeps me from participating in interviews. Another reason is that IT engineers are such a low plankton that someone up in the hierarchy with massive indirect financial bonus for using this technology decides to hire "Angular Developers", "Kubernetes DevOps consultants" and this triggers the hiring down to HR and recruiters who simply filter by keywords.
by rbranson on 6/1/22, 2:09 PM
This is like asking: if writing code in a high level language is the solution, then why are there so many software engineering jobs?
by pythops on 6/1/22, 11:59 PM
DevOps is not only k8s. Think about DevOps as managing the whole infrastructure: setup ci/cd pipelines, implementing infra-as-a-code, managing ML pipelines, implementing security policies ... DevOps is very wide
by mfer on 6/1/22, 12:44 PM
A few thoughts...
1) Kubernetes is an infra platform for the ops in DevOps. If developers need to spend a lot of time doing Kubernetes it takes away from their ability/time to do their dev. So, there are a lot of platform teams who pull together tools to simplify the experience for devs or DevOps specialists who handle operating the workloads.
2) Kuberentes is, as Kelsey Hightower puts it, a platform to build platforms. You need DevOps/SREs to do that.
3) Kubernetes is hard. The API is huge and it's complex. The docs are limited.
by raffraffraff on 6/2/22, 6:55 AM
I don't think there are "so many devops" jobs. Everywhere I've worked in the last 15 years, the number of people managing everything from hardware up to developer tools and CI/CD was tiny compared with the number of developers. Some start-ups dont bother with those people at all to begin with and regret it later. Then they hire a tiny team after years of neglecting these areas, and then expect the "wings on the plane to be swapped out during flight". Those devops people are casually expected to be experts in process (incident/problem management), cloud infra / infra as code, db config / replication, networking, security (IAM, SSO, network, OS), release / deployment, monitoring / metrics / alerting / tracing (not just deploying them but working with devs to implement observability in their code), dev tooling (code/artifact repos, every brand of CICD pipelines & runners) ... basically anything that isn't software development. They're also expected to be oncall for other teams in many companies.
Many years ago I worked at a large, old tech company where all of these areas had dedicated teams.
PS: how many people at your company "really" know Kubernetes inside out? And if it misbehaves, who do you expect to have the answer?
by phoehne on 6/1/22, 3:17 PM
I think that part of the problem in general tech is that many developers don't understand they're being marketed to. I really have no opinion on whether or not K8s is a smart choice, and will save you time and effort, require more effort but provide benefits, or is a bad trade-off. But the crazy push for k8s as the one size fits all solution for everything that you get from some corners of the webs smells like hype cycle.
by hsn915 on 6/1/22, 1:14 PM
There's an argument to be made in general (which I don't think applies here - but playing devil's advocate) that says:
When technology makes previous difficult things easy, this technology will be used to do more things that were previously impossible.
I personally haven't seen anyone use k8s to achieve things that were impossible before. They just use it because 1) They think everyone is using it 2) They don't know how to do it any other way
by thedougd on 6/1/22, 1:15 PM
Container adoption by workload is still pretty low. Most workloads are difficult to containerize because they're commercial off the shelf software (cots), Windows based, etc. You need good people who can make the best of these situations and automate what they can with configuration management, image bakeries, CI/CD pipelines, infra as code, and reverse engineering or bending that legacy app to run in a container.
by throwaway892238 on 6/1/22, 3:48 PM
For the past 250 years, new machines have replaced old ways of working to make us more productive. But each new machine is more complicated than the last, requiring more technical jobs to address the new complexity in the machine. Kubernetes is just a new machine, and because so many businesses now want to run that machine, they need more maintenance crews that are trained on said machine. In order to have fewer people, we'd need to leverage economies of scale and specialization so that companies don't need these new big complex machines. Even then, the appearance of fewer workers might just be jobs moved offshore where labor is cheaper.
It's true that moving away from mutable state is a sea change that is (very) slowly engulfing the industry. But the tide is still very far out. The cloud is a pile of mutable state, leading to the same system instability and maintenance headaches that existed before the cloud, requiring new skills and workers to deal with. Redesigning the cloud to be immutable will take decades of re-engineering by vendors, and even when it's done, we'll still need economies of scale to reduce headcount.
by GuB-42 on 6/1/22, 4:16 PM
I find the meaning of DevOps confusing.
Originally I thought it was a methodology assisted by a set of tools to make it easier for devs and ops (sys admins) to work together, mostly by giving both sides the same environment, usually in the form of docker containers.
Admins configure servers, or server pools since physical machines tend to be abstracted these days, set up applications, including the CI/CD stuff, make sure everything is secure, up to date and in working order, etc... Devs write the code, and test it on the platform supplied by the admins.
And now I see all these DevOps jobs. I mean what does it means? You are dev or you are ops, so what does DevOps means? It doesn't mean both, DevOps usually don't write software, so DevOps is ops, maybe we could call them container administrators, like we have database administrators.
I think that the confusion between the DevOps methodology and the DevOps job title give the wrong idea. Someone needs to make all these servers work, calling it serverless just means there is another abstraction layer, and while abstraction has some benefits, it rarely lessens the workload, but it may change job titles, here sysadmin -> DevOps.
by muskmusk on 6/1/22, 4:47 PM
Because everyone and their dog insists on using way over complicated micro service architectures.
Wages does not count as cost of goods sold, so who cares about a couple of extra hires? Funding is easy.
Also you severely underestimate the amount of work that goes under DevOps. Everything from build servers to test and security infrastructure is usually handled by DevOps. It's a massive surface area and it would be way worse without kubernetes.
by autarchprinceps on 6/1/22, 3:01 PM
Well, if you did DevOps the way it is meant to be used, the idea is that the developers can do the minimal efford of the ops part, and you no longer need that role itself. So, depending on what a company means by DevOps, it could mean developers willing to that bit extra, and clearly we need ever more developers, or they could understand it as just a modern kind of ops, in which case they are NOT doing DevOps. Kubernetes has its complexities, but it certainly doesn't require more people than previous methods. But what it does require, is people with new skills, and lots of companies want to move away from their old fields to this new, and therefore neeed to fill those roles, while people only begin to reeducate. Add to that, that more companies are doing more IT in more kinds of business segments, and in many countries the bigger generations are leaving for pension, with less people coming after. DevOps, true or not, is hardly the only field with a lack of enough educated people. You will find the same in lots of engineering fields.
by drakonka on 6/1/22, 8:43 PM
I think that K8s _could_ make the process of maintaining CI/build/deployment tasks much more hands-off and automated. But in practice we often use this new "power" as an opportunity to invent more fancy/smart things to do, taking advantage of it to the point where it requires as much if not more maintenance as whatever came before.
by oofnik on 6/2/22, 5:27 AM
The perception of a "need for so much dev ops nowadays" stems, in my view, from a skills mismatch in the market.
Division of labor and specialization are two natural results of any rapidly evolving industry. The tech industry is no exception. The problem is that there are currently comparatively quite a lot of ways to learn the skills necessary to become a competent web developer, backend engineer, data scientist, etc. Compared to these titles, the ways to learn the skills involved in designing, operating, and maintaining scalable cloud infrastructure have not kept up with market demand.
Kubernetes is not "the" solution, but it is one of several solutions to the problem of standardizing a trade skill for the purposes of making it transferrable. Nobody wants to go to work at a place where the skills they need to do their job well are both difficult to acquire and completely useless once they get a new job.
by vegai_ on 6/1/22, 1:08 PM
> If Kubernetes is the solution...
Yeah, you lost me already. This is a bit like asking why there are other languages besides Java.
by moomin on 6/1/22, 3:26 PM
Let me answer your question with another question: if Microsoft Word is so good, why are there so many people whose job it is to produce documents?
Now, I’m not saying k8s is as transformative as the word processor, but if it was, you’d probably expect to see more ops people, not fewer. They’d just be doing different things.
by ac50hz on 6/1/22, 1:16 PM
Kubernetes is, amongst other things, a technical solution to the billing problem. Composable resources with unit costs permit organizations to charge profitably for such resources. There are also some advantages for organizations who need to purchase such resources, giving some clarity to the process.
by bg24 on 6/11/22, 7:57 PM
It is a people problem. In a typical enterprise with multiple teams involved in every small thing, it is expected to see teams having their agenda and justifying their place/time. Kubernetes is an elegant platform to do just that, because it is so extensible.
As part of my job, I see thousands of small and medium businesses happily embracing kubernetes. They do not have separate devops or security or infra folks. A few engineers do it all. Yes, there are challenges with so many moving parts and rapid releases with breaking APIs. But expect that to be fixed in coming years.
Why so many DevOps jobs? Are you noticing the jobs being eliminated, or are you just seeing more DevOps jobs being created?
by kodah on 6/2/22, 1:40 AM
If you're referring to the days of when systems engineers would run named fleets, then yeah, those days were nightmarish but for different reasons. Instead of debugging the Linux networking stack, I had to defer everything out to a team that ran a help desk to try to find time to investigate because I was locked out of access and tooling. I don't miss those days one bit, though, it was fun to name components of my fleet.
A lot of companies moved to the cloud because their old data centers (or hosts) were driven by ticket systems instead of API's and access management. What took three weeks was now solved in seconds; the beginning was fascinating without a doubt. Then companies realized they owned none of this virtualized infrastructure and were at the behest of a very large corporation who could make sweeping changes with little to no notice. Although Kubernetes was pitched as providing extra grease to the gears of the enterprise, and they weren't wrong, that is not its total value to the enterprise.
The real value in Kubernetes running your own platform on someone elses hardware, especially to the degree where you can eventually free yourself from cloud provider lock-in that the above incurred. An example, if a company can spin up a team to create a database-as-a-service in Kubernetes clusters, then your RDS costs can shift dramatically down, and it develops a new level of capability and understanding that your company never had before.
I'm a SRE-SE, but I mostly use the title "Distributed Systems Software Engineer" because I feel that really fits what I do. DevOps is just a catch-all title for non-application-software tasks and roles at this point because it consumed so many things like "release manager", "QA", "application operations", etc... Personally, I do not trust companies or teams that use this as some sort of distinguishable title.
To answer the last part of your question, "Why are DevOps everywhere", because companies have diverse needs in terms of supporting software and software development, and DevOps is basically the catch all email of software engineering.
by hotpotamus on 6/1/22, 12:43 PM
My glib response is that automation is a lot of work.
by chazu on 6/1/22, 12:54 PM
> one could argue that the role of sys admins just got more specialized
Read the introduction to the SRE book, available free online [1] - and you'll see that SRE is defined _in contrast to_ systems administration. Its specifically defined as software engineering with the goal of managing operational complexity.
Modern shops' failure to understand this (most SREs haven't read any of the book, let alone stopped to think what SRE actually means) is IMHO a primary factor in the failure of most "devops transformations"
[1] https://sre.google/sre-book/part-I-introduction/
by aaccount on 6/2/22, 4:13 AM
Kubernetes isn't a solution. It just pushes the problem down the line. Fundamental problem is most software developed these days aren't packaged properly. The devs just ship the code to production. The hacks that they used to get the development going stays in production.
For example, you don't need a docker container to deploy Mysql, although you can deploy Mysql inside one. But most development processors are so badly managed that one product has many conflicting libraries and dependencies. Eventual requiring each component to be isolated within its own container. Finally leading to an unmanageable number of containers requiring Kubernetes to manage the mess.
by iknownothow on 6/1/22, 12:47 PM
From what I've seen, most developers either aren't systems thinkers or are too busy to take a step back and spot and eliminate redundancy. The best way I can explain this is that many software processes and pipelines within companies are usually complex Directed Acyclic Graphs and very often Transitive Reduction [1] is not applied to these processes.
At the end of transitive reduction, you end up with a graph with all redundant dependencies removed but functionally, it is still the same graph.
[1] https://en.wikipedia.org/wiki/Transitive_reduction
by kevwil on 6/1/22, 11:01 PM
Because it solves 5 problems while creating 4 more in the most complicated way possible. Seriously. It makes some things better, no question, but it is very complicated for some (most) people and is very time-consuming.
by raygelogic on 6/1/22, 2:35 PM
I think it comes down to jevon's paradox--if your demand is unbounded, making production more efficient doesnt reduce inputs, it makes you produce more. in the case of k8s, more reliable, more scalable, etc
by Art9681 on 6/1/22, 5:19 PM
To put it simply, anything that increases efficiency most likely increases the desire to scale. There is a ton of demand because everyone wants to scale to billions of users. Kubernetes is one way to get there until the next thing comes along.
Also, there is more demand than supply. Everyone wants to do Kubernetes and DevOps pipelines but the amount of folks experienced in those fields is small compared with demand.
It requires knowledge in many domains because it abstracts the entire data center. So you can’t just take a mid level sysadmin or developer and expect them to jump right in.
by throwaway894345 on 6/1/22, 2:04 PM
I can't speak for everyone, but Kubernetes lets my company do more with the same amount of manpower, so we use that manpower to do more stuff rather than reduce our SRE/sysadmin footprint.
by ltbarcly3 on 6/1/22, 1:26 PM
Kubernetes is a bad approximation to the infrastructure solution at a place that has different problems than you have. It only complicates and makes everything worse and more expensive to maintain.
by jasonshaev on 6/1/22, 3:41 PM
Something else to consider: what % of server workloads actually run on kubernetes?
I have no data to back this up, but my hypothesis is that if you zoom out, and look across the entire industry, the % is vanishingly small. It may seem like every company is running or adopting kubernetes within our bubbles but our perspective is biased.
(Note: I'm not espousing an opinion on kubernetes itself, just about it's total adoption across the entire industry and how that effects the number of devops/sysadmin/SRE roles.
by Garlef on 6/1/22, 1:15 PM
My short hot take on DevOps and infrastructure as code: "Infrastructure as code has it backwards"
-------
Take the development of programming as an analogy:
* Punched cards
* Programming in assembler
* Goto
* Callable procedures
* Proper functions
* Compiled languages (There used to be companies just selling a big C compiler)
* Interpreters/JIT compilation/...
* ...
-------
And here's a similar progression:
* Servers in your basement
* Some server rented where you login via SSH
* Docker/Kubernetes/Clusters in the cloud
* Lambdas and other serverless solutions
* ...
As a sibling comment pointed out: We're still in the stone ages. Somewhere between punch cards and proper functions.
-------
To rephrase it in reversal: "Infrastructure as code has it backwards"
Right now, we manually partition our code, then provision infrastructure and then push our code to this infrastructure.
Instead, we should take a high level language that compiles to the cloud:
Just write your business logic and the the compiler figures out what clusters/services/event-buses/databases/etc to use; It will automatically partition the code, package, build, provision, push, update. And there's even room for something like JIT: Based on the load parts of your logic get, the compiler could switch databases. Also: Automated data migrations based on your code updates. But I guess we'll end up with a big distributed virtual machine that scales infinitely and completely hides the existence of servers.
There's already some glimpses of this future: No-code, the magic pulumi does with lambdas, several language projects that get rid of the file system and just store the AST in a DB, smart contracts where you pay for single computation steps...
-------
But back to the question: Kubernetes/AWS/etc is a lot of work because it's not really THE SOLUTION.
by rufius on 6/1/22, 6:16 PM
The problems weren’t simplified. The problems were collected together into a single large platform.
However, as with most large platforms, they require ceremonies and priests (devops engineers). Someone has to make the offerings.
Much as people would like to believe, you don’t reduce complexity, you just shuffle it around and there’s an exchange rate. Even with solutions like Fly.io, you’re not getting rid of complexity in aggregate, you’re paying them to manage it (I.e. the exchange rate).
by taylodl on 6/1/22, 1:43 PM
The DevOps jobs are to configure and maintain Kubernetes. The problem is K8S is a general-purpose solution that's being used for managing an application comprised of container images. It's way overcomplicated for that task, but that's already been noted here in this thread. I know of proprietary solutions that greatly simplify DevOps compared to K8S, but they're proprietary.
by dtech on 6/1/22, 1:03 PM
Because they're also doing more things. 20 years ago you might have a single server in the broom closet handled by the sysadmin and developers running tests locally (if you were lucky), nowadays we want all those things you mentioned for production, and CI/CD for developing.
I'd wager providing all those things 20 years ago without k8s and CI tools would've required relatively more sysadmins
by mkl95 on 6/1/22, 3:44 PM
Kubernetes solves a subset of your usual deployment problems and replaces it with a set of its own. I'd call it a tradeoff, but it's such a leaky abstraction that unless your Kubernetes fu is really strong it's mostly going to make your life harder. It's a nice keyword to have in your CV though. Most jobs that "require" it don't actually use it.
by atmosx on 6/2/22, 7:52 PM
The answer to your question is this: _complexity does not go away, we just move complexity to another layer_.
Kubernetes buys the org some things but it is complex and you have to know how to write the app in a certain _way_ in order of the app to be "scalable, etc.".
There are no free meals or as someone smarter than me said long time ago "there is no royal road to learning".
by pantulis on 6/1/22, 1:18 PM
When compared to a Linode VPS box that you provision and setup with Ansible yes, it's much more work (and much more cryptic at that) but also Kubernetes covers for a lot of failure scenarios that a simple Linux box would not be able to cope with while adding many other benefits.
The question is: do _you_ need this added complexity? That humble VPS can scale _a lot_ too.
by fertrevino on 6/1/22, 9:50 PM
DevOps has its days numbered. What you see is the wide adoption of DevOps in every industry. This trend is likely to plateau and decline in the next couple of years, after which DevOps practices are taken for granted and become rather an expectation from customers. The DevOps problem only needs to be solved once, public cloud providers are almost there.
by wecloudpro on 6/1/22, 9:10 PM
If you use Kubernetes, you need custom operators and controllers in order to a have feature rich environment that can support your applications and support all CI/CD instrumentation.
Then, for designing, implementing and maintaining all these extra elements is why you need a devops guy. Also not mentioning how extremely fast things are moving in the cloud era.
by oznog on 6/1/22, 6:47 PM
Clouds have multiple conflicts of interest in favor of:
1. Dethroning sysadmins introducing devops in the middle ("devs" capable of deploying in the cloud but unable to control the OS).
2. Increase CPU and other resource consumption (promoting heavy frameworks, unable to pass the Doherty threshold in 2022).
For clouds, increasing complexity and costs almost always expands the business.
by benlivengood on 6/1/22, 4:16 PM
The cloud providers haven't closed the DevOps loop yet is why. I mostly have experience with Google's stuff, so I will take Cloud Build as an example. It provides the framework of CI/CD, but there isn't automatic build+deploy for every software and framework ecosystem.
What I'm trying to do at work is simplify the build ecosystem for all languages to the familiar `configure ; make ; make test ; make install` sequence that works well for OSS. If every ecosystem fit into that metaphor then the loop could be closed pretty effectively by any cloud provider by letting users add repositories to the CI/CD framework and it would do e.g. a standard docker build (configure, make, test), docker push (make install 1/2), and kubectl rollout to k8s at the end (remainder of make install).
Blockers:
Liveness and readiness checks are not automatic; they need to be part of each language*framework so that developers don't have to implement them by hand. At Google they just sort of came with the default HTTPServer class in your language of choice, with callbacks or promises you knew you had to invoke/complete when the instance was ready to serve. It helped that only 4 languages were officially supported.
Integration tests have no standard format and many deployments are combinations of artifacts built from multiple repositories, and configuration is not standardized (configmaps or ENVs? Both? External source of feature flags?) so all integration tests are manual work.
Metrics and SLOs are manual work; only humans can decide what actual properties of the system are meaningful to measure for the overall health of the system beyond simply readiness/liveness checks. Without key metrics automatic rollouts are fragile. This also means autoscaling isn't truly automatic; you need quality load metrics to scale properly. Not all services are CPU or RAM limited, and sometimes the limit varies depending on traffic.
All that said, cloud functions (Google, AWS, or other versions) are beyond DevOps. If you don't need high-QPS services then use cloud functions. They bypass 90% of the headaches of having code running on https endpoints. Most people don't have high-QPS (10K requests per second per shard) services, and could probably get away with cloud functions (1000 RPS on GCP). Everyone else pays the DevOps or hopefully the SRE tax for now. But we're still trying to automate ourselves out of a job; we don't want to be doing DevOps day-to-day either.
by unity1001 on 6/1/22, 1:36 PM
Kubernetes just solves the long-standing problems at a certain level of infra.
But like everything else in tech, solution of the problems at a given level enables everyone to do much more and build more complex systems that do more complicated stuff on top of that level. We always push the frontier forward with every new solution.
by ciguy on 6/1/22, 5:12 PM
In short - Kubernetes solves a lot of very complex problems. The problems are complex enough that the solutions are also complex and require specialized knowledge to implement well. Most teams using Kubernetes probably shouldn't be, but tech companies like to over-optimize for future scale.
by eric4smith on 6/1/22, 2:37 PM
This is the same thing when people say going to the cloud is not easier.
It’s not.
You still need Devops staff.
Cloud just provisions the hardware and OS. You still have to be responsible for the apps. You still have to be responsible for IO, memory, cpu and networking capacity.
You still need to make sure your apps are able to run on cloud - whether metal or k8s.
by quickthrower2 on 6/4/22, 10:51 AM
Kubernetes is not the solution to completely automating ops so that you don’t have to employ anyone
The solution to that is PaaS (Platform as a Service), and you can start a startup with almost no devops knowledge using things like Heroku and it’s myriad competitors from startups to AWS offerings.
by hukl on 6/1/22, 2:22 PM
Kubernetes and co do not reduce the amount of work - it’s just shifted to the next abstraction layer. Before when DevOps meant „you build it - you run it“ we removed dedicated ops teams to which the code was thrown over the fence and to reduce the animosity and friction between dev and ops. This was great but now all the hips companies have dedicated ops teams only that they are now called „platform teams“. Instead of code artefacts it is now containers that are thrown over the fence and now the ops part became so complex that separating dev and ops again seems reasonable. Luckily for me I managed to keep the good old DevOps of working, developing code and running it or bare metal servers with FreeBSD and Jails - even converting an existing Kubernetes setup back to bare metal. In my opinion the platformisation of the internet infrastructure isn’t a desirable state, monocultures are too and for the vast majority of projects kubernetes is overkill as they won’t ever reach the scale that would justify a kubernetes setup. It‘s like the milage fear for EV cars - but I guess everyone wants to hit facebook or google scale and that desire misinforms the early infrastructure architecture. That is just my 40 years old grey beard view which you can happily ignore whilst flying amongst the clouds :)
by politician on 6/1/22, 12:41 PM
It’s the recognition on the part of companies that cloud providers don’t provide a turnkey solution.
by dogman144 on 6/1/22, 8:18 PM
Because K8s is the very end of a long road, and even when that is done and setup, cloud eng work, shifts to CI/CD, data eng, significant networking maintenance, and IAM/account wrangling will keep the devops'ers employed. SRE is a golden goose job IMO
by mastazi on 6/1/22, 10:43 PM
> The platform of choice is mostly Kubernetes these days
Is it? These days I see SAM or Serverless Framework or other FaaS solutions all around me and it seems that everyone is migrating away from ECS/EKS/containers, it might be my own particular bubble though.
by mmcnl on 6/2/22, 5:51 PM
You're looking at only infrastructure costs, but not at benefits. Being able to autonomously deploy an application in production increases your team's velocity by orders of magnitude, e.g. faster time-to-market, faster feedback loops, etc.
by djohnston on 6/1/22, 1:55 PM
DevOps is basically the "tool smith" from Mythical Man Month's surgical team isomorphism. Any sufficiently large (>10) team of engineers will benefit immensely from a specialist focused on improving internal developer efficiency.
by inportb on 6/1/22, 12:55 PM
Even though Kubernetes could reduce the workload and might require less manpower in some cases, it's still a beast that requires management. So DevOps has shifted from managing traditional infrastructure to managing Kubernetes configurations.
by mbrodersen on 6/2/22, 1:58 AM
DevOps are supposed to be part of the software development team. NOT a separate department. That’s the difference between SysAdmins and DevOps. It’s in the name! Developers (on a team that run) Operations (of the teams products). DevOps.
by fartcannon on 6/2/22, 2:56 AM
I think it was originally pushed as a way to get more people to use cloud platforms. And who better than Google to host that which they created?
Luckily its from the functionally less evil google days and open source so it is possible to use anywhere.
by aristofun on 6/1/22, 5:28 PM
Kubernetes is just heavily overegineered and overmarketed thing. Let’s face the truth.
by haspok on 6/1/22, 3:23 PM
It's like saying that you won't need human workers because you'll have robots doing the work. Aha, sure, but who is going to program those robots?...
by mr_toad on 6/1/22, 10:38 PM
https://en.wikipedia.org/wiki/Jevons_paradox
by flyinprogrammer on 6/1/22, 12:55 PM
Because it primarily wasn't built for developers. It was built to keep sys admins relevant and give vendors a common place to sell their vaporware.
by pabs3 on 6/2/22, 4:00 AM
I keep wondering when the systemd folks will come up with an orchestration layer over systemd-nspawn/systemd-machined to replace Kubernetes.
by mcharezinski on 6/1/22, 11:41 PM
I'd love to see a movement where more engineers write tooling in-house to solve technical problems. Not adapting existing and promoted ones.
by lumost on 6/1/22, 2:01 PM
I've seen a general blurring of the lines between these roles. But a common theme is that if you have a dedicated "role" for something, they will prefer tools which cater to their "role". This is both a good thing for companies who benefit from further optimization within that "role", and a bad thing for companies who do not.
Kubernetes is a powerful tool for "DevOps" roles. It provides an immense array of configuration, and largely replaces many OpenStack, Xen, or VMWare type environments. You can build powerful internal workflows on it to colocate many services on one compute fleet while maintaining developer velocity - which can translate to large margin improvements for some firms. This comes at a cost that you are likely to need a Kubernetes team, and potentially a dev tooling team to make it all work. On a large compute environment, the latter costs don't effect the big picture.
Now on the other hand, more teams than you would expect are just fine on Heroku/AppEngine/AppRunner/Lambda. These teams tend to pay the cost of not having a dedicated dev tooling team through more expensive compute, and sub-optimal tooling. The benefit here though is that "more expensive compute" may mean a fraction of a salary in many environments, and "sub-optimal" tooling may mean a production grade SaaS offering that has a few rough edges you talk to the vendor about.
IME it's much cheaper/lower risk to choose the latter in the long-run. The apparent savings from option 1 eventually turn into tech debt as the shiny tools get old, and migrating to newer/cheaper compute options becomes more expensive. I once built a colo facility which resulted in a 4x reduction in monthly recurring expenses (including salaries) for the same compute footprint, 1 year into the lifetime of the facility the former cloud provider reduced prices by ~30%. Around 6 months into the facility the DataScience team suffered attrition, resulting in fewer compute needs. At the 1.5 year mark the team begged for a flip to SSDs as they were having latency issues (a point of initial contention with the team that SSDs should have been used in the first place). Over the 3 year expected lifespan of the facility there were about ~2.5 months of ramp up/migration work which impacted ROI.
Overall, in hindsight, I'd say at best we achieved a 1.5x reduction in compute expenses compared to the alternative of tooling improvements, cloud cost reductions, and compute optimization. I now seek the tool which provides the lowest friction abstraction as at the worst case I can simply migrate to something cheaper - investing in compute infra has a crazy level of depreciation.
by jrockway on 6/1/22, 2:57 PM
Here's my thought on the current state of the industry. DevOps at some point was not a specialty that you hired for, it was a way of thinking about your team's responsibility. Your team would make an application and your team would run that in production. If you wanted to test things before deploying, you would do that. If you wanted automated deploys, you would set that up. No middleman with competing concerns between you and your users.
Eventually, people had a hard time finding well-rounded individuals that could design, develop, test, and deploy software. It seems to be a rare skillset, and people are resigned to not being able to hire for that kind of role. So, all of these ancillary concerns got split off into separate teams. You have a design team, a software engineering team, a test engineering team, operations, and so on. DevOps changed from "developers that operate their software" to "developer operations", which is just your 1990s operations team with a new name. You the developer want something, it goes on a backlog for some other team, you wait 6-8 years, you get your thing.
All the complexity of the devops world comes from having one team writing the software and one team running the software. An example are service meshes. They are super popular right now, and everyone and their mother is writing one and selling it for tens of thousands of dollars per year. To the software engineer, having two applications communicate over TLS is pretty simple; you read the certificates and keys from disk or an environment variable, throw them into a tls.Config, and give that tls.Config to your internal servers and internal clients. But, what happens in the real world is that the organization says something like "all apps must use mTLS by January 2023". The software team says "meh, we don't care, we'll get to it when we get to it". So the poor devops team is stuck figuring out some way to make it work. The end result is a Kubernetes admission controller that injects sidecars into every deployment, which provision TLS keys from a central server at application startup time. The sidecars then adjust iptables rules so that all outgoing connections from the original application go through the proxy, and if some distributed policy says that the connection is supposed to be mTLS, it makes that happen. Basically, because nobody on the dev team was willing to spend 15 minutes learning how to make this all work, it got bolted on by $100k worth of consultants, all for a worse result than just typing in a very small number of lines of code by yourself. That's the state of devops. The people writing the software won't run it, so you have to add complexity to get what the organization wants.
I think it's terrible, but that's the fundamental disconnect. When you need to change how software works without being able to edit the code, the workarounds get increasingly complicated.
As always, what looks like a software complexity problem is actually an organizational complexity problem. The way I've managed this in the past is to organize a lot of knowledge sharing, and make a conscious effort to avoid hiring too many specialists. At my current job my team used to make a SaaS product, and our team was a combination of backend software engineers, frontend software engineers, and some engineers with experience with operations. We were one team; frontend engineers would write Go code, backend engineers would make React changes, and we all did operational drills ("game days") once a week. The result was a very well-rounded team. Everyone could deploy to production. Everyone could be on call. Everyone could fix problems outside of their official area of expertise. I wouldn't have it any other way. The industry, however, deeply disagrees with that approach. So you're going to have testing teams, devops teams, etc.
by mountainriver on 6/1/22, 2:46 PM
Why aren’t we working 3 day weeks when we have all this automation power today? Like all things in life the bar just gets raised
by kristianp on 6/2/22, 1:16 AM
If x is the solution, why are there so many <x related> jobs?
Economics says that as something becomes cheaper, demand increases.
by broknbottle on 6/1/22, 5:43 PM
Because coding in Yamllang and Jsonlang is superior to old and archaic languages like Rust and Golang.
by musicale on 6/3/22, 1:02 AM
"To Kubernetes! The cause of - and solution to - all of life's problems!"
by imwillofficial on 6/1/22, 1:59 PM
Scale, we are using so much more IT infra now. Old sysadmin ways don’t scale well.
by otabdeveloper4 on 6/1/22, 5:16 PM
Yes, it is the solution for keeping DevOps employed and happily compensated.
by BirAdam on 6/1/22, 1:25 PM
So, I am old enough that when I started my career I was just a "system administrator" who happened (rather luckily) to work primarily with BSD and Linux servers. At that time, I was still learning a lot. I eventually learned enough and gained enough experience to become a "systems engineer" which meant that I could architect solutions for customers of my employer. I then became a senior systems engineer. Throughout this entire time things like Chef, Puppet, ansible, and Salt were not widely used even after they were created. Red Hat pushed ansible really really hard once it came out, and config management became a thing. The combination of config management systems with containers created two new roles: DevOps, SRE. Servers became VMs, which in turn became container platforms. Config managers took the place of version control and a bash script. CI/CD became weirder. In times past, you would have something like HAproxy on FreeBSD, which would then send traffic to Apache/Nginx servers, which in turn sent traffic to PHP servers, which called data from database servers and an NFS cluster. Now, behind the scenes, you may still have HAproxy or other load balancers, but those are combined with something similar to OpenStack with an underlying storage system like Ceph. All of that may get partnered with geo-aware DNS if you're really fancy. Systems engineers and admins are still managing that stuff behind the scenes at Azure, AWS, Google, RackSpace, Cloudflare, DigitalOcean, and other places (or at least I imagine so). There are also engineers who specialize in OpenStack. Most, however, have transitioned to the new roles of DevOps or SRE, because the need for highly skilled SEs and SAs has waned.
Essentially, these roles have narrowed the focus of system administrators and systems engineers. In one, you are concerned with CI/CD, and in the other you are making and maintaining cloudy solutions for people. This is yet another layer of abstraction for people, but it also means that most people do not know how to configure underlying software anymore. Because they lack knowledge of how to configure underlying software, they also require automation frameworks. They now do not know how to automate their workflows with Bash, Ruby, Python, or anything else. They need the cloud system to do it for them, which means that they get very vendor locked.
EDIT: the plus side of a new abstraction layer is cheaper tech departments at non-tech companies (fewer and cheaper personnel); which also means that pretty much everyone wants to be a software developer now, and very few people want to be SAs, SEs, DOEs, or SREs; you have to know everybit as much but you get paid much less.
All of this may bust. Increasingly, more and more people are becoming wary of monopolistic tech giants. The cost of their datacenters on the planet is increasingly rapidly. The governments of the world are growing wary of their increasing power. For businesses, complete reliance on a third party who has vastly more power isn't as palatable as it used to be. We may see a resurgence of smaller DCs and bare metal deployment, but any such change would only happen if another massive tech bust occurs. The reality that I see is that we may see both models live in tandem indefinitely, as there are differing use-cases that make either more suitable.
by eeZah7Ux on 6/1/22, 5:09 PM
Because Kubernetes automates away 4 jobs by creating the need for 5.
by merb on 6/1/22, 9:17 PM
btw. kubernetes is just a scheduler. you give kubernetes a definition and it will schedule the things according to your definition. everything else is basically just an addon.
by janosdebugs on 6/2/22, 6:15 AM
Kubernetes is insanely complex and modular. Just yesterday I was looking at the source code and the code part I knew was replaced by yet another pluggable system. Instead of consolidating into a well-understoof set of features, Kubernetes is exploding with complexity, so it's almost impossible to "build it yourself" for a production environment.
However, there are plenty of companies that will sell you a system, including varying levels of support. You then, of course, have to hire your own DevOps engineers that will deal with the areas the support doesn't cover, which, given the complexity, is still an awful lot. Or you do everything in-house, which means hiring even more people.
TL;DR DevOps engineers won't be out of the job anytime soon. Same for Kubernetes developers.
by fancyfaith on 6/1/22, 4:25 PM
Checkout jetpack.io they are trying to solve exactly that
by znpy on 6/1/22, 5:39 PM
Because kubernetes is both the problem and the solution.
by zxcvbnm on 6/1/22, 5:45 PM
because it's an overengineered hype that does not reduce complexity, only shovels it around, turning simple problems into obscure ones
by lachlanwhite on 6/1/22, 11:48 PM
It isn't the solution :)
by kulikalov on 6/1/22, 12:52 PM
Tl;dr: Kubernetes is not "the platform of choice. There is no universal tool. That's why you need system architects, DevOps, etc.
by blodkorv on 6/1/22, 2:34 PM
instead of 3-4 devops guys, you now only need 1-2 really good kubernetes guy.
by throwaway787544 on 6/1/22, 1:47 PM
DevOps isn't a job. DevOps is a system to work with people directly and find out what they need and give them things that enable them to get their job done faster, while also getting enough information to make sure the product stays online and reliable. What people call "a DevOps role" today is just sysadmin or sysop or syseng or SRE.
Back in the day we cobbled together solutions out of different parts because it gave us a strategic advantage over monolithic commercial solutions. It was cheaper, but it was also easy to customize and fit to product & user needs. Yes configuration management was a nightmare, and it came back from the dead as Terraform, because instead of an OS with mutable state we now have a Cloud with mutable state. Docker and Packer and a few other solutions have fixed a lot of the mutable state issues, but databases are still flawed and SaaS is still just a mucky mess of unversioned mutable state and nonstandard uncomposeable poorly documented APIs.
With Kubernetes, we're back in the land of commercial monolithic products. Technically you can build it yourself and customize it all, but it's expensive and time consuming and difficult because of how many components there are tied together. It "gives you everything you need" the way the International Space Station does. Do you need a space station, or a barn?
People get so wrapped up in terminology. Declarative doesn't mean anything other than "not procedural"; it's not better than procedural, it's just different. Plenty of declarative things are a tire fire. Infrastructure as Code just means "there is some code that is committed to git that can manage my servers". A shell script calling AWS CLI is IaC. Doesn't make it a good solution.
You can't just install a piece of software and be done. That's the entire point of the DevOps movement, really. It's not about what you use, it's all about how you use it. Work with humans to figure out what will work for your specific situation. Use your brain. Don't just install some software because it's trendy and hope it will fix all your problems.