Nov 30

Like many good things in the US, cloud computing originated on the West coast inside the innovative web giants Google and Amazon.  Both offer forms of storage and application services.  Google aims today at the consumer and end-user with the ever-expanding office-type applications (Calendar, documents, mail, etc.).  For many organizations (such as the city of Los Angeles) Google replaces the traditional Microsoft desktop application suite.  Amazon is geared more for advanced consumers (like web developers) and the enterprise with a range of increasingly data center friendly storage and compute services.

An ever-increasing list of cloud-storage focused startups are on the scene, and now the major vendors are making entries to market, slowly but surely.  Don’t forget the Telecom’s like AT&T, who like EMC, are closely modeled after the Amazon cloud model. wave

The public cloud model threatens the premise of the traditional enterprise hardware and software industry, so we are beginning to see the major infrastructure vendors positioning for the private cloud market, as seen with the EMC/Cisco/VMware joint venture cloud services company Acadia and a slew of similarly minded partnerships.

Indeed, the cloud is a confusing marketing fueled circus at the moment, but I do think the impact on traditional infrastructure roles will be significant over time.  I personally don’t think the private cloud is a cloud (I like to call it virtualized infrastructure with good engineering), and like some people would suggest, “not everything is a cloud“.  What I’m talking about is the impact of public cloud infrastructure services on general IT infrastructure.

So here’s a preliminary speculation on the cloud impact to infrastructure roles:

  • Tech Cycle One (2010-2012)
    • Storage cloud services evolve for archive, file, and disaster recovery
    • Compute cloud services mature to data center ‘friendly’ offerings
    • Flurry of device and software entries to market to solve ‘first mile problem’ associated with cloud archive and storage (this is something I plan to write about soon)
    • Fewer system administrator type roles required for small-medium businesses and startups (via cloud compute)
    • Early adopters of large enterprise storage and compute services – primarily x86 and secondary storage, but limited impact to headcount
  • Tech Cycle Two (2013-2015)
    • Storage cloud services mature, consolidations and major acquisitions begin
    • Compute cloud services become standard for small business and adoption begins in mid-market
    • Significant reduction of standard back-office support (files, email) for small-medium businesses and startups
    • System administrator type roles required for small-medium businesses and startups become less common place, in some cases not required at all
    • Initial impact to large enterprise storage and compute services – primarily x86 and secondary storage, possible redux in skill requirements

As a practitioner, this is a good time to track the trends and stay ahead of the shift. This one may shake up the status-quo management model we’ve enjoyed for the last 15 years in distributed computing.

Tagged with:
Aug 23

As the cloud hype continues into an exponential roar of a pre-launch countdown, you have to wonder how much of the excitement is catching outside of the IT microcosm. On one hand you have a relatively unproven business model, yet we’re looking at possibly the largest wave of hype ever to hit the IT industry. The average CIO today is driving hard for cloud, private, public, whatever it takes to get to the cloud, while the average IT architect is still trying to figure out what exactly we mean by cloud. Politicians have gotten on the magic carpet ride too.

A great example is Rueven Carlye, state representative of my old stomping grounds in Washington state, who is rallying against a new state data center project in Olympia, calling it ‘a 300 million dollar mistake’ in defense of lower costs via cloud providers.  That’s a bold move for attacking an entire data center budget, where the state likely runs on 100’s of legacy applications and platforms, unlikely ever to be candidates for cloud computing. A more pragmatic push to cloud is afoot at the city of Los Angeles, with legislators gunning to shift desktop workload to Google applications.

So do we suddenly have tech-savvy legislators, or do we have a growing lobby influence from the cloud pioneers, Google and Amazon? I’d speculate that millions of lobbying effort per quarter is only the beginning.

But we IT pioneers pack up our wagon train and head into the cloud; do we have any laws defining ‘who owns the data’? As far as my research on the topic goes, the answer is no.  You do have broad legislation in the European Union limiting private data movement out of country, but practically no laws outlining who owns your data when it goes into the cloud (or anywhere for that matter). One 3rd party cloud architect noted, ‘chain of custody is the only legal precedent protecting data ownership in the cloud’, and if you don’t have that, you don’t have much of a legal leg to stand on if you’re data is being uploaded into a 3rd party virtual infrastructure and ownership/control is called into question.  Definitely reminds me of feudal Europe – those who own the land (infrastructure) have the power (over the data) in the emerging cloud market.

Tagged with:
Jul 21

5 years ago, your average data center hosting 10,000 servers looked like a linoleum covered football field,  was wicked cold, yet had plenty of aisles where you could break a sweat due to warm air-current hot-spots. In the early century, air-convection planning and floor-space reconfigurations were considered ‘green’ strategies to reduce environmental costs.Google

Flash forward to, uh now, for those who haven’t seen the video of google data centers, this is an interesting glimpse into the world of massive-scale distributed computing on the cheap and the green.

First, some observations:

What you see:

  • Lots of modular components: starting with containers, basic power and plumbing conduits, heat-exchangers, and lots of open-case x86 kit crammed into high-density confined spaces.
  • An obsessive design focus on minimizing the Power Usage Effectiveness ratio
  • Centralized cooling and power infrastructure and many non-descript containers in a warehouse rack configuration
  • Vertical distribution of gear and workload (way beyond the scale of your average data center rack)
  • As noted in the exceptionally dry video dialogue, 45000 servers across 45 containers (1000 servers per container), in a single warehouse type facility
  • (now, close your eyes and imagine Google running on over 450,000 servers world wide and tell me if you’re going to sign up for a yahoo account tomorrow)

What don’t you see?

  • No raised floor, specialized rooms or floors built for data center infrastructure
  • The classic air conditioned hockey-rink full of heat generating equipment
  • Large volumes of ambient space being cooled
  • Centralized UPS / Power Fail-Safe equipment
  • No cooling or power machines competing for compute space
  • Vendor labels (notice, no labels whatsoever)
  • Specialized hardware – in fact the hardware is engineered to be dead simple and uniform (which is relatively easy to do if your business runs massively distributed custom application code)

If you’re watching the Google drip-feed of information about their data centers, the innovation is absolutely incredible. The latest news about the chiller-less data center in Belgium, you see geography, climatology, IT infrastructure, and geographic load-balancing strategy all in play, with a goal to load-balance work away from Belgium on hot days, and not rely on a single chiller. That’s seriously more innovative than creating warm/cold air convection cells in your raised-floor meat-locker. Maybe my earth sciences degree will come in handy for IT work after all…

Next, if you take a look at the major IT infrastructure vendors, we now have a horse race to emulate big modularized x86 kit in a box, ready for rapid deployment and consumption. Sun arguably pioneered the concept several years ago with the modular data center, and now we have pods, pods, and more pods.

In the larger picture, these innovations seem to validate Friedman’s semi-futuristic speculation in Hot, Flat, & Crowded, of the convergence between power grid and IT infrastructure. Granted, it will take years, if not decades to purge legacy systems and applications, but for your average enterprise planning on ‘massively virtualized’, this isn’t a far-stretch of the imagination to see where centralized infrastructure could possibly take hold for large parts of IT infrastructure. Absolutely, there are scores of dependencies and assumptions which must be true for this to work.

The Google drip feed also shed’s light on why cloud computing is more than just a link to the next best data center. As a buddy of mine put it, ‘maybe the people who want to build their own cloud are missing the point’.

Tagged with:
Jul 08

The biggest drama in the 2007-2009 storage industry is over. NetApp loses Data Domain.

This is very bad news for Quantum and FalconSTOR, leaves NetApp in a less than desirable position to market, and puts EMC in a strong technology position.  With a capital infusion of 2.3 billion, EMC is going to lead aggressively and move more kit than we can collectively imagine with the Data Domain acquisition.

For Quantum, their main channel was EMC through a not too shabby product line up (if you exclude the Franken-EDL with Deduplication), which extended through the Dell relationship to the mid-market, with a collective wide and deep market reach. That’s going away.

For FalconSTOR, the EMC relationship presumably ends with EDL. EMC still pushes EDL due to the fact it’s a solid product and does a great job at being a non-deduplicated VTL. But nontheless deduplication for backup data storage on disk is usually a deal-maker when you do the capacity sizing and are getting 8:1 deduplication or better.

For NetApp, it’s back to square one with kind of late-to-market VTL+deduplication offering that has yet to even scratch the powerful legacy of the NAS product lineup.

For EMC, things just got a lot more interesting. EMC has acquired arguably the most ‘proven’ deduplication feature set in the field (if you could instances and years in production). While some real engineering will be required for Data Domain deduplication to legitimately play in the primary storage space, the collective deduplication, virtualization, and security capabilities of EMC (Data Domain, VMWARE, RSA), position EMC with a mad toolkit for the cloud storage game. And in the meantime, a simple and proven product offering to market in the small-mid-enterprise market for VTL/NAS deduplication is ready to roll.

Plus, EMC has done a great job of acquiring companies and not screwing them up over the last several years. That is of course if you look past the Data General acquisition and the 2nd-class citizenry of the CLARiiON line, which persists to this day as a result of an ego-driven acquisition. I’d speculate EMC plays somewhat softer and smarter these days, and is going to make this work and take advantage of the investment.

Tagged with:
Jun 25

The mission?

To research, spot, and explore trends and their real world impact as they apply to the next generation of IT infrastructure and enterprise computing.phase shift

Why this blog?

Because there aren’t many independents covering this space in a holistic way. Plenty of ‘company blogs’ but not so many IT infrastructure people. There are plenty of niche bloggers and evangelists who are super deep in various specialities, such as virtualization, storage, deduplication, data protection, and cloud, but few who are looking at the wholesale change underway for IT infrastructure. At first I thought I was late to the party but upon further inquiry realized the party is just starting…

why now?

There’s a lot of hype in IT, it keeps the spending and budgeting up.

Cloud is hype city without a whole lot of focus or clarity yet. I just look at the vast inefficiencies in classic IT infrastructure versus the radically innovative core infrastructures of Amazon, Google, etc and think the cloud gateway to these architectures will crush IT as we know it for some data types, some types of companies, in many different situations. Labor arbitrage for enterprise infrastructure management could become irrelevant over time if next generation architectures flatten out skill requirements.  Why now? It’s timing.

WAN Network architectures are fatter and cheaper. Cheap petabyte scale infrastructures are do-able and you don’t necessarily have to hack together custom built gear and code to make it work. And most importantly you have base of technologies for virtualization, grid, deduplication, all rapidly becoming commoditized, and now cloud emerging as the gateway to super efficient architecture. This is more than just the next wave of Moore’s law and new feature sets or gadget buzz. This is a phase shift.

Why should people read it?

To keep up, to contribute, to challenge ideas, to address a plethora of good questions. Because the whole idea we are used to and for many of us the basic paradigm of distributed computing “as we learned it” and is staple to our entire careers, may in fact be changing. Were talking about 20 years of one model and entering into the next 20 years of the new new new thing. And the practical impacts to us, well that’s really yet to be seen but it’s a fair bet certain roles, functions, and technologies will begin to disappear in the next 5 years as we shift gears into the next 20 year cycle of IT infrastructure.

Who should read it?

Technologists, engineers, architects, entrepreneurs, strategists, designers, scientists, researchers, and thinkers who share a common curiosity of what is going to happen to IT infrastructure as we know it.

Tagged with:
preload preload preload