The right time to assess Windows Vista's performance

Measuring the performance of an operating system is a tricky thing.  At the same time, it's the right and necessary thing to do, because performance is one of many criteria important to customers.  Part of the trick of measuring performance is to time testing execution with the product cycle such that the results are as meaningful as possible for customers; this helps them make a better decision by making use of the full array of available information.  As one example, about a year ago we commissioned a firm called Principled Technologies to conduct a study comparing Windows XP SP2 to Windows Vista RTM.  That study found the performance measures of the two operating systems were within the same range for many tasks that home and business users frequently perform under real-world conditions.

My point is that we waited to conduct these benchmarking tests until Windows Vista had reached the RTM milestone in the product cycle, as this allowed us to provide our customers the most meaningful data available at the time -- the data most likely to directly affect their decision to upgrade to Windows Vista.  We do a whole range of performance tests at every stage of the OS development process, but, as a general rule, we avoid sharing benchmark tests of software that hasn't gone RTM (i.e., final code).  This explains why we have not to date published any findings of benchmark tests (nor commissioned anyone to do so) on performance improvements brought about by Windows Vista SP1.  Publishing benchmarks of the performance of Windows Vista SP1 now wouldn't be a worthwhile exercise for our customers, as the code is still in development and, to the degree that benchmarking tests are involved, remains a moving target.

Aside from that point, let me also emphasize that there are a variety of ways to benchmark the performance of a PC.  Different techniques can yield different results.  Some benchmark techniques simply test PC hardware performance by running a series of tasks at superhuman speed.  Such tests tend to exaggerate small differences between test platforms and consequently are used less frequently nowadays, replaced in favor of benchmarks running tasks at human speeds with realistic waits and data entry.  Benchmarks that run at superhuman speeds often deliver results that don't tell the whole story.  In fact, we made deliberate choices during the development of Windows Vista to focus on real-world scenarios affecting user experience, rather than focusing on improvement of microsecond operations imperceptible to the user.  In addition, in Windows many operations can require additional processing time for work that is done for reasons that benefit the customer; these can include security, reliability or application compatibility checks conducted when a program launches.  These operations may add microseconds to an individual application's launch that under real usage isn't perceivable to the human eye.  When thousands such operations are strung together through automation, those few microseconds can have a cumulative effect on the benchmark result, causing performance to appear much better or worse than expected.

I've included below a video we captured depicting a "benchmark test" running a window-open, window-close routine at accelerated speed.  You can see that it isn't representative of real-world user behavior and hence isn't an accurate gauge of the actual end-user experience.  Further, tests like these only measure a very small set of Windows capabilities and so aren't representative of the user's overall day-to-day experience of working with Windows and running applications.


Video: Windows Vista benchmark testing

Methods like those of Principled Technologies that actually approximate the experience of using the PC, taking an OS through the paces of completing actual tasks at the approximate pace a user might click through them, tend to provide results far more useful to our customers.  The typical Windows customer generally wants to know how his/her actual computing experience will change (read: improve) with an upgrade.  The Principled Technologies tests do that.

For what it's worth, I can personally attest that I prefer to get my work done on Windows Vista SP1 RC bits.  I run Windows Vista RTM on two production machines and SP1 RC bits on two others; in fact, I'm writing this post on a machine with SP1 RC bits installed.  As a part of our internal SP1 testing program, I know that we continue to develop and improve SP1 every day, in large part based on feedback and bug submissions from external an internal Beta-test program members.  IMO, the perceived gains in performance between SP1 Beta and SP1 RC code are significant.  As I said at the beginning, though, performance is only part of the story -- don't forget that SP1 also brings support for new types of hardware and several emerging standards, and further eases an IT administrator's deployment and management efforts.

But don't take my word alone for it.  We'll broaden the testing pool of SP1 RC bits soon (very soon), so when I post that notice here on the blog, you'll be able to put Windows Vista SP1 RC through its paces yourself.  I think you'll find the experience worthwhile and satisfying.


Comments

  1. Posted on: January 10, 2008 at 3:37PM  

    Hey ChicagoBobt:  not sure what you're referring to, as the release of Windows XP SP3 has not been delayed - it is on track and progressing according to schedule.

  2. Posted on: January 14, 2008 at 1:01PM  

    Considering it was to be launched in 07

    then backed to early 08 and now early 08 means

    first half (June) I would call that sliping.

  3. Posted on: January 20, 2008 at 12:53AM  

    I just bought a new Core 2 processor and couldn't wait to put in my Vista machine to see it fly.  The result was absolutly no increase in speed at all.  So I disconnected the primary hard drive and installed xp on the secondary hd I used for mirroring.  It's the exact same hard drive as the primary and I even used the same cables.  So how does the exact same machine run with xp instead of Vista?  Incredibly faster.  No contest.  I'm wiping the Vista drive and considering the money lost.  Was this the correct way to assess Vista or am I doing something wrong?

  4. Posted on: January 21, 2008 at 5:22PM  

    I bougth a notebook with Windows Vista and WOW, my new pc was as slow as a 486, the solution: install a free operating system "UBUNTU"

    WOW i lost my money... WOW

  5. Posted on: January 22, 2008 at 2:16PM  

    Nick,  

    If the Devil Mountain benchmarks are flawed, but you agree that benchmarking is important, then what is the correct methodology?  The Principled Technologies tests reference proprietary tools that were lent to them from Microsoft for the tests (ScenCap), I don’t see how independent tests can reproduced independently.  Can you advise on a tool or methodology for benchmarking that you do think would produce accurate results?

  6. Posted on: January 22, 2008 at 9:07PM  

    Hey session0:  we'll have more news and guidance on benchmarking Windows Vista SP1 at its RTM, which is coming soon ...

  7. Posted on: January 25, 2008 at 12:16PM  

    Will they be industry standard or

    MS standard?

    Will these benchmarks contain video/audio compression of several compressors, simutaneous processes, statistical analysis,

    time to start applications, start applications

    like Visual Studio 2005, time to start Vista from a cold start, thread benchmarks, and raytracing, database query and record retrival?

    Or?

  8. Posted on: February 15, 2008 at 3:29PM  

    Hey Nick: I would love to see an update to this now that Windows Vista SP1 has released to manufacturing.  May people want to know what the performance story of Vista is and if SP1 changes it and how do we measure that.

  9. Posted on: February 23, 2008 at 8:29PM  

    Every newer version of windows has reflected a performance drop.  There was a significant drop from 95 to 98, not too bad to XP.  I can't speak for Vista yet - but its coming pre-installed on a Dell laptop.  I'm looking forward to fiddling with all the bells and whistles.  I'm not too worried about the performance, as long as it works.

    ...And I can allways dual boot with SUSE if Vista fails to perform.  I get to wobble windows and play with 3d desktop environments on an 850 duron with 386MB RAM on that platform, so I know it will be pretty smooth even on a single core 1.8Ghz celeron and a few GB of RAM.

    DX10 isn't enough to tempt me to Vista for the other PC's, although I'd describe myself as a hardware enthusiast.  In fact, no, really, buying a console is cheaper and better for games.  MMPOG'S are ported to Linux now - and that's an OS I can upgrade and re-install as many times as I like for free.

    I like MS products - but, really, when we talk performance I'd like to see some real improvements and clever stuff - INNVOATION - like optional platter striping for the OS, programs or VM.  I want "green" innovations for using "retired" memory modules over a dedicated bus - that would be cool, and much better power management, clever power management.  I want better self-diagnostics.  I don't want hangs BSODS and freezes and problems with drivers and IRQ's. I want windows to repair itself better.  I want more control of what gets swapped and what is in RAM.  And I want an OS that can run previous generations of operating systems virtually so the software I have will allways run and so that better, braver and faster solutions can be found without worrying about backwards-compatability too much.  

    And I don't want bloat.

Trackbacks

  1. Posted by: Windows Vista News on December 01, 2007 at 12:30AM

    Did you see this post at windowsvistablog.com

  2. Posted by: TechBlog on December 01, 2007 at 7:04PM

    Buried at the bottom of a blog post from the Windows Vista Team's Nick White about how now is the right time to assess Windows Vista's performance is an intriguing bit of news: For what it's worth, I can personally...

  3. Posted by: Strategic Developer | Martin Heller on December 03, 2007 at 5:40PM

    I've noticed (well, who wouldn't?) that Randall Kennedy (RCK) is in a kerfuffle about Nick White, a Microsoft Vista Product Manager, who blogged a relatively mild discussion of "The right time to assess Windows Vista's performance." So, at the risk of

  4. Posted by: External Links & References (SSQA.net) on December 06, 2007 at 1:38PM

    Windows Vista SP1 RC is released on 05th December to the Beta Testers on the Microsoft-Connect and on

  5. Posted by: David Overton's Blog on December 11, 2007 at 5:41AM

    Following up on my articles that discuss Vista performance (I still stick by my statement that it is