Re: Performance Translators' Stability and Usefulness - Regression test outline

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

 



Hi Gordan,

> Ouch, ouch, ouch. That sounds like a monumental bodge. If somebody
> working for me implemented that kind of a "solution" for a frequently
> occuring problem in a production environment, they'd be finding
> themselves looking for a new job pretty quickly. Most likely along with
> the architect who trialed the solution before putting it into production
> without finding the problems that require such a solution. Solution to
> crashing processes is fixing the bug that causes them to crash, not a
> wrapper that gets them restarted.

Then I've got to say, ouch. Because a very similar sort of bodge (admittedly, 
a touch more sophisticated than a cron job :) is what's keeping my production 
system together. As I have been unable to track down the crash bug that crept 
into my thoroughly-tested-before-deployment production system over the years.

(But then you know this tune already :)

That said - if you're aiming for reliability in your systems - shouldn't you 
have this sort of automation anyway? Sometimes even the most reliable 
software hits the unexpected.

I'd rather have the server handle it automatically (and investigate at my 
leisure from detailed logs etc), then get an irate phone call at some ungodly 
hour.

> The _LAST_ thing Gluster needs at the moment is more features. Lack of
> stability loses you customers much faster than extra features gain them.

Agreed.

> IMO, a top down approach merely glazes over the more fundamental
> problems. You cannot engineer quality from the top down. You design from
> top down, but quality comes from bottom up.

I think you need both approaches.

Functional tests - the top down approach - often catches what you don't have 
unit or integration tests for yet. Unit tests and integrations tests catch 
what your functional tests can't test for.

Geoff.

On Wed, 8 Jul 2009, Gordan Bobic wrote:
> On 07/07/2009 19:38, Mickey Mazarick wrote:
> > Since I'm running my setup as a storage farm it just doesn't matter to
> > me if there's a memory leak of if a server daemon crashes, I have cron
> > jobs that restart it and I barely take notice.
>
> Ouch, ouch, ouch. That sounds like a monumental bodge. If somebody
> working for me implemented that kind of a "solution" for a frequently
> occuring problem in a production environment, they'd be finding
> themselves looking for a new job pretty quickly. Most likely along with
> the architect who trialed the solution before putting it into production
> without finding the problems that require such a solution. Solution to
> crashing processes is fixing the bug that causes them to crash, not a
> wrapper that gets them restarted.
>
> > True a regression testing
> > would get rid of the memory leak you hate but if they have to start from
> > the ground up I would rather encourage the dev team to add hotadd
> > upgrade and hotadd features. These things would keep my cluster going
> > even if there were catastrophic problems.
>
> The _LAST_ thing Gluster needs at the moment is more features. Lack of
> stability loses you customers much faster than extra features gain them.
>
> > What I'm saying is that a good top down testing system is something we
> > can discuss here, spec out and perhaps create independently of the
> > development team. I think what most people want is a more stable product
> > and I think a top down approach will get it there faster than trying to
> > implement a given UT system from the bottom up. It will defiantly answer
> > the question "should I upgrade to this release?"
>
> IMO, a top down approach merely glazes over the more fundamental
> problems. You cannot engineer quality from the top down. You design from
> top down, but quality comes from bottom up.
>
> Gordan
>
>
> _______________________________________________
> Gluster-devel mailing list
> Gluster-devel@xxxxxxxxxx
> http://lists.nongnu.org/mailman/listinfo/gluster-devel




[Index of Archives]     [Gluster Users]     [Ceph Users]     [Linux ARM Kernel]     [Linux ARM]     [Linux Omap]     [Fedora ARM]     [IETF Annouce]     [Security]     [Bugtraq]     [Linux]     [Linux OMAP]     [Linux MIPS]     [eCos]     [Asterisk Internet PBX]     [Linux API]

  Powered by Linux