Re: [Gluster-infra] NetBSD regression fixes

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

 



On Sat, Jan 16, 2016 at 06:55:49PM +0100, Emmanuel Dreyfus wrote:
> Hello all
> 
> Here are the problems identified in NetBSD regression so far:
> 
> 1) Before starting regression, slave compains about "vnconfig:
> VNDIOCGET: Bad file descriptor" and fails the run.
> 
> This will be fixed by that changes:
> http://review.gluster.org/13204
> http://review.gluster.org/13205
> 
> 
> 2) Spurious failures
> I added a retry-failed-test-once feature so that we get less regression
> failures because of spurious failures. It is not used right now because
> it does not play nicely with bad tests blacklist.
> 
> This will be fixed by that changes:
> http://review.gluster.org/13245
> http://review.gluster.org/13247
> 
> I have been looping failure-free regression for a while with that trick.

Nice, thanks for these improvements!

> 3) Stale state from previous regression
> We sometime have processes stuck from previous regression, awaiting
> vnode locks for destroyed NFS filesystems. This cause starting cleanup
> scripts to hang before starting regression and we get a timeout.
> 
> I modified slave's /opt/qa/regression.sh to check for stuck processes
> and reboot the system if we find them. That will fail the current
> regression run, but at least the next ones coming after reboot will be
> safe.
> 
> This fix is not deployed yet, I await the fixes from point 2 to be
> merged

Could you send a pull request for the regression.sh script on
https://github.com/gluster/glusterfs-patch-acceptance-tests/ ? Or, if
you dont use GitHub, send the patch by email and we'll take care of
pushing it for you.

> 4) Jenkins casts concurent runs on the same slave
> We observed Jenkins sometimes runs two jobs on the same slave at once,
> which of course can only lead to horrible failure.
> 
> I modified slave's /opt/qa/regression.sh to add a lock file so that this
> situation is detected early and reported. The second regression will
> fail, but the idea is to get a better understanding of how that can
> occur.
> 
> This fix is not deployed yet, I await the fixes from point 2 to be
> merged

Hmm, I have not seen that before, but it surely is something to be
concerned about :-/

Thanks,
Niels

Attachment: signature.asc
Description: PGP signature

_______________________________________________
Gluster-devel mailing list
Gluster-devel@xxxxxxxxxxx
http://www.gluster.org/mailman/listinfo/gluster-devel

[Index of Archives]     [Gluster Users]     [Ceph Users]     [Linux ARM Kernel]     [Linux ARM]     [Linux Omap]     [Fedora ARM]     [IETF Annouce]     [Security]     [Bugtraq]     [Linux]     [Linux OMAP]     [Linux MIPS]     [eCos]     [Asterisk Internet PBX]     [Linux API]

  Powered by Linux