I got the same >>>> 4 reps @ 2372.4955 msec ( 0.4/sec): GetImage XY 500x500 square
With a Radeon 9000 board. Why so dam slow?
Mark Vojkovich wrote:
I haven't seen any benchmarks showing anything slower than anything else.
Mark.
On Sat, 31 Jan 2004, [iso-8859-2] Martin MOKREJ? wrote:
On Sat, 31 Jan 2004, Mark Vojkovich wrote:
Hello Mark, thanks for nice explanation. Could you tell me then why is my system thousand times slower( Radeon 9200, AGP 8x but Xfree4.4.0 RC2 accepts only AGP 4x)?
(--) PCI:*(1:0:0) ATI Technologies Inc Radeon RV280 [Radeon 9200] rev 1, Mem @ 0xd0000000/27, 0xfe8f0000/16, I/O @ 0xb000/8, BIOS @ 0xfe8c0000/17 (--) PCI: (1:0:1) ATI Technologies Inc unknown chipset (0x5941) rev 1, Mem @ 0xc8000000/27, 0xfe8e0000/16
(II) Primary Device is: PCI 01:00:0 (--) Assigning device section with no busID to primary device (WW) RADEON: No matching Device section for instance (BusID PCI:1:0:1) found (--) Chipset ATI Radeon 9200 5961 (AGP) found
(II) Loading sub module "radeon" (II) LoadModule: "radeon" (II) Reloading /usr/X11R6/lib/modules/drivers/radeon_drv.o (II) resource ranges after probing:
(II) Setting vga for screen 0. (II) RADEON(0): MMIO registers at 0xfe8f0000 (II) Loading sub module "vgahw" (II) LoadModule: "vgahw" (II) Loading /usr/X11R6/lib/modules/libvgahw.a (II) Module vgahw: vendor="The XFree86 Project" compiled for 4.3.99.902, module version = 0.1.0 ABI class: XFree86 Video Driver, version 0.7 (II) RADEON(0): vgaHWGetIOBase: hwp->IOBase is 0x03d0, hwp->PIOOffset is 0x0000 (II) RADEON(0): PCI bus 1 card 0 func 0 (**) RADEON(0): Depth 24, (--) framebuffer bpp 32 (II) RADEON(0): Pixel depth = 24 bits stored in 4 bytes (32 bpp pixmaps) (==) RADEON(0): Default visual is TrueColor (**) RADEON(0): Option "AGPMode" "4" (**) RADEON(0): Option "EnablePageFlip" "yes" (==) RADEON(0): RGB weight 888 (II) RADEON(0): Using 8 bits per RGB (8 bit DAC) (II) Loading sub module "int10" (II) LoadModule: "int10" (II) Loading /usr/X11R6/lib/modules/linux/libint10.a (II) Module int10: vendor="The XFree86 Project" compiled for 4.3.99.902, module version = 1.0.0 ABI class: XFree86 Video Driver, version 0.7 (II) RADEON(0): initializing int10 (WW) RADEON(0): Bad V_BIOS checksum
-----------------^ what's that? It's a brand new card!
(II) RADEON(0): Primary V_BIOS segment is: 0xc000 (--) RADEON(0): Chipset: "ATI Radeon 9200 5961 (AGP)" (ChipID = 0x5961) (--) RADEON(0): Linear framebuffer at 0xd0000000 (--) RADEON(0): BIOS at 0xfe8c0000 (--) RADEON(0): VideoRAM: 131072 kByte (128 bit DDR SDRAM) (II) RADEON(0): AGP card detected
(**) RADEON(0): Using AGP 4x mode (II) RADEON(0): AGP Fast Write disabled by default (II) RADEON(0): Depth moves disabled by default (II) Loading sub module "shadowfb" (II) LoadModule: "shadowfb" (II) Loading /usr/X11R6/lib/modules/libshadowfb.a (II) Module shadowfb: vendor="The XFree86 Project" compiled for 4.3.99.902, module version = 1.0.0 ABI class: XFree86 ANSI C Emulation, version 0.2 (II) RADEON(0): Page flipping enabled (!!) RADEON(0): For information on using the multimedia capabilities of this adapter, please see http://gatos.sf.net. (--) Depth 24 pixmap format is 32 bpp (II) do I need RAC? No, I don't.
(II) RADEON(0): Wrote: rd=12, fd=96, pd=1 drmOpenDevice: minor is 0 drmOpenDevice: node name is /dev/dri/card0 drmOpenDevice: open result is -1, (No such device) drmOpenDevice: open result is -1, (No such device) drmOpenDevice: Open failed drmOpenDevice: minor is 0 drmOpenDevice: node name is /dev/dri/card0 drmOpenDevice: open result is -1, (No such device) drmOpenDevice: open result is -1, (No such device) drmOpenDevice: Open failed drmOpenDevice: minor is 0 drmOpenDevice: node name is /dev/dri/card0 drmOpenDevice: open result is 7, (OK) drmGetBusid returned '' (II) RADEON(0): [drm] loaded kernel module for "radeon" driver (II) RADEON(0): [drm] created "radeon" driver at busid "PCI:1:0:0" (II) RADEON(0): [drm] added 8192 byte SAREA at 0xf8d63000 (II) RADEON(0): [drm] mapped SAREA 0xf8d63000 to 0x40016000 (II) RADEON(0): [drm] framebuffer handle = 0xd0000000 (II) RADEON(0): [drm] added 1 reserved context for kernel (II) RADEON(0): [agp] Mode 0x1f004a0f [AGP 0x8086/0x2578; Card 0x1002/0x5961] (II) RADEON(0): [agp] 8192 kB allocated with handle 0x00000001 (II) RADEON(0): [agp] ring handle = 0xe8000000 (II) RADEON(0): [agp] Ring mapped at 0x48267000 (II) RADEON(0): [agp] ring read ptr handle = 0xe8101000 (II) RADEON(0): [agp] Ring read ptr mapped at 0x40018000 (II) RADEON(0): [agp] vertex/indirect buffers handle = 0xe8102000 (II) RADEON(0): [agp] Vertex/indirect buffers mapped at 0x48368000 (II) RADEON(0): [agp] GART texture map handle = 0xe8302000 (II) RADEON(0): [agp] GART Texture map mapped at 0x48568000 (II) RADEON(0): [drm] register handle = 0xfe8f0000 (II) RADEON(0): [dri] Visual configs initialized (II) RADEON(0): CP in BM mode (II) RADEON(0): Using 8 MB GART aperture (II) RADEON(0): Using 1 MB for the ring buffer (II) RADEON(0): Using 2 MB for vertex/indirect buffers (II) RADEON(0): Using 5 MB for GART textures (II) RADEON(0): Memory manager initialized to (0,0) (1280,8191) (II) RADEON(0): Reserved area from (0,1024) to (1280,1026)
(II) RADEON(0): Largest offscreen area available: 1280 x 7165 (II) RADEON(0): Will use back buffer at offset 0x1400000 (II) RADEON(0): Will use depth buffer at offset 0x1900000 (II) RADEON(0): Will use 100352 kb for textures at offset 0x1e00000 (II) RADEON(0): Using XFree86 Acceleration Architecture (XAA) Screen to screen bit blits Solid filled rectangles 8x8 mono pattern filled rectangles Indirect CPU to Screen color expansion Solid Lines Scanline Image Writes Offscreen Pixmaps Setting up tile and stipple cache: 32 128x128 slots 32 256x256 slots 16 512x512 slots (II) RADEON(0): Acceleration enabled (==) RADEON(0): Backing store disabled (==) RADEON(0): Silken mouse enabled (II) RADEON(0): Using hardware cursor (scanline 1026) (II) RADEON(0): Largest offscreen area available: 1280 x 7161 (**) Option "dpms" (**) RADEON(0): DPMS enabled (II) RADEON(0): X context handle = 0x00000001 (II) RADEON(0): [drm] installed DRM signal handler (II) RADEON(0): [DRI] installation complete (II) RADEON(0): [drm] Added 32 65536 byte vertex/indirect buffers (II) RADEON(0): [drm] Mapped 32 vertex/indirect buffers (II) RADEON(0): [drm] dma control initialized, using IRQ 16 (II) RADEON(0): [drm] Initialized kernel GART heap manager, 5111808 (II) RADEON(0): Direct rendering enabled (==) RandR enabled
The x11perf output is the one at the very end. Thanks! Martin
On Sat, 31 Jan 2004, [iso-8859-2] Martin MOKREJ? wrote:
On Sun, 2 Nov 2003, Pieter Hulshoff wrote:
On Sunday 02 November 2003 20:10, Billy Biggs wrote:
So you compared the output of x11perf or xvtest ? I'm curious if you have stats for those.
I don't have xvtest on my computer, but here's the output from x11perf:
x11perf - X11 performance program, version 1.5 The XFree86 Project, Inc server version 40300000 on :0.0 from Spaceballs50 Sun Nov 2 20:16:33 2003
Sync time adjustment is 0.0502 msecs.
1200 reps @ 5.1985 msec ( 192.0/sec): ShmPutImage 500x500 square 1200 reps @ 5.2014 msec ( 192.0/sec): ShmPutImage 500x500 square 1200 reps @ 5.2001 msec ( 192.0/sec): ShmPutImage 500x500 square 1200 reps @ 5.2020 msec ( 192.0/sec): ShmPutImage 500x500 square 1200 reps @ 5.1998 msec ( 192.0/sec): ShmPutImage 500x500 square 6000 trep @ 5.2004 msec ( 192.0/sec): ShmPutImage 500x500 square
This is a perfectly respectable score for 32bpp. It's actually exemplary if you're doing it in software. That's about AGP 2x performance if you're doing it in hardware.
I've just downloaded a new kernel. I'll see if I can get it installed and working, and see if I get any better results that way.
Hi, I just tried on my xfree86 4.4.0 RC2 candidate linux box with Radeon 9200 with DRI enabled:
$ x11perf -getimagexy500
XY images operations are usually in software because no modern hardware can really accelerate them very well. XY images are a fallback from the days when graphics hardware designers couldn't decide whether linear or planar framebuffers were better. Today, it's obvious that linear (the "Z" format) are the way to go. I wish X11 didn't support XY formats. Certainly, nobody should be using them. Also, PC hardware typically can do putimage much quicker than getimage operations.
Mark.
x11perf - X11 performance program, version 1.5 The XFree86 Project, Inc server version 40399902 on :0.0 from aquarius Sat Jan 31 01:35:00 2004
Sync time adjustment is 0.0280 msecs.
4 reps @ 2372.4955 msec ( 0.4/sec): GetImage XY 500x500 square 4 reps @ 2642.7983 msec ( 0.4/sec): GetImage XY 500x500 square 4 reps @ 2372.0420 msec ( 0.4/sec): GetImage XY 500x500 square 4 reps @ 2369.9855 msec ( 0.4/sec): GetImage XY 500x500 square 4 reps @ 2369.9825 msec ( 0.4/sec): GetImage XY 500x500 square 20 trep @ 2425.4608 msec ( 0.4/sec): GetImage XY 500x500 square
$
My system was even slower, it more or less hung my X-windows for a while.
-- Martin Mokrejs <mmokrejs@xxxxxxxxxxxxx> PGP5.0i key is at http://www.natur.cuni.cz/~mmokrejs _______________________________________________ XFree86 mailing list XFree86@xxxxxxxxxxx http://XFree86.Org/mailman/listinfo/xfree86
-- Martin Mokrejs <mmokrejs@xxxxxxxxxxxxx> PGP5.0i key is at http://www.natur.cuni.cz/~mmokrejs
_______________________________________________ XFree86 mailing list XFree86@xxxxxxxxxxx http://XFree86.Org/mailman/listinfo/xfree86
_______________________________________________ XFree86 mailing list XFree86@xxxxxxxxxxx http://XFree86.Org/mailman/listinfo/xfree86