Am Montag, 11. April 2016, 13:54:51 schrieb Javier Martinez Canillas: > Hello Heiko, > > Sorry for the late response, I got this email while I was sick and > forgot to answer when I recovered. > > On Sat, Apr 2, 2016 at 9:37 AM, Heiko Stuebner <heiko at sntech.de> wrote: > > Hi Shawn, > > > > Am Samstag, 2. April 2016, 17:56:17 schrieb Shawn Lin: > >> ? 2016/4/2 7:52, Heiko Stuebner ??: > >> > it looks like commit ea9849113343 ("spi: rockchip: check return value > >> > of > >> > dmaengine_prep_slave_sg") negatively affects the cros_ec spi backend. > >> > > >> > During boot on the most recent mainline kernel I get: > >> > > >> > [ 1.025480] cros-ec-spi spi0.0: Chrome EC device registered > >> > [ 1.641636] input: cros_ec as > >> > /devices/platform/ff110000.spi/spi_master/spi0/spi0.0/ff110000.spi:ec at 0 > >> > > >> > :keyboard-controller/input/input0 [ 2.340214] cros-ec-spi spi0.0: EC > >> > > >> > failed to respond in time > >> > [ 2.357735] cros-ec-spi spi0.0: packet too long (249 bytes, expected > >> > 4) [ 2.470353] cros-ec-spi spi0.0: EC failed to respond in time > >> > [ 2.508495] cros-ec-spi spi0.0: packet too long (249 bytes, expected > >> > 4) [ 2.620176] cros-ec-spi spi0.0: EC failed to respond in time > >> > [ 2.637345] cros-ec-spi spi0.0: packet too long (249 bytes, expected > >> > 4) [ 2.750245] cros-ec-spi spi0.0: EC failed to respond in time > >> > [ 2.767519] cros-ec-spi spi0.0: packet too long (249 bytes, expected > >> > 4) > >> > > >> > The cros-ec works ok after boot without further errors [aka keyboard > >> > and everything working correctly] and I haven't been able to figure out > >> > what goes wrong, but was able to bisect the issue down to the commit > >> > mentioned above. > >> > >> Which Soc I can reproduce it? > > > > I can see that on both a veyron-pinky as well as a veyron-jerry, so the > > rk3288-based devices. > > > >> I'm not able to find out how this commit to break the cros-ec. So I > >> prone to think this commit just expose some issues rather than > >> introducing negatively affects. I guess, before this commit, cros-ec > >> always get successful transfer, but the reality is that the tranfer > >> does failed in the early booting stage but spi-rockchip doesn't log out > >> anything. If that is the case, that means spi-rockchip fails to prepare > >> sg for some unknown reasons? > > > > That is a possibility. > > I also agree with this theory. AFAICT the mentioned commit is only > adding some checks so it seems the problem is with on the spi-rockchip > driver. > > > I haven't had much experience with both spi and cros-ec and it seems I've > > forgotten to also include Javier in my Cc-list who die the cros-ec > > mainlining. I've corrected that now and maybe he has some additional idea > > what may go wrong there. > > Unfortunately I'm neither familiar with Rockchip platforms nor have > access to Rockchip based Chromebooks to reproduce this issue. But I > don't see this error on my Exynos based Chromebooks so that's another > sign that the issue may be in the spi-rockchip driver. > > I've added to the cc list to Enric and Tomeu who were working on the > cros_ec drivers lately and AFAIK have access to some Rockchip > Chromebooks. I dug a bit further now, and it really seems to be the rockchip-spi (or more importantly the dma-part) at fault. If I just set the use_dma always to 0 and thus force pio mode, everything works fine. With dma transfers enabled it seems to bunch commands together in some way. For packets that are reported as the controller not responding, it seems like they were already transfered with the previous to long packet in cros_ec_cmd_xfer_spi [0]: [ 4.928480] cros-ec-spi spi0.0: packet too long (249 bytes, expected 4) [ 4.935104] cros-ec-spi spi0.0: bytes: [ 4.949560] 3 f9 0 0 4 0 0 0 4 0 0 0 0 0 0 0 ed ed ed ed ed ed ed ed ed ed ed ed ed ed ed ed 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 [ 5.010978] cros-ec-spi spi0.0: packet too long (249 bytes, expected 4) [ 5.017599] cros-ec-spi spi0.0: bytes: [ 5.022044] 3 f9 0 0 4 0 0 0 fa fa fa fa fa fa fa fa ec 3 f9 0 0 4 0 0 0 0 0 0 0 ed ed ed 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 [ 5.071057] cros_ec_spi_receive_packet: needing 124 bytes [ 5.312045] cros-ec-spi spi0.0: EC failed to respond in time [ 5.329564] cros-ec-spi spi0.0: packet too long (249 bytes, expected 4) [ 5.336197] cros-ec-spi spi0.0: bytes: [ 5.340651] 3 f9 0 0 4 0 0 0 fa fa fa fa fa fa ec 3 f9 0 0 4 0 0 0 0 0 0 0 ed ed ed ed ed 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 [ 5.389937] cros_ec_spi_receive_packet: needing 124 bytes [ 5.597082] cros-ec-spi spi0.0: EC failed to respond in time [ 5.603102] cros_ec_spi_receive_packet: needing 124 bytes [ 5.812077] cros-ec-spi spi0.0: EC failed to respond in time It also only affects the big packages, all the 10-20 byte long packages are transfered nicely it seems. I'll dig further, but if anybody has an idea where the dma-code makes a wrong turn I would be happy about pointers :-) [0] diff is: diff --git a/drivers/mfd/cros_ec_spi.c b/drivers/mfd/cros_ec_spi.c index ebe9b94..24da42a 100644 --- a/drivers/mfd/cros_ec_spi.c +++ b/drivers/mfd/cros_ec_spi.c @@ -584,6 +589,10 @@ static int cros_ec_cmd_xfer_spi(struct cros_ec_device *ec_dev, if (len > ec_msg->insize) { dev_err(ec_dev->dev, "packet too long (%d bytes, expected %d)", len, ec_msg->insize); +dev_err(ec_dev->dev, "bytes: "); +for (i = 0; i < len; i++) + printk("%x ", ptr[i]); +printk("\n"); ret = -ENOSPC; goto exit; }