Hi Tedd, On Fri, May 13, 2022 at 4:52 PM Luiz Augusto von Dentz <luiz.dentz@xxxxxxxxx> wrote: > > Hi Ahmad, > > On Fri, May 13, 2022 at 1:14 PM Luiz Augusto von Dentz > <luiz.dentz@xxxxxxxxx> wrote: > > > > Hi Ahmad, > > > > On Fri, May 13, 2022 at 7:10 AM Ahmad Fatoum <a.fatoum@xxxxxxxxxxxxxx> wrote: > > > > > > Hello, > > > > > > On Linux v5.18-rc5, I can reliably crash the kernel on the second (un)pairing > > > with a customer's BLE device. I have bisected the issue and found two commits: > > > > > > - Commit 6cd29ec6ae5e ("Bluetooth: hci_sync: Wait for proper events when > > > connecting LE") causes previously working pairing to time out, presumably > > > because it keeps waiting for the wrong event. > > > > Can you describe in more details what is the second pairing, are you > > pairing 2 devices concurrently? I recall someone for nxp having > > similar problem, at least the traces look pretty similar, the problem > > seems to be the expected event don't match the event the controller > > send, in this case hci_le_enh_conn_complete_evt, so hci_event process > > it and frees the hci_conn instead of first running the callback. > > Looks like my memory failed me on this one, the sync callback is run > last so we shouldn't cleanup the hci_conn at that point, perhaps > something like the following should fix the crash: > > diff --git a/net/bluetooth/hci_event.c b/net/bluetooth/hci_event.c > index 0270e597c285..c1634af670b8 100644 > --- a/net/bluetooth/hci_event.c > +++ b/net/bluetooth/hci_event.c > @@ -5632,10 +5632,8 @@ static void le_conn_complete_evt(struct hci_dev > *hdev, u8 status, > status = HCI_ERROR_INVALID_PARAMETERS; > } > > - if (status) { > - hci_conn_failed(conn, status); > + if (status) > goto unlock; > - } > > if (conn->dst_type == ADDR_LE_DEV_PUBLIC) > addr_type = BDADDR_LE_PUBLIC; > > > > - Commit a56a1138cbd8 ("Bluetooth: hci_sync: Fix not using conn_timeout") > > > fixes, despite the title, what event is waited on. First Pairing works now, > > > but the second pairing times out and crashes the kernel: > > > > > > [ 84.191684] Bluetooth: hci0: Opcode 0x200d failed: -110 > > > [ 84.230478] Bluetooth: hci0: request failed to create LE connection: err -110 > > > [ 84.237690] Unable to handle kernel read from unreadable memory at virtual address 0000000000000ca8 > > That said the error -110 mean -ETIMEDOUT We might want to incorporate some test to the likes of mgmt-tester to check this behavior, afaik the crash can be triggered by just causing a le_conn_complete event with status != 0 which perhaps we need to extend bthost.c to be able to reject connections. > > > -- > Luiz Augusto von Dentz -- Luiz Augusto von Dentz