Re: /dev/bcacheX can still be written when plug out backing device

Coly Li <i@xxxxxxx> · Sun, 1 Oct 2017 23:57:22 +0800

On 2017/10/1 下午11:51, qiulg wrote:
> That's bad news. :( 
> What is expected is freezing write I/O and return error to application when cache device or backing device failed. Hope the patch can be released soon.  
> 

Yes this is one of the expected behavior.
Another expectation is after backing device is failed, bcache complains,
but still permits cache device to cache a pre-defined amount of data,
then freeing write I/O.

Once the patch is available, I will let you know :-)

Coly

> Thanks. 
> Qiu
>  
> ------------------ Original ------------------
> From:  "Coly Li"<i@xxxxxxx>;
> Date:  Sun, Oct 1, 2017 10:50 PM
> To:  "qiulg"<qiulg@xxxxxxxxxxxxxxxxx>;
> Cc:  "linux-bcache"<linux-bcache@xxxxxxxxxxxxxxx>; "朱菁"<zhujing@xxxxxxxxxxxxxxxxx>; "秦晓强"<qinxq@xxxxxxxxxxxxxxxxx>;
> Subject:  Re: /dev/bcacheX can still be written when plug out backing device
>  
> On 2017/10/1 下午10:29, qiulg wrote:
>> Thanks Coly. Sorry for the wrong subject, changed. 
>>
>> Tridd dd with oflag=direct, the bcache device can still be written without backing device, but this time it finished at about 7.7GB.
>>
>> # dd if=/dev/zero of=/dev/bcache4 oflag=direct 
>>  dd: writing to `/dev/bcache4': Input/output error
>> 15052824+0 records in
>> 15052823+0 records out
>> 7707045376 bytes (7.7 GB) copied, 1022.63 s, 7.5 MB/s 
>>
> 
> Aha, it is backing device...I was blind...
> The situation you hit is the backing device failure, which is not
> properly handled by bcache yet. (indeed cache device failure is not
> properly handled neither)
> 
> You are not the first one complains on this. I also receive such a
> similar feature request, but it is on the tail of my task list. If
> someone may contribute patches, I am glad to test.
> 
> Thanks.
> 
> Coly
> 
>> ------------------ Original ------------------
>> From:  "Coly Li"<i@xxxxxxx>;
>> Date:  Sat, Sep 30, 2017 08:18 PM
>> To:  "qiulg"<qiulg@xxxxxxxxxxxxxxxxx>;
>> Cc:  "linux-bcache"<linux-bcache@xxxxxxxxxxxxxxx>;
>> Subject:  Re: zhujing@xxxxxxxxxxxxxxxx; qinxq@xxxxxxxxxxxxxxxx;qiulg@xxxxxxxxxxxxxxxx
>>  
>> On 2017/9/30 下午6:50, qiulg wrote:
>>> Hi,
>>>  
>>> Our environment is Redhat Enterprise 6.8, upgrade to kernal 4.4.89. We use sdj(480G) as as backing devices and sdk1(20G) as cache device. To test maintenance step when hard disk fail, we plug out the backing device sdj.
>>> Here are the results during test:
>>>  
>>> # lsscsi
>>> -- the sdj device is gone.
>>>  
>>> # ls -l /dev/bcache4
>>> brw-rw---- 1 root root 251, 4 Sep 30 18:00 /dev/bcache4
>>> -- the bcache4 device is still there
>>>  
>>> # dd if=/dev/bcache4 of=/dev/null bs=1M count=100
>>> dd: reading `/dev/bcache4': Input/output error
>>> 0+0 records in
>>> 0+0 records out
>>> 0 bytes (0 B) copied, 0.00201728 s, 0.0 kB/s
>>> -- this is resonable
>>>  
>>> # fdisk -l /dev/bcache4
>>> -- fdisk has no result. This is resonable
>>>  
>>> # dd if=/dev/zero of=/dev/bcache4 bs=1M count=100
>>> 100+0 records in
>>> 100+0 records out
>>> 104857600 bytes (105 MB) copied, 0.230408 s, 455 MB/s
>>> -- bcache4 can still be written. If the backing device is really destroyed, then application don't know that, and thought data is saved. It's not acceptable. The application should get I/O error.
>>>  
>>> # dd if=/dev/bcache4 of=/dev/null bs=1M count=100
>>> dd: reading `/dev/bcache4': Input/output error
>>> 46+1 records in
>>> 46+1 records out
>>> 48238592 bytes (48 MB) copied, 0.120517 s, 400 MB/s
>>> -- now read still shows error but 48MB data read out.
>>>  
>>> # fdisk -l /dev/bcache4
>>> Note: sector size is 4096 (not 512)
>>> Disk /dev/bcache4: 480.1 GB, 480103972864 bytes
>>> 255 heads, 63 sectors/track, 7296 cylinders
>>> Units = cylinders of 16065 * 4096 = 65802240 bytes
>>> Sector size (logical/physical): 4096 bytes / 4096 bytes
>>> I/O size (minimum/optimal): 4096 bytes / 4096 bytes
>>> Disk identifier: 0x00000000
>>> -- fdisk got answer. Strange. 480GB is the backing device size.
>>>  
>>> # dd if=/dev/zero of=/dev/bcache4 bs=1M
>>> dd: writing `/dev/bcache4': No space left on device
>>> 457863+0 records in
>>> 457862+0 records out
>>> 480103972864 bytes (480 GB) copied, 620.993 s, 773 MB/s
>>> -- keep writing to bcache4, after 480GB, it says "no space left". But the backing device is 480G, cache device is only 20G, where did the data go?
>>> -- during dd, the /var/log/message shows error
>>> ...
>>> Sep 30 17:52:35 s103 kernel: Buffer I/O error on dev bcache4, logical block 2224259, lost async page write
>>> Sep 30 17:52:35 s103 kernel: Buffer I/O error on dev bcache4, logical block 2224260, lost async page write
>>> Sep 30 17:52:35 s103 kernel: Buffer I/O error on dev bcache4, logical block 2224261, lost async page write
>>> Sep 30 17:52:40 s103 kernel: buffer_io_error: 1093169 callbacks suppressed
>>> Sep 30 17:52:40 s103 kernel: Buffer I/O error on dev bcache4, logical block 3317548, lost async page write
>>> ...
>>>  
>>> I think correct behaviour should be, if backing device lost, "dd" (the application) should know rightaway, got I/O error.
>>>  
>>> Appreciated for any help.
>>>  
>> Hi,
>>
>> Could you please try oflag=direct and observe what is the difference ?
>>
>> Buffered I/O is asynchronized, application is not able to get an
>> immediate -EIO.
>>
> 

-- 
Coly Li
--
To unsubscribe from this list: send the line "unsubscribe linux-bcache" in
the body of a message to majordomo@xxxxxxxxxxxxxxx
More majordomo info at  http://vger.kernel.org/majordomo-info.html