Error encountered copying data from source path ==> /USB_HDD_3/ due to unknown reason. Please see log.
I assumed that I was loosing the backup drive, and quickly ran a check of the volume using the built in tools. Some errors were found, but FrontView said that it corrected them. Backups still failed. I then assumed that there were deeper drive issues and pulled out SpinRite to run a deep analysis. 80+ hours later, no errors found after a scan of the whole spanned volume. In between, I also tried completely formatting the drive and starting from scratch with an EXT3 volume, formatted through FrontView. No dice there either. So the external USB Drive does not appear to be bad.
The backup doesn't fail immediately, either. It takes a few hours (~3) to hit the problem and then stop.
I decided to start digging in the logs. First the backup logs:
- Code: Select all
Backup failed. Sun Aug 5 09:28:12 PDT 2012.
Reason for failure:
Error encountered copying data from source path ==> /USB_HDD_3/ due to unknown reason. Please see log.
This is the same error every time, but the logs for each of the past few days stop at different files each time. And the logs appear as though the get cut off mid line, like this example:
- Code: Select all
`./Pictures/Personal/2012/2012 - 04 Paris/Paris_CJ-20120411-1833.xmp' -> `/USB_HDD_3/Pictures/Personal/2012/2012 - 04 Paris/Paris_CJ-20120411-1833.xmp'
`./Pictures/Personal/2012/2012 - 04 Paris/Paris_CJ-20120408-0594.xmp' -> `/USB_HDD_3/Pictures/Personal/2012/2012 - 04 Paris/Paris_CJ-20120408-0594.xmp'
`./Pictures/Personal/2012/2012 - 04 Paris/Paris_CJ-20120410-1539.JPG' -> `/USB_HDD_3/Pictures/Personal/2012/2012 - 04 Paris/Paris_CJ-20120410-1539.JPG'
`./Pictures/Personal/2012/2012 - 04 Paris/Paris_CJ-20120410-1022.JPG' -> `/USB_HDD_3/Pictures/Personal/2012/2012 - 04 Paris/Paris_CJ-20120410-1022.JPG'
`./Pictures/Personal/2012/2012 - 04 Paris/Paris_CJ-20120408-0552.xmp' -> `/USB_HDD_3/Pictures/Personal/2
Backup failed. Sat Aug 4 09:06:40 PDT 2012.
Reason for failure:
Error encountered copying data from source path ==> /USB_HDD_3/ due to unknown reason. Please see log.
So I dug in a different set of logs, the system log. What I find is that at some point around the 3 hour mark in the backup job (each and every time), something happens that generates a large amount of log information in a single second of time, and then, a few seconds later, the backup fails. This is an example of those bits from the system log:
- Code: Select all
Aug 5 09:28:11 bach kernel: leafp2p invoked oom-killer: gfp_mask=0x201da, order=0, oom_adj=0, oom_score_adj=0
Aug 5 09:28:11 bach kernel: Pid: 2817, comm: leafp2p Tainted: P 2.6.37.6.RNx86_64.2.4 #1
Aug 5 09:28:11 bach kernel: Call Trace:
Aug 5 09:28:11 bach kernel: [<ffffffff8807eeaf>] dump_header+0x6f/0x190
Aug 5 09:28:11 bach kernel: [<ffffffff8808ee2a>] ? wait_iff_congested+0xba/0x120
Aug 5 09:28:11 bach kernel: [<ffffffff8804d500>] ? autoremove_wake_function+0x0/0x40
Aug 5 09:28:11 bach kernel: [<ffffffff8807f3be>] oom_kill_process+0x27e/0x290
Aug 5 09:28:11 bach kernel: [<ffffffff8807f61e>] out_of_memory+0x24e/0x300
Aug 5 09:28:11 bach kernel: [<ffffffff880825a3>] __alloc_pages_nodemask+0x5f3/0x610
Aug 5 09:28:11 bach kernel: [<ffffffff88084c47>] __do_page_cache_readahead+0xe7/0x220
Aug 5 09:28:11 bach kernel: [<ffffffff88084d9c>] ra_submit+0x1c/0x20
Aug 5 09:28:11 bach kernel: [<ffffffff8807e319>] filemap_fault+0x3b9/0x3c0
Aug 5 09:28:11 bach kernel: [<ffffffff88092da0>] __do_fault+0x50/0x490
Aug 5 09:28:11 bach kernel: [<ffffffff88002bce>] ? apic_timer_interrupt+0xe/0x20
Aug 5 09:28:11 bach kernel: [<ffffffff88094f8c>] handle_mm_fault+0x1ac/0x840
Aug 5 09:28:11 bach kernel: [<ffffffff88051051>] ? hrtimer_try_to_cancel+0x41/0x80
Aug 5 09:28:11 bach kernel: [<ffffffff88020139>] do_page_fault+0x139/0x420
Aug 5 09:28:11 bach kernel: [<ffffffff8805145e>] ? hrtimer_nanosleep+0x9e/0x120
Aug 5 09:28:11 bach kernel: [<ffffffff88050970>] ? hrtimer_wakeup+0x0/0x30
Aug 5 09:28:11 bach kernel: [<ffffffff885b479f>] page_fault+0x1f/0x30
Aug 5 09:28:11 bach kernel: Mem-Info:
Aug 5 09:28:11 bach kernel: DMA per-cpu:
Aug 5 09:28:11 bach kernel: CPU 0: hi: 0, btch: 1 usd: 0
Aug 5 09:28:11 bach kernel: CPU 1: hi: 0, btch: 1 usd: 0
Aug 5 09:28:11 bach kernel: DMA32 per-cpu:
Aug 5 09:28:11 bach kernel: CPU 0: hi: 186, btch: 31 usd: 70
Aug 5 09:28:11 bach kernel: CPU 1: hi: 186, btch: 31 usd: 124
Aug 5 09:28:11 bach kernel: active_anon:88528 inactive_anon:88589 isolated_anon:0
Aug 5 09:28:11 bach kernel: active_file:8187 inactive_file:8166 isolated_file:352
Aug 5 09:28:11 bach kernel: unevictable:381 dirty:0 writeback:0 unstable:0
Aug 5 09:28:11 bach kernel: free:5061 slab_reclaimable:5675 slab_unreclaimable:4323
Aug 5 09:28:11 bach kernel: mapped:137 shmem:13 pagetables:836 bounce:0
Aug 5 09:28:11 bach kernel: DMA free:4184kB min:248kB low:308kB high:372kB active_anon:5084kB inactive_anon:5280kB active_file:456kB inactive_file:452kB unevictable:0kB isolated(anon):0kB isolated(file):0kB present:15680kB mlocked:0kB dirty:0kB writeback:0kB mapped:4kB shmem:0kB slab_reclaimable:392kB slab_unreclaimable:0kB kernel_stack:0kB pagetables:16kB unstable:0kB bounce:0kB writeback_tmp:0kB pages_scanned:1795 all_unreclaimable? no
Aug 5 09:28:11 bach kernel: lowmem_reserve[]: 0 985 985 985
Aug 5 09:28:11 bach kernel: DMA32 free:16060kB min:16132kB low:20164kB high:24196kB active_anon:349028kB inactive_anon:349076kB active_file:32292kB inactive_file:32084kB unevictable:1524kB isolated(anon):0kB isolated(file):1536kB present:1008672kB mlocked:0kB dirty:0kB writeback:0kB mapped:544kB shmem:52kB slab_reclaimable:22308kB slab_unreclaimable:17292kB kernel_stack:880kB pagetables:3328kB unstable:0kB bounce:0kB writeback_tmp:0kB pages_scanned:109525 all_unreclaimable? yes
Aug 5 09:28:11 bach kernel: lowmem_reserve[]: 0 0 0 0
Aug 5 09:28:11 bach kernel: DMA: 50*4kB 0*8kB 3*16kB 2*32kB 1*64kB 2*128kB 2*256kB 0*512kB 1*1024kB 1*2048kB 0*4096kB = 4216kB
Aug 5 09:28:11 bach kernel: DMA32: 1727*4kB 157*8kB 22*16kB 16*32kB 10*64kB 6*128kB 4*256kB 1*512kB 0*1024kB 0*2048kB 1*4096kB = 16068kB
Aug 5 09:28:11 bach kernel: 24397 total pagecache pages
Aug 5 09:28:11 bach kernel: 7268 pages in swap cache
Aug 5 09:28:11 bach kernel: Swap cache stats: add 979679, delete 972411, find 135337/162625
Aug 5 09:28:11 bach kernel: Free swap = 0kB
Aug 5 09:28:11 bach kernel: Total swap = 524272kB
Aug 5 09:28:11 bach kernel: 259744 pages RAM
Aug 5 09:28:11 bach kernel: 6516 pages reserved
Aug 5 09:28:11 bach kernel: 21595 pages shared
Aug 5 09:28:11 bach kernel: 226536 pages non-shared
Aug 5 09:28:11 bach kernel: [ pid ] uid tgid total_vm rss cpu oom_adj oom_score_adj name
Aug 5 09:28:11 bach kernel: [ 785] 0 785 450 90 1 0 0 ifplugd
Aug 5 09:28:11 bach kernel: [ 904] 0 904 586 2 0 -17 -1000 udevd
Aug 5 09:28:11 bach kernel: [ 1852] 0 1852 424 11 1 0 0 acpid
Aug 5 09:28:11 bach kernel: [ 1858] 0 1858 422 2 1 0 0 raidard
Aug 5 09:28:11 bach kernel: [ 1895] 0 1895 1365 0 1 0 0 sshd
Aug 5 09:28:11 bach kernel: [ 1991] 0 1991 633 0 0 0 0 quota_nld
Aug 5 09:28:11 bach kernel: [ 1996] 1 1996 453 0 1 0 0 portmap
Aug 5 09:28:11 bach kernel: [ 2041] 0 2041 437 23 1 0 0 syslogd
Aug 5 09:28:11 bach kernel: [ 2054] 0 2054 489 0 0 0 0 atd
Aug 5 09:28:11 bach kernel: [ 2059] 0 2059 425 6 0 0 0 klogd
Aug 5 09:28:11 bach kernel: [ 2072] 0 2072 940 37 1 0 0 cupsd
Aug 5 09:28:11 bach kernel: [ 2077] 21 2077 688 1 1 0 0 dbus-daemon
Aug 5 09:28:11 bach kernel: [ 2079] 0 2079 494 26 1 0 0 cron
Aug 5 09:28:11 bach kernel: [ 2114] 0 2114 3731 50 1 0 0 smbd
Aug 5 09:28:11 bach kernel: [ 2115] 0 2115 3728 0 1 0 0 smbd
Aug 5 09:28:11 bach kernel: [ 2549] 0 2549 431 5 0 0 0 ifplugd
Aug 5 09:28:11 bach kernel: [ 2690] 0 2690 431 8 0 0 0 ifplugd
Aug 5 09:28:11 bach kernel: [ 2737] 0 2737 13025 5565 1 0 0 minidlna
Aug 5 09:28:11 bach kernel: [ 2748] 0 2748 534 0 0 0 0 cnid_metad
Aug 5 09:28:11 bach kernel: [ 2751] 0 2751 531 21 1 0 0 chronyd
Aug 5 09:28:11 bach kernel: [ 2769] 0 2769 1779 50 1 0 0 afpd
Aug 5 09:28:11 bach kernel: [ 2783] 0 2783 1201 0 0 0 0 leafp2p.sh
Aug 5 09:28:11 bach kernel: [ 2784] 0 2784 9083 102 0 0 0 leafp2p
Aug 5 09:28:11 bach kernel: [ 2844] 0 2844 3976 9 1 0 0 mt-daapd
Aug 5 09:28:11 bach kernel: [ 2873] 0 2873 431 5 1 0 0 ifplugd
Aug 5 09:28:11 bach kernel: [ 2906] 0 2906 2227 59 0 0 0 nmbd
Aug 5 09:28:11 bach kernel: [ 2957] 0 2957 603 123 0 0 0 monitor_enclosu
Aug 5 09:28:11 bach kernel: [ 3181] 98 3181 481 11 1 -17 -1000 hidups
Aug 5 09:28:11 bach kernel: [ 3215] 98 3215 483 11 1 -17 -1000 upsd
Aug 5 09:28:11 bach kernel: [ 3228] 0 3228 440 12 1 -17 -1000 upsmon
Aug 5 09:28:11 bach kernel: [ 3250] 20 3250 711 30 1 0 0 avahi-daemon
Aug 5 09:28:11 bach kernel: [ 3254] 0 3254 425 2 1 0 0 getty
Aug 5 09:28:11 bach kernel: [19348] 0 19348 9072 4348 1 0 0 apache-ssl
Aug 5 09:28:11 bach kernel: [19373] 0 19373 424 15 1 0 0 upnpd
Aug 5 09:28:11 bach kernel: [19375] 0 19375 444 22 0 0 0 lld2d
Aug 5 09:28:11 bach kernel: [19385] 98 19385 9072 4347 1 0 0 apache-ssl
Aug 5 09:28:11 bach kernel: [19392] 0 19392 1055 24 1 0 0 rsync
Aug 5 09:28:11 bach kernel: [19418] 0 19418 1201 21 1 0 0 remote.sh
Aug 5 09:28:11 bach kernel: [19420] 0 19420 1262 18 1 0 0 remote
Aug 5 09:28:11 bach kernel: [18448] 0 18448 659 29 1 0 0 cron
Aug 5 09:28:11 bach kernel: [18449] 0 18449 625 18 1 0 0 sh
Aug 5 09:28:11 bach kernel: [18450] 0 18450 4576 3652 1 0 0 backup
Aug 5 09:28:11 bach kernel: [18465] 0 18465 653 59 0 0 0 sh
Aug 5 09:28:11 bach kernel: [18495] 0 18495 280504 155383 0 0 0 cp
Aug 5 09:28:11 bach kernel: Out of memory: Kill process 18495 (cp) score 696 or sacrifice child
Aug 5 09:28:11 bach kernel: Killed process 18495 (cp) total-vm:1122016kB, anon-rss:621532kB, file-rss:0kB
Aug 5 09:28:15 bach RAIDiator: Error encountered copying data from source path ==> /USB_HDD_3/ due to unknown reason. Please see log. [Job 001]
So what do I do? My NAS seems to be working, but I like to have it backed up just in case and I am pretty tired of seeing the error in my inbox every morning. Let me know what other information I can provide.
