3ware Raid controllers and 64bit Etch

Dear List,

I have been having data corruption problems for the last two months on
7 servers.

After extensive testing, I have finally narrowed the problem down to
the Debian Etch 2.6.18-5 kernel in combination with the 3ware PCI
controller. The same machine using the onboard SATA controller does
not corrupt data.

The machines would also hang occasionally - no errors displayed on
screen.

I upgraded to a 2.6.23-13 kernel.org kernel 24 hours ago, and have not
been able to reproduce these problems since then. Previously it would
take about 10 minutes for the problem to appear.

I could reproduce these problems by using a java program to insert
logs (30,000,000 records) into a local postgres 8.2.5 database.

After this I would see messages like
"DETAIL: Could not open file "pg_clog/0495": No such file or
directory."
in my postgres logs.

I had also managed to corrupt my SVN repository - md5s of the files no
longer matched what was in the SVN database...
(svnadmin verify /path/to/repository)
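
For anyone who wants to look for this kind of silent corruption without a full database load, here is a minimal sketch of a write-then-verify test. It is not the java program mentioned above; the function names and the chunk size are made up for illustration. Note that a read straight after the write may be served from the page cache, so a real test would need a file larger than RAM (or a reboot between write and verify) to be sure the data comes back from the controller.

```python
import hashlib
import os


def write_and_hash(path, total_bytes, chunk_size=1 << 20):
    """Write total_bytes of random data to path; return the MD5 of what was written."""
    digest = hashlib.md5()
    remaining = total_bytes
    with open(path, "wb") as f:
        while remaining > 0:
            chunk = os.urandom(min(chunk_size, remaining))
            f.write(chunk)
            digest.update(chunk)
            remaining -= len(chunk)
        # Push the data out of Python's and the kernel's buffers to the device.
        f.flush()
        os.fsync(f.fileno())
    return digest.hexdigest()


def hash_file(path, chunk_size=1 << 20):
    """Read path back in chunks and return its MD5 hex digest."""
    digest = hashlib.md5()
    with open(path, "rb") as f:
        for chunk in iter(lambda: f.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()
```

If `hash_file(path)` ever differs from the value returned by `write_and_hash(path, size)` for a file nobody else touched, the data changed somewhere between the write and the read.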

Has anyone seen these problems?

Below - details as to my raid controller.

Regards

Andrew
---

03:05.0 RAID bus controller: 3ware Inc 7xxx/8xxx-series PATA/SATA-RAID (rev 01)
        Subsystem: 3ware Inc 7xxx/8xxx-series PATA/SATA-RAID
        Control: I/O+ Mem+ BusMaster+ SpecCycle+ MemWINV+ VGASnoop- ParErr- Stepping- SERR+ FastB2B-
        Status: Cap+ 66MHz+ UDF- FastB2B+ ParErr- DEVSEL=medium >TAbort- SERR-

3ware Raid controllers and 64bit Etch

On 11/01/2008, at 2:43 PM, Andrew Miehs wrote:

> Dear List,
>
> I have been having data corruption problems for the last two months
> on 7 servers.
>
>
> After extensive testing, I have finally narrowed the problem down to
> Debian Etch 2.6.18-5 kernel
> with the 3ware PCI controller. The same machine using the onboard
> SATA controller does not
> corrupt data.

Just to make this clear - this is with 64bit version of Etch - I have
not tested the 32bit version
with the Raid controller.

Andrew

3ware Raid controllers and 64bit Etch

Andrew Miehs wrote:
>
> On 11/01/2008, at 2:43 PM, Andrew Miehs wrote:
>
>> Dear List,
>>
>> I have been having data corruption problems for the last two months on
>> 7 servers.
>>
>>
>> After extensive testing, I have finally narrowed the problem down to
>> Debian Etch 2.6.18-5 kernel
>> with the 3ware PCI controller. The same machine using the onboard SATA
>> controller does not
>> corrupt data.
>
> Just to make this clear - this is with 64bit version of Etch - I have
> not tested the 32bit version
> with the Raid controller.
>

I've been using a 9650SE-8LPML in RAID 6 (with battery) under 64bit
Etch without any problems. It's as heavily loaded as bonded dual 100 Mbit
links can let it be. You didn't say what controller and RAID level you're
using, though.

~Seth

--

3ware Raid controllers and 64bit Etch

The machines are using a 3ware 64 bit PCI Raid controller with 2 ports
(or 4, I can't remember exactly).
The cards are in a 32bit PCI slot and the disks are SATA and mirrored
(2 disks).
Unfortunately I don't know exactly which controller it is, as I do not
have physical access to the machines (they are supplied by a hosting
provider). Is there any way to find out exactly which cards they are
from the OS?

Cheers

Andrew

On 11/01/2008, at 6:13 PM, Seth Mattinen wrote:

> Andrew Miehs wrote:
>> On 11/01/2008, at 2:43 PM, Andrew Miehs wrote:
>>> Dear List,
>>>
>>> I have been having data corruption problems for the last two
>>> months on 7 servers.
>>>
>>>
>>> After extensive testing, I have finally narrowed the problem down
>>> to Debian Etch 2.6.18-5 kernel
>>> with the 3ware PCI controller. The same machine using the onboard
>>> SATA controller does not
>>> corrupt data.
>> Just to make this clear - this is with 64bit version of Etch - I
>> have not tested the 32bit version
>> with the Raid controller.
>
> I've been using an 9650SE-8LPML in RAID 6 (with battery) under 64bit
> Etch without any problems. It's as heavily loaded as bonded dual 100
> can let it be. Although you didn't say what controller and RAID
> level you're using.

3ware Raid controllers and 64bit Etch

After searching the web...

Looks like it is a "3ware 8006-2LP" controller...

Andrew

On 11/01/2008, at 7:10 PM, Andrew Miehs wrote:

> The machines are using a 3ware 64 bit PCI Raid controller - with 2
> (or 4 can't remember exactly).
> They are in a 32bit PCI slot and the disks are SATA and mirrored (2
> disks)
> Unfortunately I don't know exactly which controller as I do not have
> physical access to the machines.
> (The machines are supplied by a hosting provider) - Is there anyway
> to find out exactly which cards they
> are from the OS?
>
> Cheers
>
> Andrew
>

3ware Raid controllers and 64bit Etch

Hi,

I have the exact same problem, and finally found this link:

http://www.3ware.com/KB/article.aspx?id=15243&cNode=6I1C6S

It says:

Quote:

If you have an Intel 64-bit EMT64 system with 4 GB or more of system RAM, then you should not use the 7000/8000 series in kernel driver 3w-xxxx if you are using Linux kernels 2.6.15 through 2.6.22.

Instead you should use the driver source from this KB article to compile a new driver.

You can still install Linux by specifying mem=2048M, then update the driver after you install.

3ware 9000 series controllers are not affected by this issue.

I'll try this.
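
The conditions in the quoted KB text can be written down as a small check. This is only a sketch of the rule as quoted above, not a tool from 3ware; the function names are hypothetical, and the caller is assumed to supply the installed RAM in GB and whether the machine is a 64-bit system.

```python
import re


def parse_kernel_version(release):
    """Turn a release string such as '2.6.18-5-amd64' into a tuple like (2, 6, 18)."""
    match = re.match(r"(\d+)\.(\d+)\.(\d+)", release)
    if match is None:
        raise ValueError("unrecognised kernel release: %r" % release)
    return tuple(int(part) for part in match.groups())


def may_be_affected(release, installed_ram_gb, is_64bit, driver="3w-xxxx"):
    """Apply the quoted KB rule: 64-bit system, 4 GB or more of RAM,
    in-kernel 3w-xxxx driver, kernel 2.6.15 through 2.6.22."""
    version = parse_kernel_version(release)
    return (
        is_64bit
        and driver == "3w-xxxx"
        and installed_ram_gb >= 4
        and (2, 6, 15) <= version <= (2, 6, 22)
    )
```

For example, `may_be_affected("2.6.18-5-amd64", 4, True)` is true, while the same machine on a 2.6.23 kernel, or booted with less than 4 GB visible, falls outside the rule.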

3ware Raid controllers and 64bit Etch

Well it seems to work perfectly.

I just replaced 3w-xxxx.[ch] in the kernel source tree with the files provided by 3ware at the link above, and rebuilt.

Re: 3ware Raid controllers and 64bit Etch

I built a new 3w-xxxx.ko and placed it in the modules directory:
/lib/modules/2.6.18-5-xen-vserver-amd64/drivers/scsi

After restarting the server the driver version remains the same.
I expected Version 1.26.03.000
but it was still the former v2.26.02.007

What could be wrong?

Thank you!
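
One thing worth ruling out here (an assumption, the thread does not confirm the cause) is that a second copy of the module still exists elsewhere under the modules tree, or in the initial ramdisk, and is the one that actually gets loaded. A minimal sketch for listing every copy under a given modules directory; the function name is hypothetical:

```python
import os


def find_module_copies(modules_root, filename="3w-xxxx.ko"):
    """Return the sorted paths of every file called filename below modules_root."""
    matches = []
    for dirpath, _dirnames, filenames in os.walk(modules_root):
        if filename in filenames:
            matches.append(os.path.join(dirpath, filename))
    return sorted(matches)
```

Calling `find_module_copies("/lib/modules/2.6.18-5-xen-vserver-amd64")` and getting more than one path back would suggest that the rebuilt file is not the only candidate.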

3ware Raid controllers and 64bit Etch

You can use tw_cli from the 3ware site to find out which card it is.
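
If installing tw_cli is not an option, the PCI IDs can at least be read from sysfs. The sketch below assumes the usual Linux layout under /sys/bus/pci/devices and assumes that 0x13c1 is the PCI vendor ID registered to 3ware; the function name is made up. It only yields the vendor and device IDs, which identify the controller series (the same information as the lspci output earlier in the thread), not the exact model.

```python
import os

# Assumption: 0x13c1 is the PCI vendor ID assigned to 3ware.
THREEWARE_VENDOR_ID = "0x13c1"


def find_3ware_devices(sysfs_pci_root="/sys/bus/pci/devices"):
    """Return (pci_address, device_id) pairs for devices with the 3ware vendor ID."""
    found = []
    for address in sorted(os.listdir(sysfs_pci_root)):
        device_dir = os.path.join(sysfs_pci_root, address)
        try:
            with open(os.path.join(device_dir, "vendor")) as f:
                vendor = f.read().strip()
            with open(os.path.join(device_dir, "device")) as f:
                device = f.read().strip()
        except OSError:
            # Skip entries that are not readable device directories.
            continue
        if vendor.lower() == THREEWARE_VENDOR_ID:
            found.append((address, device))
    return found
```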

Andrew Miehs wrote:
> After searching the web...
>
> Looks like it is a "3ware 8006-2LP" controller...
>
> Andrew
>
> On 11/01/2008, at 7:10 PM, Andrew Miehs wrote:
>
>> The machines are using a 3ware 64 bit PCI Raid controller - with 2 (or
>> 4 can't remember exactly).
>> They are in a 32bit PCI slot and the disks are SATA and mirrored (2
>> disks)
>> Unfortunately I don't know exactly which controller as I do not have
>> physical access to the machines.
>> (The machines are supplied by a hosting provider) - Is there anyway to
>> find out exactly which cards they
>> are from the OS?
>>
>> Cheers
>>
>> Andrew
>>

--
