Quantcast

mfi timeout issues and patch that seems to work (PERC H800)

classic Classic list List threaded Threaded
4 messages Options
Reply | Threaded
Open this post in threaded view
|  
Report Content as Inappropriate
star

mfi timeout issues and patch that seems to work (PERC H800)

Csillag Tamas
Hi,

I had the same issues with the Perc H800 controller as it is described
here:
http://www.freebsd.org/cgi/query-pr.cgi?pr=kern/140416
(Just for google I include the error here:
mfi0: COMMAND 0xffffff80009c4b90 TIMEOUT AFTER 41 SECONDS)

mfsbsd# mfiutil show adapter
mfi0 Adapter:
    Product Name: PERC H800 Adapter
   Serial Number: 1A8006L
        Firmware: 12.10.2-0004
     RAID Levels: JBOD, RAID0, RAID1, RAID5, RAID6, RAID10, RAID50
  Battery Backup: present
           NVRAM: 32K
  Onboard Memory: 1024M
  Minimum Stripe: 8k
  Maximum Stripe: 1M

mfsbsd# mfiutil show firmware
mfi0 Firmware Package Version: 12.10.2-0004
mfi0 Firmware Images:
Name  Version                        Date           Time      Status
BIOS  3.18.00_4.09.05.00_0x0416A000  00_0x0416A000            active
APP   2.100.03-1405                  Sep 19 2011    17:58:36  active
PCLI  04.04-010:#%00008              May 31 2010    20:21:52  active
CTLR  2.02-0025.1                    Aug 22 2011    11:37:38  active
NVDT  2.07.03-0003                   Jul 14 2010    15:53:29  active
BTBL  2.02.00.00-0000                Sep 16 2009    21:37:06  active
BOOT  01.250.04.219                  4/28/2009      12:51:38  active

however getting and compiling the newest kernel did NOT fix it for me.
Issuing commands with mfiutils still fixed the hang and everything
returned to normal.

It seems that intensive read triggers the issue, but if you have write
concurrently you are fine (mostly). Restarting rsync is ideal for
triggering this buggy condition.

I tried to poke around in the source code and in the end tweaked the
patch (seen here before
http://lists.freebsd.org/pipermail/freebsd-scsi/2011-March/004839.html):

replacing
(void)sc->mfi_read_fw_status(sc);
with
mfi_get_controller_info(sc);
around line 933

and after testing it for a day it seems to be solid.
(The original patch did not help me.)

Can someone expert in the topic can confirm if any of this is correct?

Thanks in advance!

Regards,
  cstamas
--
CSILLAG Tamas (cstamas) - http://digitus.itk.ppke.hu/~cstamas

Arguing with an engineer is like wrestling with a pig in mud. After a while,
you realise the pig is enjoying it.                  -- Jamie Lawrence.

_______________________________________________
[hidden email] mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-scsi
To unsubscribe, send any mail to "[hidden email]"
Reply | Threaded
Open this post in threaded view
|  
Report Content as Inappropriate
star

Re: mfi timeout issues and patch that seems to work (PERC H800)

Csillag Tamas
On Fri, Feb 24, 2012 at 12:59:33AM +0100, Csillag Tamas wrote:
...

> patch (seen here before
> http://lists.freebsd.org/pipermail/freebsd-scsi/2011-March/004839.html):
>
> replacing
> (void)sc->mfi_read_fw_status(sc);
> with
> mfi_get_controller_info(sc);
> around line 933
>
> and after testing it for a day it seems to be solid.
> (The original patch did not help me.)

I have a liveCD here what I am using right now for testing:
https://filesender.niif.hu/filesender/?vid=3f96fcb5-d2a7-e909-1a77-000011b7d001

This is a mfsbsd (mmatuska-mfsbsd-a7082ca) built with a custom kernel.

Regards,
  cstamas
--
CSILLAG Tamas (cstamas) - http://digitus.itk.ppke.hu/~cstamas

_______________________________________________
[hidden email] mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-scsi
To unsubscribe, send any mail to "[hidden email]"
Reply | Threaded
Open this post in threaded view
|  
Report Content as Inappropriate
star

Re: mfi timeout issues and patch that seems to work (PERC H800)

Jan Mikkelsen-3
In reply to this post by Csillag Tamas

On 24/02/2012, at 10:59 AM, Csillag Tamas wrote:

> Hi,
>
> I had the same issues with the Perc H800 controller as it is described
> here:
> http://www.freebsd.org/cgi/query-pr.cgi?pr=kern/140416
> (Just for google I include the error here:
> mfi0: COMMAND 0xffffff80009c4b90 TIMEOUT AFTER 41 SECONDS)
> [ … ]
> however getting and compiling the newest kernel did NOT fix it for me.
> Issuing commands with mfiutils still fixed the hang and everything
> returned to normal.
>
> It seems that intensive read triggers the issue, but if you have write
> concurrently you are fine (mostly). Restarting rsync is ideal for
> triggering this buggy condition.
>
> I tried to poke around in the source code and in the end tweaked the
> patch (seen here before
> http://lists.freebsd.org/pipermail/freebsd-scsi/2011-March/004839.html):
>
> replacing
> (void)sc->mfi_read_fw_status(sc);
> with
> mfi_get_controller_info(sc);
> around line 933
>
> and after testing it for a day it seems to be solid.
> (The original patch did not help me.)
>
> Can someone expert in the topic can confirm if any of this is correct?


The default value for hw.mfi.msi is 0. I had to set it to 1 in /boot/loader.conf when using a 9261-8i controller. It might be worth retesting with that setting and the standard kernel.

Regards,

Jan.

_______________________________________________
[hidden email] mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-scsi
To unsubscribe, send any mail to "[hidden email]"
Reply | Threaded
Open this post in threaded view
|  
Report Content as Inappropriate
star

Re: mfi timeout issues and patch that seems to work (PERC H800)

Csillag Tamas
On Sat, Feb 25, 2012 at 12:04:05PM +1100, Jan Mikkelsen wrote:

> On 24/02/2012, at 10:59 AM, Csillag Tamas wrote:
> > Hi,
> >
> > I had the same issues with the Perc H800 controller as it is described
> > here:
> > http://www.freebsd.org/cgi/query-pr.cgi?pr=kern/140416
> > (Just for google I include the error here:
> > mfi0: COMMAND 0xffffff80009c4b90 TIMEOUT AFTER 41 SECONDS)
> > [ … ]
> > however getting and compiling the newest kernel did NOT fix it for me.
> > Issuing commands with mfiutils still fixed the hang and everything
> > returned to normal.
> >
> > It seems that intensive read triggers the issue, but if you have write
> > concurrently you are fine (mostly). Restarting rsync is ideal for
> > triggering this buggy condition.
> >
> > I tried to poke around in the source code and in the end tweaked the
> > patch (seen here before
> > http://lists.freebsd.org/pipermail/freebsd-scsi/2011-March/004839.html):
> >
> > replacing
> > (void)sc->mfi_read_fw_status(sc);
> > with
> > mfi_get_controller_info(sc);
> > around line 933
> >
> > and after testing it for a day it seems to be solid.
> > (The original patch did not help me.)
> >
> > Can someone expert in the topic can confirm if any of this is correct?
>
> The default value for hw.mfi.msi is 0. I had to set it to 1 in
> /boot/loader.conf when using a 9261-8i controller. It might be worth
> retesting with that setting and the standard kernel.

I am not sure if I will have enough time for this (others starts to
work on this machine), but will make a try.

Thanks

Regards,
  cstamas
--
CSILLAG Tamas (cstamas) - http://digitus.itk.ppke.hu/~cstamas

_______________________________________________
[hidden email] mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-scsi
To unsubscribe, send any mail to "[hidden email]"
Loading...