Asterisk 1.6.2.10 Random Crashes

Hi All

I am at my wits end at the moment. I have a newly implemented system using 64 bit Gentoo Linux on a Core i7 930 with 6GB of memory (3x 2GB in triple channel) on an Intel DX58SO Motherboard. I have 4x BRI and 4x POTS lines coming in to the organisation. I am using a Digium HA8 Card with 1x B400M ISDN Module. I have an X400M FXO module ready to go onto the card also.

I have everything configured using FreePBX 2.8.0

I have 38 configured SIP Endpoints which are a combination of Grandstream 2020, 280 and HT-502’s.

The system works well 90% of the time. the other 10% of the time it crashes randomly and does not write any meaningful information into any of the logs even with debug and verbose set to at least 9.

Any help would be greatly accepted with this issue.

Additional Information:

zapbx ~ # uname -a
Linux zapbx 2.6.34-gentoo-r1 #2 SMP Thu Jul 29 08:48:38 SAST 2010 x86_64 Intel® Core™ i7 CPU 930 @ 2.80GHz GenuineIntel GNU/Linux

zapbx ~ # lscpu
Architecture: x86_64
CPU op-mode(s): 32-bit, 64-bit
CPU(s): 8
Thread(s) per core: 2
Core(s) per socket: 4
CPU socket(s): 1
Vendor ID: GenuineIntel
CPU family: 6
Model: 26
Stepping: 5
CPU MHz: 2793.000
Virtualization: VT-x
L1d cache: 32K
L1i cache: 32K
L2 cache: 256K
L3 cache: 8192K

zapbx ~ # lspci
00:00.0 Host bridge: Intel Corporation X58 I/O Hub to ESI Port (rev 13)
00:01.0 PCI bridge: Intel Corporation 5520/5500/X58 I/O Hub PCI Express Root Port 1 (rev 13)
00:03.0 PCI bridge: Intel Corporation 5520/5500/X58 I/O Hub PCI Express Root Port 3 (rev 13)
00:07.0 PCI bridge: Intel Corporation 5520/5500/X58 I/O Hub PCI Express Root Port 7 (rev 13)
00:10.0 PIC: Intel Corporation 5520/5500/X58 Physical and Link Layer Registers Port 0 (rev 13)
00:10.1 PIC: Intel Corporation 5520/5500/X58 Routing and Protocol Layer Registers Port 0 (rev 13)
00:14.0 PIC: Intel Corporation 5520/5500/X58 I/O Hub System Management Registers (rev 13)
00:14.1 PIC: Intel Corporation 5520/5500/X58 I/O Hub GPIO and Scratch Pad Registers (rev 13)
00:14.2 PIC: Intel Corporation 5520/5500/X58 I/O Hub Control Status and RAS Registers (rev 13)
00:14.3 PIC: Intel Corporation 5520/5500/X58 I/O Hub Throttle Registers (rev 13)
00:19.0 Ethernet controller: Intel Corporation 82567LM-2 Gigabit Network Connection
00:1a.0 USB Controller: Intel Corporation 82801JI (ICH10 Family) USB UHCI Controller #4
00:1a.1 USB Controller: Intel Corporation 82801JI (ICH10 Family) USB UHCI Controller #5
00:1a.2 USB Controller: Intel Corporation 82801JI (ICH10 Family) USB UHCI Controller #6
00:1a.7 USB Controller: Intel Corporation 82801JI (ICH10 Family) USB2 EHCI Controller #2
00:1b.0 Audio device: Intel Corporation 82801JI (ICH10 Family) HD Audio Controller
00:1c.0 PCI bridge: Intel Corporation 82801JI (ICH10 Family) PCI Express Port 1
00:1c.1 PCI bridge: Intel Corporation 82801JI (ICH10 Family) PCI Express Port 2
00:1c.4 PCI bridge: Intel Corporation 82801JI (ICH10 Family) PCI Express Port 5
00:1d.0 USB Controller: Intel Corporation 82801JI (ICH10 Family) USB UHCI Controller #1
00:1d.1 USB Controller: Intel Corporation 82801JI (ICH10 Family) USB UHCI Controller #2
00:1d.2 USB Controller: Intel Corporation 82801JI (ICH10 Family) USB UHCI Controller #3
00:1d.7 USB Controller: Intel Corporation 82801JI (ICH10 Family) USB2 EHCI Controller #1
00:1e.0 PCI bridge: Intel Corporation 82801 PCI Bridge (rev 90)
00:1f.0 ISA bridge: Intel Corporation 82801JIR (ICH10R) LPC Interface Controller
00:1f.2 IDE interface: Intel Corporation 82801JI (ICH10 Family) 4 port SATA IDE Controller
00:1f.3 SMBus: Intel Corporation 82801JI (ICH10 Family) SMBus Controller
00:1f.5 IDE interface: Intel Corporation 82801JI (ICH10 Family) 2 port SATA IDE Controller
02:00.0 VGA compatible controller: nVidia Corporation G98 [GeForce 8400 GS] (rev a1)
06:00.0 IDE interface: Marvell Technology Group Ltd. 88SE6121 SATA II Controller (rev b2)
07:02.0 Ethernet controller: Digium, Inc. Device 8007 (rev 11)
07:03.0 FireWire (IEEE 1394): Texas Instruments TSB43AB22/A IEEE-1394a-2000 Controller (PHY/Link)

zapbx ~ # lsdahdi

Span 1: WCBRI/0/0 “HA8-0000 Board 1” (MASTER) AMI/CCS

1 BRI Clear (In use)
2 BRI Clear (In use)
3 BRI Hardware-assisted HDLC (In use)

Span 2: WCBRI/0/1 “HA8-0000 Board 1” AMI/CCS

4 BRI Clear (In use)
5 BRI Clear (In use)
6 BRI Hardware-assisted HDLC (In use)

Span 3: WCBRI/0/2 “HA8-0000 Board 1” AMI/CCS

7 BRI Clear (In use)
8 BRI Clear (In use)
9 BRI Hardware-assisted HDLC (In use)

Span 4: WCBRI/0/3 “HA8-0000 Board 1” AMI/CCS

10 BRI Clear (In use)
11 BRI Clear (In use)
12 BRI Hardware-assisted HDLC (In use)

Check for wrong version modules, then do as described in backtrace.txt in the documentation.

Check for a hotfix for your server. I had an HP G6 doing the same thing. There was a problem with the BIOS and intel chipset.

This was found out after we upgraded all our modules and made sure all versions matched. We even replaced all the memory sticks.

edited