Re: Gnugk crashing at high volumes.

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

 




It appears you have a bad GW running at xxx.xxx.168.187.. Its trying to tell GNUGK that it's behind a NAT in a very INCORRECT manner. GNUGK has not notified the GW that it is behind a NAT (because the proxy has not been enabled and the GW actually may not be? ) in the RCF (registrationConfirm) but the EP is still sending a Facility message notifying the GK that it is. This is BAD! BAD! BAD! This is what is causing the crash in GNUGK. 

I suggest you remove the GW at xxx.xxx.168.187.

Simon

At 06:06 AM 1/02/2006, you wrote:
We are having an issue that may be similar to the one in this archive:

http://sourceforge.net/mailarchive/forum.php?thread_id=9573995&forum_id=8549

Gnugk is crashing seemingly at random times.  In watching the status monitor it seems to crash when there are high call volumes, but we have not been able to verify this.  We are currently running 2.2.2 and are using a system to restart it automatically when it crashes ("monit").  We would like to figure out what is causing it to crash so that we can use it in our carrier grade platform.  We have tried setting the verbosity of the debug to a very high level to try and capture the error, but the log doesn't appear to help us much (no errors that stand out at "-ttttt" verbosity saved to a log).

I am listing the last hundred lines of the log file (up to the time of crash) as well as our config file below.  Please let me know if you would like the entire log file or more information.  Thank you in advance for your help.  "X" in the config represents the info that would be there that we know allready works.

################################CONFIG################################
[Gatekeeper::Main]
Home=X <IP of NIC we want GNUGK to run on on the server)
Name=X
TimeToLive=60

[RoutedMode]
GKRouted=1
SupportNATedEndpoints=1
CallSignalPort=1720
AcceptUnregisteredCalls=0
AcceptNeighborsCalls=1

[GkStatus::Auth]
rule=password
keyfilled=X
X=x
X=x

[RasSrv::GWPrefixes]
X=X ... etc

[RasSrv::PermanentEndpoints]
X=X ... etc

[RasSrv::GWRewriteE164]
X=in=4=011542234

[RasSrv::RewriteE164]
X=X ...  etc

[Proxy]
Enable=0

[Gatekeeper::Acct]
FileAcct=required
SQLAcct=required;start,update,stop

[FileAcct]
DetailFile=/gnugk/log/cdr.log
Rotate=L10000

[SQLAcct]
Driver=MySQL
Host=127.0.0.1
Database=caflobvi
Username=X
Password=X
StartQuery=INSERT INTO call (gkname, sessid, callno, username, calling, called, duration, disconnectcause, TimeStamp, long_distance, rated, DESCRIPTION, ip) VALUES ('%g', '%s', %n, '%u', '%{Calling-Station-Id}', '%{Called-Station-Id}', '%d', '%c', NOW(), IF(%{Called-Station-Id} REGEXP '^[01689]','1','0'), IF(%{Called-Station-Id} REGEXP '^[01689]','0','1'), IF(%{Called-Station-Id} REGEXP '^[01689]','0','Llamada iPort On Net'), '%{caller-ip}')
StartQueryAlt=UPDATE call SET username = '%u', calling = '%{Calling-Station-Id}', called = '%{Called-Station-Id}', duration = %d, disconnectcause = %c WHERE gkname = '%g' AND sessid = '%s'
UpdateQuery=UPDATE call SET duration = %d, call_done = '1' WHERE gkname = '%g' AND sessid = '%s'
StopQuery=UPDATE call SET call_done = '1', username = '%u', calling = '%{Calling-Station-Id}', called = '%{Called-Station-Id}', duration = %d, disconnectcause = %c WHERE gkname = '%g' AND sessid = '%s'
StopQueryAlt=INSERT INTO call (gkname, sessid, callno, username, calling, called, duration, disconnectcause, call_done) VALUES ('%g', '%s', %n, '%u', '%{Calling-Station-Id}', '%{Called-Station-Id}', %d, %c, '1')

[Gatekeeper::Auth]
RadAliasAuth=sufficient;RRQ,ARQ
default=allow

[RadAliasAuth]
Servers=127.0.0.1
LocalInterface=127.0.0.1
RadiusPortRange=10000-11000
DefaultAuthPort=1812
SharedSecret=X
RequestTimeout=2000
IdCacheTimeout=9000
SocketDeleteTimeout=60000
RequestRetransmissions=1
RoundRobinServers=1
AppendCiscoAttributes=1
IncludeTerminalAliases=1
IncludeEndpointIP=1

[RasSrv::Neighbors]
X1=CiscoGK
X2=CiscoGK
X3=CiscoGK

[Neighbor::X1]
GatekeeperIdentifier=X1
Host=X
AcceptForwardedLRQ=1
SendPrefixes=10500#
ForwardResponse=1
ForwardLRQ=always
ForwardHopCount=3

[Neighbor::X2]
GatekeeperIdentifier=X2
Host=X
AcceptPrefixes=
AcceptForwardedLRQ=1
ForwardResponse=1
ForwardLRQ=always
ForwardHopCount=3

[Neighbor::X3]
GatekeeperIdentifier=X3
Host=X
AcceptForwardedLRQ=1
SendPrefixes=669
ForwardResponse=1
ForwardLRQ=always
ForwardHopCount=3

################################DEBUG: LAST 100 LINES################################

[ed@localhost 2006-01-31]$ tail -n 100 look2006-01-30.log
        0070 006c 0061 006e 006e 006f 0031        planno1
      }
      [1]=dialedDigits "74081"
      [2]=dialedDigits "74078"
      [3]=dialedDigits "74079"
      [4]=dialedDigits "74080"
    }
    gatekeeperIdentifier =  8 characters {
      0063 0061 0066 006c 006f 0067 006b 0034   caflogk4
    }
    endpointVendor = {
      vendor = {
        t35CountryCode = 0
        t35Extension = 0
        manufacturerCode = 0
      }
      productId =  4 octets {
        34 46 58 53                                        4FXS
      }
      versionId =  3 octets {
        32 30 35                                           205
      }
    }
    timeToLive = 60
    keepAlive = TRUE
    endpointIdentifier =  9 characters {
      0039 0037 0030 0033 005f 0065 006e 0064   9703_end
      0070                                      p
    }
    willSupplyUUIEs = FALSE
    maintainConnection = FALSE
  }
2006/01/31 14:50:35.016 5                job.cxx(352)   JOB     Worker threads: 10 total - 6 busy, 4 idle
2006/01/31 14:50:35.016 5                job.cxx(180)   JOB     Starting Job RRQ at Worker thread 4964360
2006/01/31 14:50:35.016 1             RasSrv.cxx(330)   RAS     RRQ Received
2006/01/31 14:50:35.017 3             RasSrv.cxx(221)   RAS     Send to 201.128.168.187:1034
registrationConfirm {
    requestSeqNum = 29083
    protocolIdentifier = 0.0.8.2250.0.4
    callSignalAddress = 1 entries {
      [0]=ipAddress {
        ip =  4 octets {
          c8 44 57 73                                        .DWs
        }
        port = 1720
      }
    }
    terminalAlias = 5 entries {
      [0]=h323_ID  7 characters {
        0070 006c 0061 006e 006e 006f 0031        planno1
      }
      [1]=dialedDigits "74081"
      [2]=dialedDigits "74078"
      [3]=dialedDigits "74079"
      [4]=dialedDigits "74080"
    }
    gatekeeperIdentifier =  8 characters {
      0063 0061 0066 006c 006f 0067 006b 0034   caflogk4
    }
    endpointIdentifier =  9 characters {
      0039 0037 0030 0033 005f 0065 006e 0064   9703_end
      0070                                      p
    }
    timeToLive = 60
    willRespondToIRR = FALSE
    maintainConnection = FALSE
  }
2006/01/31 14:50:35.017 5             RasSrv.cxx(235)   RAS     Sent Successful
2006/01/31 14:50:35.017 5                job.cxx(415)   JOB     Job RRQ deleted
2006/01/31 14:50:35.017 5                job.cxx(404)   JOB     Worker threads: 10 total - 5 busy, 5 idle
2006/01/31 14:50:35.186 5           yasocket.cxx(709)   GkStatus 1 sockets selected from 3, total 3/0
2006/01/31 14:50:35.187 5                job.cxx(352)   JOB     Worker threads: 10 total - 6 busy, 4 idle
2006/01/31 14:50:35.187 5                job.cxx(180)   JOB     Starting Job StatusCmd !! at Worker thread 4964360
2006/01/31 14:50:35.187 5           GkStatus.cxx(1018)  STATUS  Got command !! from client 68.157.82.98:3463
2006/01/31 14:50:35.187 5           yasocket.cxx(709)   GkStatus 1 sockets selected from 3, total 3/0
2006/01/31 14:50:35.187 3            SoftPBX.cxx(80)    GK      SoftPBX: PrintCurrentCalls
2006/01/31 14:50:35.189 5                job.cxx(415)   JOB     Job StatusCmd !! deleted
2006/01/31 14:50:35.189 5                job.cxx(404)   JOB     Worker threads: 10 total - 5 busy, 5 idle
2006/01/31 14:50:35.475 5           yasocket.cxx(709)   TCPServer 1 sockets selected from 2, total 2/0
2006/01/31 14:50:35.475 4           yasocket.cxx(816)   TCP     Accept request on 200.68.87.115:1720
2006/01/31 14:50:35.475 5                job.cxx(352)   JOB     Worker threads: 10 total - 6 busy, 4 idle
2006/01/31 14:50:35.475 5                job.cxx(180)   JOB     Starting Job Acceptor at Worker thread 4964360
2006/01/31 14:50:35.485 5       ProxyChannel.cxx(501)   Q931s   Reading from 201.128.168.187:1833
2006/01/31 14:50:35.485 3       ProxyChannel.cxx(744)   Q931s   Received: Information CRV=0 from 201.128.168.187:1833
2006/01/31 14:50:35.486 4       ProxyChannel.cxx(686)   Q931    Received: {
  q931pdu = {
    protocolDiscriminator = 8
    callReference = 0
    from = originator
    messageType = Information
    IE: Call-State = {
      09                                                 .
    }
    IE: Facility = {
      39 37 30 33 5f 65 6e 64  70                        9703_endp
    }
  }
}
2006/01/31 14:50:35.486 3       ProxyChannel.cxx(1896)  Q931s   201.128.168.187:1833 NAT Information message from EPID = 9703_endp
2006/01/31 14:50:35.486 3             RasTbl.cxx(255)   Q931    NAT socket detected at 201.128.168.187:1833 for endpoint 9703_endp




--

Thanks,

C. Ed Felt Caflo Network Engineer/Programmer
115115a2.jpg
efelt@xxxxxxxxxxxx





[Index of Archives]     [SIP]     [Open H.323]     [Gnu Gatekeeper]     [Asterisk PBX]     [ISDN Cause Codes]     [Yosemite News]

  Powered by Linux