Intermittent SSLVPN / RADIUS issues
I'm hoping for a bit of advice from the experts, as I'm fairly stuck with an SSL VPN issue. I'm running a 100D on 5.2.2 with SSL VPN authenticating via a Windows RADIUS server.
Symptoms:
User will connect either via the SSL web portal or SSL client, client will reach ~90% completion then give error -12. User tries again maybe 2 or 3 times, then will authenticate successfully.
VPN logs during these connection attempts will show several entries with 'SSL new connection', but with the user as 'N/A'. There is also usually another entry with 'SSL user failed to login', with the username showing correctly.
Up until now I've had a great deal of difficulty replicating this issue. Not all users are affected, and the ones that are seem to resolve the issue with 2-3 re-tries. This means that by the time the issue is reported from the Helpdesk, it's tough to setup the proper debugs. I've recently found a user that will consistently get 'SSL user failed to login'. I'm not entirely sure if this is the same issue, but it seems related, so I'm hoping I can get some assistance now that I can produce some debug logs.
Debug from Failed User:
2015-08-27 12:38:01 [97:root:10239]sslvpn_authenticate_user:168 authenticate user: [USERNAME_A]
2015-08-27 12:38:01 [97:root:10239]sslvpn_authenticate_user:175 create fam state
2015-08-27 12:38:01 [97:root:10239]fam_auth_send_req:514 with server blacklist:
2015-08-27 12:38:01 [97:root:10239]fam_auth_send_req_internal:414 fnbam_auth return: 4
2015-08-27 12:38:10 [97:root:10239]fam_auth_send_req:514 with server blacklist: #RADIUS_SERVER
2015-08-27 12:38:10 [97:root:10239]fam_auth_send_req:600 task finished with 5
2015-08-27 12:38:10 [97:root:10239]rmt_logincheck.c:250 user[USERNAME_A],auth_type=1 failed [sslvpn_login_unknown_user]
Debug from Successful User:
2015-08-27 14:03:01 [99:root:10245]sslvpn_authenticate_user:168 authenticate user: [USERNAME_B]
2015-08-27 14:03:01 [99:root:10245]sslvpn_authenticate_user:175 create fam state
2015-08-27 14:03:01 [99:root:10245]fam_auth_send_req:514 with server blacklist:
2015-08-27 14:03:01 [99:root:10245]fam_auth_send_req_internal:414 fnbam_auth return: 4
2015-08-27 14:03:06 [99:root:10245]Auth successful for group SSL_VPN_Group
2015-08-27 14:03:06 [99:root:10245]fam_do_cb:463 fnbamd return auth success.
2015-08-27 14:03:06 [99:root:10245]SSL VPN login matched rule (1).
Note the line "514 with server blacklist" with the return "414 fnbam_auth return: 4". This is reflected in the RADIUS logs from both user connections, with the RADIUS Reason Code 66 = IAS_INVALID_AUTH_TYPE. The difference being that on the second request the RADIUS logs show a return a "Packet Type 2 = Access-Accept" for the successful user, and a "Packet Type 3 = Access-Reject" on the failed user.
The only other major discrepancy I could find is the RADIUS logs reporting the failed user's OU as "domain\username", where as it should be in "domain\container1\container2\username". The two accounts looks identical in AD from what I can see, which is why I'm leaning more towards a configuration issue/bug on the Fortigate.
I'm stumped. Any suggestions on what to try next are welcome. I'm happy to post any further debugs or logs that might help.
Cheers.
