Hi!
As I understand, "port_ha" is virtual manifestation of physical "ha1" or "ha2" (depending on priority), so, why am I seeing its drops counter increasing but not either of latter?
<node_redacted> (root) # diagnose netlink device list | grep port
Inter-|Receive |Transmit
face |bytes packets errs drop fifo frame compressed multicast|bytes packets errs drop fifo colls carrier compressed
:
port_ha: 59307243213 183676419 0 268802671 0 0 0 0 |2695447833533 2029323361 0 1 0 0 0 0
: |
ha2: 442088029516 531221403 0 0 0 2 0 4059668 |2709586459181 2051768332 0 0 0 0 0 0
ha1: 43740481445 77620950 0 0 0 0 0 2989697 |13166888759 21729664 0 0 0 0 0 0
: |Thanks!
Hi @AlexFerenX
Could you please share the information of your device? together with the configuration of HA and snapshot of output of the errs. Thank you
Bill
Created on 12-08-2025 09:09 PM Edited on 12-08-2025 09:16 PM
Hi @BillH_FTNT
I'm seeing this on both of our HA clusters - a "Border" HA cluster 2200E (with "set group_id 60") and a "LAN" HA 2200E cluster (with "set group_id 50"), former v7.2.11, latter v7.4.8.
Primary and Subordinate's HA1 and HA2 interfaces aren't back-to-back connected - they're geographically disparate and follow diverse paths - HA1 one via North path, HA2 via South path.
Perhaps something noteworthy: both clusters share HA1 and HA2 paths - so, "Border" HA1 and "LAN" HA1 are in same collision domain; and "Border" HA2 and "LAN" HA2 are in same collision domain. Each cluster has different HA Group Id, so, their HA are not conflated, however, it may cause the drop count we're seeing?
Alex.
Hi AlexFerenX,
Kindly check the CPU usage for the device and verify if any particular core is going HIGH, if yes then it might cause similar issue.
Refer below article for more detail:
Created on 12-09-2025 02:45 PM Edited on 12-09-2025 02:51 PM
There are no error-indicative Log Messages pertaining to HA.
I've followed the instructions in "How to troubleshoot HA 'Heartbeat packet lost' issues in a FortiGate HA Cluster" - and noticed no overt CPU utilisation.
Printout of "fnsysctl cat /proc/interrupts" showed distribution of "i40e-ha1-TxRx-x" and "i40e-ha2-TxRx-y" among many of the 24 processors, albeit uneven - some more than others.
I didn't do packet capture comparison between received/sent HA1/HA2 packets - even if some are missing, what would it mean, given that individual HA1 and HA2 packet counters show no drops?
You've not answered a fundamental question: can "port_ha" drops be indicative of multiple HA clusters sharing same collision domain in transport of HA1 and HA2 packets between HA cluster members?
I've touched on this problem in Ticket 9166052, but only as a sideline query. Does above observation warrant a new Ticket?
Alex
| User | Count |
|---|---|
| 2836 | |
| 1433 | |
| 812 | |
| 796 | |
| 455 |
The Fortinet Security Fabric brings together the concepts of convergence and consolidation to provide comprehensive cybersecurity protection for all users, devices, and applications and across all network edges.
Copyright 2025 Fortinet, Inc. All Rights Reserved.