Introduction to mlx RDMA NIC Counters and Metrics
- Overview
- hw_counter
- counter
- References
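Overview
The mlx5 driver exposes a set of mlx NIC parameters and counters through Linux sysfs, in the directories /sys/class/infiniband/mlx5_x/ports/1/counters and /sys/class/infiniband/mlx5_x/ports/1/hw_counters. Each counter records how many times a particular type of event has occurred, such as a specific error or a received packet. Understanding these counters gives a clearer picture of the NIC's running state, and monitoring them makes it faster to find the root cause of RDMA errors.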
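As a rough illustration of how these directories can be read, the sketch below loads every counter file in a directory into a dictionary. The helper name `read_counters` is hypothetical, and the device name `mlx5_0` and port `1` in the usage part are assumptions; substitute the device and port present on your host.

```python
from pathlib import Path


def read_counters(counter_dir):
    """Return {counter_name: int_value} for every integer-valued file in counter_dir."""
    values = {}
    for entry in sorted(Path(counter_dir).iterdir()):
        if not entry.is_file():
            continue
        try:
            values[entry.name] = int(entry.read_text().strip())
        except (OSError, ValueError):
            # Skip files that cannot be read or do not hold a plain integer.
            continue
    return values


if __name__ == "__main__":
    # "mlx5_0" and port 1 are assumptions, not values taken from the article.
    base = Path("/sys/class/infiniband/mlx5_0/ports/1")
    for sub in ("counters", "hw_counters"):
        directory = base / sub
        if directory.is_dir():
            for name, value in read_counters(directory).items():
                print(f"{sub}/{name} = {value}")
```

Each counter is a small text file holding one integer, so plain file reads are enough and no RDMA library is required for this kind of monitoring.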
hw_counter
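- rnr_nak_retry_err: Number of RNR NAK packets received from the peer while the local host acts as the sender. It increases when the SRQ of the receiver's QP has no free receive WQE for the incoming message.
- out_of_buffer: Number of packets dropped because the local host, acting as the receiver, had no receive buffer available. It increases when the SRQ of the local QP has run out of posted receive WQEs.
- out_of_sequence: Number of packets received out of sequence.
- local_ack_timeout_err: Number of times a sent RDMA request timed out waiting for an ACK.
- packet_seq_err: Number of NAK (sequence error) packets received by the local host.
- req_cqe_error: Number of CQEs that completed with an error on the local host acting as the requester.
- duplicate_request: Number of duplicate request packets received by the local host.
- np_ecn_marked_roce_packets: Number of ECN-marked RoCE packets received by the local host.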
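Because these counters are cumulative, what matters when monitoring is usually whether they grew between two samples, not their absolute value. A minimal sketch of that comparison follows; the function name `counter_deltas` and the choice of which counters to watch are assumptions for illustration, and the snapshots are plain dictionaries however they were collected.

```python
# Counter names come from the list above; watching all of them is an assumption.
WATCHED = (
    "rnr_nak_retry_err",
    "out_of_buffer",
    "out_of_sequence",
    "local_ack_timeout_err",
    "packet_seq_err",
    "req_cqe_error",
    "duplicate_request",
    "np_ecn_marked_roce_packets",
)


def counter_deltas(before, after, names=WATCHED):
    """Return {name: increase} for watched counters that grew between two snapshots."""
    deltas = {}
    for name in names:
        if name in before and name in after:
            diff = after[name] - before[name]
            if diff > 0:
                deltas[name] = diff
    return deltas


if __name__ == "__main__":
    # Made-up sample values, only to show the call shape.
    earlier = {"out_of_buffer": 5, "out_of_sequence": 2}
    later = {"out_of_buffer": 9, "out_of_sequence": 2}
    print(counter_deltas(earlier, later))
```

For example, a growing out_of_buffer on one host together with a growing rnr_nak_retry_err on its peer would point at the receiver not posting receive WQEs fast enough.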
counter
- port_rcv_data: Total number of data octets, divided by 4 (lanes), received on all VLs. This is a 64-bit counter.
- port_rcv_packets: Total number of packets received on all VLs (this may include packets containing errors). This is a 64-bit counter.
- port_xmit_data: Total number of data octets, divided by 4 (lanes), transmitted on all VLs. This is a 64-bit counter.
- port_xmit_packets: Total number of packets transmitted on all VLs from this port. This may include packets with errors.
- unicast_rcv_packets: Total number of unicast packets received, including unicast packets containing errors.
- unicast_xmit_packets: Total number of unicast packets transmitted on all VLs from the port. This may include unicast packets with errors.
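Since port_rcv_data and port_xmit_data report octets divided by 4, the raw value has to be multiplied by 4 to get bytes. The sketch below applies that conversion and derives an average throughput from two samples; the function names and the sampling interval are hypothetical.

```python
def data_counter_to_bytes(raw_value):
    """Convert a port_rcv_data / port_xmit_data reading (octets / 4) to bytes."""
    return raw_value * 4


def throughput_gbps(raw_before, raw_after, interval_seconds):
    """Average throughput in Gbit/s between two raw data-counter samples."""
    if interval_seconds <= 0:
        raise ValueError("interval_seconds must be positive")
    delta_bytes = data_counter_to_bytes(raw_after - raw_before)
    return delta_bytes * 8 / interval_seconds / 1e9


if __name__ == "__main__":
    # Made-up sample values: the counter advanced by 312,500,000 units in one second.
    print(throughput_gbps(0, 312_500_000, 1.0))
```

The packet counters need no such scaling; they can be differenced directly to obtain a packet rate.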
References