Difference between revisions of "VPP/AArch64"
Revision as of 13:59, 28 April 2020
Get Involved
Meeting Details
- Regular AArch64 meeting: Tuesdays at 06:00 PT (Pacific Time) (weekly). Convert to your timezone.
IRC Channel
#fdio-arm on freenode.net
Slack
Request invitation at https://slack.fd.io/
Jira
Presentations
- The path to Fast Data on Arm [pdf] - FD.io Mini-Summit at KC+CNC EU 2018
- Vector Packet Processing (VPP) Arm Story: Now and Beyond [youtube] - FD.io Mini-summit at KC+CNC NA 2018
Release Milestones
18.10
18.07
18.04
- CI
- Upstream patch verification on ARMv8 machines
- .deb packages
Machines
The FD.io lab is hosted at the VEXXHOST colocation centre in Montreal, Québec, Canada.
- CI build servers integrated into Jenkins
- CSIT test beds (under construction)
Platform | Role | Status | Hostname | IP | IPMI | Cores | RAM | Ethernet | Distro |
---|---|---|---|---|---|---|---|---|---|
SoftIron OverDrive 1000 | CI nomad server | Used in CI | softiron-1 | 10.30.51.12 | N/A | 4 | 8GB | | openSUSE |
SoftIron OverDrive 1000 | CI nomad server | Used in CI | softiron-2 | 10.30.51.13 | N/A | 4 | 8GB | | openSUSE |
SoftIron OverDrive 1000 | CI nomad server | Used in CI | softiron-3 | 10.30.51.14 | N/A | 4 | 8GB | | openSUSE |
Marvell ThunderX | Spare | Running | fdio-marvell1 | 10.30.51.38 | 10.30.50.38 | 96 | 128GB | 3x40GbE QSFP+ / 4x10GbE SFP+ | Ubuntu 18.04.2 |
Marvell ThunderX | Spare | Running | fdio-marvell2 | 10.30.51.39 | 10.30.50.39 | 96 | 128GB | 3x40GbE QSFP+ / 4x10GbE SFP+ | Ubuntu 18.04.2 |
Marvell ThunderX | Spare | Running | fdio-marvell3 | 10.30.51.40 | 10.30.50.40 | 96 | 128GB | 3x40GbE QSFP+ / 4x10GbE SFP+ | Ubuntu 18.04.2 |
Marvell ThunderX | VPP dev debug server | Running | fdio-marvell4 | 10.30.51.65 | 10.30.50.65 | 96 | 256GB | 2xQSFP+ / USB Ethernet | Ubuntu 18.04.2 |
Marvell ThunderX | CI build server | Running in CI | nomad6arm | 10.30.51.66 | 10.30.50.66 | 96 | 256GB | 2xQSFP+ / USB Ethernet | Ubuntu 18.04.2 |
Marvell ThunderX | CI build server | Running in CI | nomad7arm | 10.30.51.67 | 10.30.50.67 | 96 | 256GB | 2xQSFP+ / USB Ethernet | Ubuntu 18.04.2 |
Marvell ThunderX | CI build server | Running in CI | nomad8arm | 10.30.51.68 | 10.30.50.68 | 96 | 256GB | 2xQSFP+ / USB Ethernet | Ubuntu 18.04.2 |
Marvell ThunderX2 | VPP device server | Running | s27-t13-sut1 | 10.30.51.69 | 10.30.50.69 | 112 | 128GB | 3x40GbE QSFP+ XL710-QDA2 | Ubuntu 18.04.2 |
Huawei TaiShan 2280 | CSIT testbed | Running | s17-t33-sut1 | 10.30.51.36 | 10.30.50.36 | 64 | 128GB | 2x10GbE SFP+ Intel X520-DA2 / 2x25GbE SFP28 Mellanox CX-4 | 18.04.1 |
Huawei TaiShan 2280 | CSIT testbed | Running | s18-t33-sut2 | 10.30.51.37 | 10.30.50.37 | 64 | 128GB | 2x10GbE SFP+ Intel X520-DA2 / 2x25GbE SFP28 Mellanox CX-4 | 18.04.1 |
Marvell MACCHIATObin | CSIT testbed | Running | s20-t34-sut1 | 10.30.51.41 | 10.30.51.49, then connect to /dev/ttyUSB0 | 4 | 16GB | 2x10GbE SFP+ | Ubuntu 16.04.4 |
Marvell MACCHIATObin | CSIT testbed | Running | s21-t34-sut2 | 10.30.51.42 | 10.30.51.49, then connect to /dev/ttyUSB1 | 4 | 16GB | 2x10GbE SFP+ | Ubuntu 16.04.5 |
Marvell MACCHIATObin | VPP dev debug server | Running | fdio-mcbin3 | 10.30.51.43 | 10.30.51.49, then connect to /dev/ttyUSB2 | 4 | 16GB | 2x10GbE SFP+ | Ubuntu 16.04.5 |
Note: to get lab access, create a GPG key, upload it to a keyserver, and have it signed by a trusted anchor in a video call (the fingerprint will be needed). An ARM authority (Tina) then needs to send an e-mail to helpdesk@fd.io with your name, e-mail, keygrip and key fingerprint.
CI
Covers automated build, unit test, and packaging for various Linux distros on ARMv8 machines.
Jenkins job | Status | Description |
---|---|---|
vpp-arm-verify-master-ubuntu1604 | Running | xxx |
vpp-arm-merge-master-ubuntu1604 | Running | xxx |
vpp-arm-verify-1804-ubuntu1604 | Running | xxx |
vpp-arm-merge-1804-ubuntu1604 | Running | xxx |
Next steps:
- make test added to verify jobs
- Clang build
- openSUSE Leap 15 | CentOS 7 | Ubuntu 18.04
- vpp-csit-verify-virl-master or equivalent CSIT functional testing
CSIT
Covers automated functional and performance integration testing on ARMv8 3-node and 2-node testbeds.
https://wiki.fd.io/view/CSIT/AArch64
Contiv-VPP
This Kubernetes network plugin uses FD.io VPP to provide network connectivity between PODs.
The installation guide for Contiv-VPP on the Arm64 platform is at:
https://github.com/contiv/vpp/blob/master/docs/arm64/MANUAL_INSTALL_ARM64.md
Porting and Tuning Roadmap
- VPP Vectorization: Expanding the Neon Library for IPv4 forwarding code path - Sirshak/Lijian
- Tuning the quad loop/dual loop for small cores - Lijian
- General performance analysis and tuning of various graph nodes for IPv4 forwarding test case - Sirshak/Lijian
- Memory Ordering - Sirshak
- CSIT Performance Test - Khemendra
- CSIT Device Test - Juraj
- CSIT Path Test - Juraj
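The quad loop/dual loop item in the roadmap refers to how many packets a graph node handles per loop iteration. The following is only a control-flow sketch in Python, not VPP code (VPP nodes are written in C, where the unrolled body is spelled out explicitly); `process_packets`, `handle`, and `unroll` are hypothetical names introduced for illustration.

```python
from typing import Callable, List


def process_packets(
    packets: List[int],
    handle: Callable[[int], int],
    unroll: int = 2,
) -> List[int]:
    """Process packets in groups of `unroll`, then finish the rest one by one.

    unroll=2 mimics a dual loop, unroll=4 a quad loop. This is an
    illustrative sketch; it says nothing about real performance.
    """
    if unroll < 1:
        raise ValueError("unroll must be >= 1")
    out: List[int] = []
    i = 0
    n = len(packets)
    # Main loop: each iteration consumes exactly `unroll` packets.
    while n - i >= unroll:
        for k in range(unroll):
            out.append(handle(packets[i + k]))
        i += unroll
    # Single loop: handles the remaining packets (fewer than `unroll`).
    while i < n:
        out.append(handle(packets[i]))
        i += 1
    return out
```

Both settings produce the same results in the same order; only the grouping of work per iteration differs, which is the property that tuning for small cores would vary.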
Known Issues
GCC 5.3 ICEs during FP register allocation. Please use GCC 5.4 or newer.
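A build script could guard against the compiler issue above by checking the GCC version before building. The sketch below assumes the version string comes from something like `gcc -dumpversion`; the helper names `parse_gcc_version` and `gcc_is_supported` are hypothetical and are not part of the VPP build system.

```python
import re
from typing import Tuple

# From the note above: GCC 5.3 hits an ICE, so 5.4 is the assumed minimum.
MIN_GCC: Tuple[int, ...] = (5, 4)


def parse_gcc_version(text: str) -> Tuple[int, ...]:
    """Extract the first dotted version number found in `text`."""
    match = re.search(r"(\d+(?:\.\d+)*)", text)
    if match is None:
        raise ValueError(f"no version number found in {text!r}")
    return tuple(int(part) for part in match.group(1).split("."))


def gcc_is_supported(text: str, minimum: Tuple[int, ...] = MIN_GCC) -> bool:
    """Return True if the parsed version is at least `minimum`."""
    return parse_gcc_version(text) >= minimum
```

Passing a different `minimum`, for example `(8,)`, reuses the same check for a stricter requirement.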
Activity
Recent Patches
Meeting Minutes
04/28/2020
- Attendees
- Govindarajan Mohandoss
- Honnappa Nagarahalli
- Juraj Linkes
- Tina Tsou
- Jieqiang Wang
- Arthur Marshall
- General
- CSIT
- VPP Performance Test
- Two failures in performance testing
- One failure is related to the CSIT script: NAT44 is a common issue that also fails on x86.
- The other failure is related to the VPP image on Arm: an IPsec tunnel configuration issue.
- iommu_passthrough=1 does not make any difference on the Taishan server - Lijian
- We cannot do a kernel upgrade with Ubuntu-18.04.1/Ubuntu-18.04.2/Ubuntu-18.04.3/Ubuntu-18.04.4 on Taishan.
- For now, the kernel of the Taishan server can be left as it is, linux-4.15.0.54. - Juraj
- One possible option/improvement is to port FD.io CSIT performance testing to some more advanced Arm servers, e.g., Ampere
- VPP Path
- Investigate Ubuntu-20.04 on Arm servers - Juraj & Jieqiang
- Investigate adding CentOS on Arm Jenkins jobs - Juraj & Jieqiang
- Will send an email to the community about two options to resolve the gcc-7 issue with CentOS-7
- 1. Update the gcc-7 requirement to gcc-8 in the Makefile
- 2. Remove the gcc-7 limitation in the Makefile, and have users install gcc-8 manually
- After fixing a software bug, VPP can boot up normally with 16K/64K page size.
- There are about 4-5 test failures in 'make test' when the system is configured with 16K/64K page size - Lijian
- VPP Device
- VPP device job is running now and will be triggered per VPP patch and CSIT patch
- https://jenkins.fd.io/view/csit/job/csit-vpp-device-master-ubuntu1804-1n-tx2/
- https://jenkins.fd.io/view/vpp/job/vpp-csit-verify-device-master-1n-tx2/
- https://jenkins.fd.io/view/csit/job/csit-vpp-device-master-ubuntu1804-1n-tx2-semiweekly/
- https://jenkins.fd.io/view/csit/job/csit-vpp-device-master-ubuntu1804-1n-tx2-weekly/
- FD.io lab
- Update server information in the CSIT documentation. - Juraj & Jieqiang
- Prepare a CSIT script patch for adding those two ThunderX2 servers. - Juraj & Jieqiang
- The servers, Intel NICs, and Mellanox NICs are working well so far.
- Root-causing the RDMA issue with Mellanox NIC.
- ThunderX2 servers are in Arm local lab. Dean is setting up the hardware.
- Two more ThunderX2 have just been ordered and are expected to arrive in Arm lab in April.
- We are about to purchase two official ThunderX2 servers on the market.
- Raise the budget requirement from CE-OSS - Dean & Honnappa
- Check the ThunderX2 configurations required - Govind & Juraj
- Two ThunderX2 servers are installed in Arm lab.
- All Intel and Mellanox NICs have been verified on ThunderX2-02.
- Intel NIC cannot be enumerated on ThunderX2-01.
- Server info in CSIT docs:
- https://gerrit.fd.io/r/gitweb?p=csit.git;a=blob;f=docs/lab/testbed_specifications.md;h=afa36ff56c7be09621e85bae6a1498aadf3a1981;hb=HEAD
- https://gerrit.fd.io/r/gitweb?p=csit.git;a=blob;f=docs/lab/testbed_specifications.md;h=afa36ff56c7be09621e85bae6a1498aadf3a1981;hb=HEAD#l495
- VPP
- Enabled RDMA driver on Arm, and Mellanox DPDK PMD driver is also working on ThunderX2
- https://gerrit.oss.arm.com/#/c/160812/
- Investigate bihash operations, which are hot-spots in L2 throughput
- To confirm with Damjan whether he plans to rewrite the l2-nodes - Lijian
- To confirm whether the CRC32 calculation is compiled with the O2 or O3 option - Lijian
- Benchmarking, profiling and comparison between VPP and testpmd are done. Will review with team. - Lijian
- N1SDP enablement. - Lijian
- GCC-9.2.0 is used with "-march=armv8.2-a+crc+crypto -mtune=neoverse-n1" compiler options.
- Multi-arch, arch-specific compiling and dynamic function selection patch is ready.
- It seems dual loop-unrolling gives better performance than quad loop-unrolling.
- The issue of iova_mode == VA not working has not been root-caused
- DMA mapping between iova and pa: VPP and DPDK use va as iova, and then do the DMA mapping.
- However, the IOMMU on N1SDP requires a limited memory space, less than 40 bits?
- This issue will not be seen with the latest N1 firmware. Upgrading to the latest firmware is pending.
- Share L2/L3/ACL throughput with and without L3 cache - Govind
- Will try with L3 cache enabled to see whether the performance drop with an increasing number of flows is fixed. - Govind
- The degradation is seen even when L3 cache is enabled.
- Trying to enable IPsec on the Arm platform. - Govind
- Create Confluence page to record all the performance benchmarking data - Lijian
- Trying to reproduce the CSIT VPP device testing on local servers - Jieqiang
- Once the VPP device scripts run well on local servers, Jieqiang can investigate more features: IPv4, IPv6, tunnels, IPsec.
- Plans
- N1SDP performance investigation and improvement - Planned - Lijian
- ACL plugin investigation - Planned - Govind & Lijian
- IPsec investigation - Indicative - Govind
- Lockless data-plane investigation by Govind in backlog
- Continue investigating CSIT and CI management script, and run CSIT script on local servers - Jieqiang
- Jieqiang needs 2 Intel NICs to make the test bed ready for VPP Path tests. Jieqiang and Lijian to discuss with Juraj running the Jenkins job on CentOS.
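The iova_mode == VA item under N1SDP enablement concerns virtual addresses being used directly as I/O virtual addresses. If the IOMMU accepts only a limited address width, a buffer whose virtual address needs more bits could not be mapped. The sketch below shows that range check only; the 40-bit limit is taken from the unconfirmed question in the minutes and is an assumption, and `fits_in_iova_width` is a hypothetical helper, not a VPP or DPDK function.

```python
# Assumption: the minutes only speculate "less than 40 bits?"; not confirmed.
ASSUMED_IOVA_BITS = 40


def fits_in_iova_width(start: int, length: int, bits: int = ASSUMED_IOVA_BITS) -> bool:
    """Return True if [start, start + length) is addressable with `bits` bits."""
    if start < 0 or length <= 0:
        raise ValueError("start must be >= 0 and length must be > 0")
    last = start + length - 1  # highest byte address in the range
    return last < (1 << bits)
```

A buffer at a low address passes the check, while one placed high in a wider virtual address space does not, which matches the kind of mismatch the minutes describe.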
04/21/2020
- Attendees
- Govindarajan Mohandoss
- Honnappa Nagarahalli
- Juraj Linkes
- Tina Tsou
- Jieqiang Wang
- General
- CSIT
- VPP Performance Test
- iommu_passthrough=1 does not make any difference on the Taishan server - Lijian
- We cannot do a kernel upgrade with Ubuntu-18.04.1/Ubuntu-18.04.2/Ubuntu-18.04.3/Ubuntu-18.04.4 on Taishan.
- For now, the kernel of the Taishan server can be left as it is. Please confirm with Peter. - Juraj
- One possible option/improvement is to port FD.io CSIT performance testing to some more advanced Arm servers, e.g., Ampere
- VPP Path
- Investigate Ubuntu-20.04 on Arm servers - Juraj & Jieqiang
- Investigate adding CentOS on Arm Jenkins jobs - Juraj & Jieqiang
- CentOS-8 is working fine. Will try CentOS-7 later.
- Is there any gcc version requirement for the official VPP release?
- AES instructions in the VPP source code require a gcc version newer than gcc-8.
- 'make install-deps' fails with CentOS-7 on Arm.
- After fixing a software bug, VPP can boot up normally with 16K/64K page size.
- There are about 4-5 test failures in 'make test' when the system is configured with 16K/64K page size - Lijian
- VPP Device
- VPP device job is running now and will be triggered per VPP patch and CSIT patch
- https://jenkins.fd.io/view/csit/job/csit-vpp-device-master-ubuntu1804-1n-tx2/
- https://jenkins.fd.io/view/vpp/job/vpp-csit-verify-device-master-1n-tx2/
- https://jenkins.fd.io/view/csit/job/csit-vpp-device-master-ubuntu1804-1n-tx2-semiweekly/
- https://jenkins.fd.io/view/csit/job/csit-vpp-device-master-ubuntu1804-1n-tx2-weekly/
- FD.io lab
- The servers, Intel NICs, and Mellanox NICs are working well so far.
- Root-causing the RDMA issue with Mellanox NIC.
- ThunderX2 servers are in Arm local lab. Dean is setting up the hardware.
- Two more ThunderX2 have just been ordered and are expected to arrive in Arm lab in April.
- We are about to purchase two official ThunderX2 servers on the market.
- Raise the budget requirement from CE-OSS - Dean & Honnappa
- Check the ThunderX2 configurations required - Govind & Juraj
- Two ThunderX2 servers are installed in Arm lab.
- All Intel and Mellanox NICs have been verified on ThunderX2-02.
- Intel NIC cannot be enumerated on ThunderX2-01.
- Server info in CSIT docs:
- https://gerrit.fd.io/r/gitweb?p=csit.git;a=blob;f=docs/lab/testbed_specifications.md;h=afa36ff56c7be09621e85bae6a1498aadf3a1981;hb=HEAD
- https://gerrit.fd.io/r/gitweb?p=csit.git;a=blob;f=docs/lab/testbed_specifications.md;h=afa36ff56c7be09621e85bae6a1498aadf3a1981;hb=HEAD#l495
- VPP
- Investigate bihash operations, which are hot-spots in L2 throughput
- To confirm with Damjan whether he plans to rewrite the l2-nodes - Lijian
- To confirm whether the CRC32 calculation is compiled with the O2 or O3 option - Lijian
- Benchmarking, profiling and comparison between VPP and testpmd are done. Will review with team. - Lijian
- N1SDP enablement. - Lijian
- gcc-10 is not working so far.
- GCC-9.2.0 is used with "-march=armv8.2-a+crc+crypto -mtune=neoverse-n1" compiler options.
- Multi-arch, arch-specific compiling and dynamic function selection patch is ready.
- It seems dual loop-unrolling gives better performance than quad loop-unrolling.
- The issue of iova_mode == VA not working has not been root-caused
- DMA mapping between iova and pa: VPP and DPDK use va as iova, and then do the DMA mapping.
- However, the IOMMU on N1SDP requires a limited memory space, less than 40 bits?
- This issue will not be seen with the latest N1 firmware. Upgrading to the latest firmware is pending.
- Share L2/L3/ACL throughput with and without L3 cache - Govind
- Will try with L3 cache enabled to see whether the performance drop with an increasing number of flows is fixed. - Govind
- The degradation is seen even when L3 cache is enabled.
- Trying to enable IPsec on the Arm platform. - Govind
- Create Confluence page to record all the performance benchmarking data - Lijian
- Trying to reproduce the CSIT VPP device testing on local servers - Jieqiang
- Once the VPP device scripts run well on local servers, Jieqiang can investigate more features: IPv4, IPv6, tunnels, IPsec.
- Plans
- N1SDP performance investigation and improvement - Planned - Lijian
- ACL plugin investigation - Planned - Govind & Lijian
- IPsec investigation - Indicative - Govind
- Lockless data-plane investigation by Govind in backlog
- Continue investigating CSIT and CI management script, and run CSIT script on local servers - Jieqiang
- Jieqiang needs 2 Intel NICs to make the test bed ready for VPP Path tests. Jieqiang and Lijian to discuss with Juraj running the Jenkins job on CentOS.
04/14/2020
- Attendees
- Govindarajan Mohandoss
- Honnappa Nagarahalli
- Juraj Linkes
- Tina Tsou
- Jieqiang Wang
- General
- CSIT
- VPP Performance Test
- After upgrading the kernel to version 72, the server cannot boot up normally, so it has been reverted to the previous version.
- Is the Ubuntu-18.04 LTS kernel supposed to be 4.15.72?
- Will try a fresh install on local Taishan servers.
- Will try with Ubuntu-18.04.1/Ubuntu-18.04.2/Ubuntu-18.04.3/Ubuntu-18.04.4
- Will do a fresh installation of Ubuntu-18.04.2 and then install kernel 4.15.72
- VPP Path
- Try iommu_passthrough=1 on Taishan servers and see if it makes any difference - Lijian
- Investigate Ubuntu-20.04 on Arm servers - Juraj & Jieqiang
- Investigate adding CentOS on Arm Jenkins jobs - Juraj & Jieqiang
- CentOS-8 is working fine. Will try CentOS-7 later.
- After fixing a software bug, VPP can boot up normally with 16K/64K page size.
- There are about 4-5 test failures in 'make test' when the system is configured with 16K/64K page size - Lijian
- VPP Device
- VPP device job is running now and will be triggered per VPP patch and CSIT patch
- https://jenkins.fd.io/view/csit/job/csit-vpp-device-master-ubuntu1804-1n-tx2/
- https://jenkins.fd.io/view/vpp/job/vpp-csit-verify-device-master-1n-tx2/
- https://jenkins.fd.io/view/csit/job/csit-vpp-device-master-ubuntu1804-1n-tx2-semiweekly/
- https://jenkins.fd.io/view/csit/job/csit-vpp-device-master-ubuntu1804-1n-tx2-weekly/
- FD.io lab
- ThunderX2 servers are in Arm local lab. Dean is setting up the hardware.
- Two more ThunderX2 have just been ordered and are expected to arrive in Arm lab in April.
- We are about to purchase two official ThunderX2 servers on the market.
- Raise the budget requirement from CE-OSS - Dean & Honnappa
- Check the ThunderX2 configurations required - Govind & Juraj
- Two ThunderX2 servers are installed in Arm lab.
- All Intel and Mellanox NICs have been verified on ThunderX2-02.
- Intel NIC cannot be enumerated on ThunderX2-01.
- Server info in CSIT docs:
- https://gerrit.fd.io/r/gitweb?p=csit.git;a=blob;f=docs/lab/testbed_specifications.md;h=afa36ff56c7be09621e85bae6a1498aadf3a1981;hb=HEAD
- https://gerrit.fd.io/r/gitweb?p=csit.git;a=blob;f=docs/lab/testbed_specifications.md;h=afa36ff56c7be09621e85bae6a1498aadf3a1981;hb=HEAD#l495
- VPP
- Investigate bihash operations, which are hot-spots in L2 throughput
- To confirm with Damjan whether he plans to rewrite the l2-nodes - Lijian
- To confirm whether the CRC32 calculation is compiled with the O2 or O3 option - Lijian
- Benchmarking, profiling and comparison between VPP and testpmd are done. Will review with team. - Lijian
- N1SDP enablement. - Lijian
- GCC-9.2.0 is used with "-march=armv8.2-a+crc+crypto -mtune=neoverse-n1" compiler options.
- Multi-arch, arch-specific compiling and dynamic function selection patch is ready.
- It seems dual loop-unrolling gives better performance than quad loop-unrolling.
- The issue of iova_mode == VA not working has not been root-caused
- DMA mapping between iova and pa: VPP and DPDK use va as iova, and then do the DMA mapping.
- However, the IOMMU on N1SDP requires a limited memory space, less than 40 bits?
- This issue will not be seen with the latest N1 firmware. Upgrading to the latest firmware is pending.
- Share L2/L3/ACL throughput with and without L3 cache - Govind
- Will try with L3 cache enabled to see whether the performance drop with an increasing number of flows is fixed. - Govind
- The degradation is seen even when L3 cache is enabled.
- Trying to enable IPsec on the Arm platform. - Govind
- Create Confluence page to record all the performance benchmarking data - Lijian
- Trying to reproduce the CSIT VPP device testing on local servers - Jieqiang
- Once the VPP device scripts run well on local servers, Jieqiang can investigate more features: IPv4, IPv6, tunnels, IPsec.
- Plans
- N1SDP performance investigation and improvement - Planned - Lijian
- ACL plugin investigation - Planned - Govind & Lijian
- IPsec investigation - Indicative - Govind
- Lockless data-plane investigation by Govind in backlog
- Continue investigating CSIT and CI management script, and run CSIT script on local servers - Jieqiang
- Jieqiang needs 2 Intel NICs to make the test bed ready for VPP Path tests. Jieqiang and Lijian to discuss with Juraj running the Jenkins job on CentOS.
04/07/2020
- Attendees
- Govindarajan Mohandoss
- Honnappa Nagarahalli
- Juraj Linkes
- Tina Tsou
- Jieqiang Wang
- General
- CSIT
- VPP Performance Test
- After upgrading the kernel to version 72, the server cannot boot up normally, so it has been reverted to the previous version.
- Is the Ubuntu-18.04 LTS kernel supposed to be 4.15.72?
- Will try a fresh install on local Taishan servers using cobbler.
- Jieqiang will try a fresh installation of kernel 4.15.72 on the local Taishan server through cobbler.
- VPP Path
- Investigate Ubuntu-20.04 on Arm servers - Juraj & Jieqiang
- Investigate adding CentOS on Arm to Jenkins jobs - Juraj & Jieqiang
- Jieqiang updated the Dockerfile locally to add CentOS as part of CI and is facing some issues.
- After fixing a software bug, VPP can boot up normally with 16K/64K page size.
- There are about 4-5 test failures in 'make test' when the system is configured with 16K/64K page size - Lijian
- VPP Device
- VPP device job is running now and will be triggered per VPP patch and CSIT patch
- https://jenkins.fd.io/view/csit/job/csit-vpp-device-master-ubuntu1804-1n-tx2/
- https://jenkins.fd.io/view/vpp/job/vpp-csit-verify-device-master-1n-tx2/
- https://jenkins.fd.io/view/csit/job/csit-vpp-device-master-ubuntu1804-1n-tx2-semiweekly/
- https://jenkins.fd.io/view/csit/job/csit-vpp-device-master-ubuntu1804-1n-tx2-weekly/
- Need 2 ThunderX2 servers to run the jobs for every VPP/CSIT patch submission instead of every half hour with a new VPP build. The current ThunderX2 server doesn't respond when the jobs are requested to run for every patch submission. No voting rights (+1 from CI) for VPP device suite.
- FD.io lab
- Two more ThunderX2 have just been ordered and are expected to arrive in Arm lab in April.
- We are about to purchase two official ThunderX2 servers on the market.
- Raise the budget requirement from CE-OSS - Dean & Honnappa
- Check the ThunderX2 configurations required - Govind & Juraj
- Two ThunderX2 servers are installed in Arm lab.
- All Intel and Mellanox NICs have been verified on ThunderX2-02.
- Intel NIC cannot be enumerated on ThunderX2-01.
- Server info in CSIT docs:
- https://gerrit.fd.io/r/gitweb?p=csit.git;a=blob;f=docs/lab/testbed_specifications.md;h=afa36ff56c7be09621e85bae6a1498aadf3a1981;hb=HEAD
- https://gerrit.fd.io/r/gitweb?p=csit.git;a=blob;f=docs/lab/testbed_specifications.md;h=afa36ff56c7be09621e85bae6a1498aadf3a1981;hb=HEAD#l495
- VPP
- Vectorization
- https://gerrit.fd.io/r/c/vpp/+/22391 - vlib: vectorized buffer pointer to index with 128-bit SIMD
- https://gerrit.fd.io/r/c/vpp/+/22392 - ethernet: 128-bits vectorized next node selection
- https://gerrit.fd.io/r/c/vpp/+/20273 - vppinfra: vectorize eth_input_adv_and_flags_x4 with 128-bit SIMD
- These patches are kept in backlog for now.
- Investigate bihash operations, which are hot-spots in L2 throughput
- To confirm with Damjan whether he plans to rewrite the l2-nodes - Lijian
- To confirm whether the CRC32 calculation is compiled with the O2 or O3 option - Lijian
- Benchmarking, profiling and comparison between VPP and testpmd are done. Will review with team. - Lijian
- N1SDP enablement. - Lijian
- GCC-9.2.0 is used with "-march=armv8.2-a+crc+crypto -mtune=neoverse-n1" compiler options.
- Multi-arch, arch-specific compiling and dynamic function selection patch is ready.
- It seems dual loop-unrolling gives better performance than quad loop-unrolling.
- The issue of iova_mode == VA not working has not been root-caused
- DMA mapping between iova and pa: VPP and DPDK use va as iova, and then do the DMA mapping.
- However, the IOMMU on N1SDP requires a limited memory space, less than 40 bits?
- This issue will not be seen with the latest N1 firmware. Upgrading to the latest firmware is pending.
- Share L2/L3/ACL throughput with and without L3 cache - Govind
- Will try with L3 cache enabled to see whether the performance drop with an increasing number of flows is fixed. - Govind
- The degradation is seen even when L3 cache is enabled.
- Trying to enable IPsec on the Arm platform. - Govind
- Create Confluence page to record all the performance benchmarking data - Lijian
- Plans
- N1SDP performance investigation and improvement - Planned - Lijian
- ACL plugin investigation - Planned - Govind & Lijian
- IPsec investigation - Indicative - Govind
- Lockless data-plane investigation by Govind in backlog
- Continue investigating CSIT and CI management script, and run CSIT script on local servers - Jieqiang
- Jieqiang needs 2 Intel NICs to make the test bed ready for VPP Path tests. Jieqiang and Lijian to discuss with Juraj running the Jenkins job on CentOS.
03/31/2020
- Attendees
- Govindarajan Mohandoss
- Honnappa Nagarahalli
- Lijian Zhang
- Juraj Linkes
- Tina Tsou
- Jieqiang Wang
- Michaela Tahiri
- General
- CSIT
- VPP Performance Test
- After upgrading the kernel to version 72, the server cannot boot up normally, so it has been reverted to the previous version.
- Is the Ubuntu-18.04 LTS kernel supposed to be 4.15.72?
- Will try a fresh install on local Taishan servers using cobbler.
- Please double-check whether there are any failures in the weekly test - Juraj
- https://docs.fd.io/csit/rls1908/report/
- https://jenkins.fd.io/view/csit/job/csit-vpp-perf-mrr-daily-master-3n-tsh
- Trending wiki: https://docs.fd.io/csit/master/trending/introduction/introduction.html
- Trending page: https://docs.fd.io/csit/master/trending/index.html
- https://docs.fd.io/csit/master/trending/introduction/failures.html#n-tsh
- https://jenkins.fd.io/view/csit/job/csit-vpp-perf-mrr-daily-master-3n-tsh/
- https://logs.fd.io/production/vex-yul-rot-jenkins-1/csit-vpp-perf-mrr-daily-master-3n-tsh/161/archives/log.html.gz
- VPP Path
- Investigate Ubuntu-20.04 on Arm servers - Juraj & Jieqiang
- Investigate adding CentOS on Arm to Jenkins jobs - Juraj & Jieqiang
- After fixing a software bug, VPP can boot up normally with 16K/64K page size.
- There are about 4-5 test failures in 'make test' when the system is configured with 16K/64K page size - Lijian
- VPP Device
- A total of 29 VPP device test cases were executed, and all passed on Arm servers.
- VPP device job is running now and will be triggered per VPP patch and CSIT patch
- https://jenkins.fd.io/view/csit/job/csit-vpp-device-master-ubuntu1804-1n-tx2/
- https://jenkins.fd.io/view/vpp/job/vpp-csit-verify-device-master-1n-tx2/
- https://jenkins.fd.io/view/csit/job/csit-vpp-device-master-ubuntu1804-1n-tx2-semiweekly/
- https://jenkins.fd.io/view/csit/job/csit-vpp-device-master-ubuntu1804-1n-tx2-weekly/
- FD.io lab
- Two more ThunderX2 have just been ordered and are expected to arrive in Arm lab in April.
- We are about to purchase two official ThunderX2 servers on the market.
- Raise the budget requirement from CE-OSS - Dean & Honnappa
- Check the ThunderX2 configurations required - Govind & Juraj
- Two ThunderX2 servers are installed in Arm lab.
- All Intel and Mellanox NICs have been verified on ThunderX2-02.
- Intel NIC cannot be enumerated on ThunderX2-01.
- Server info in CSIT docs:
- https://gerrit.fd.io/r/gitweb?p=csit.git;a=blob;f=docs/lab/testbed_specifications.md;h=afa36ff56c7be09621e85bae6a1498aadf3a1981;hb=HEAD
- https://gerrit.fd.io/r/gitweb?p=csit.git;a=blob;f=docs/lab/testbed_specifications.md;h=afa36ff56c7be09621e85bae6a1498aadf3a1981;hb=HEAD#l495
- VPP
- Vectorization
- https://gerrit.fd.io/r/c/vpp/+/22391 - vlib: vectorized buffer pointer to index with 128-bit SIMD
- https://gerrit.fd.io/r/c/vpp/+/22392 - ethernet: 128-bits vectorized next node selection
- https://gerrit.fd.io/r/c/vpp/+/20273 - vppinfra: vectorize eth_input_adv_and_flags_x4 with 128-bit SIMD
- Investigate bihash operations, which are hot-spots in L2 throughput
- To confirm with Damjan whether he plans to rewrite the l2-nodes - Lijian
- To confirm whether the CRC32 calculation is compiled with the O2 or O3 option - Lijian
- Benchmarking, profiling and comparison between VPP and testpmd are done. Will review with team. - Lijian
- N1SDP enablement. - Lijian
- GCC-9.2.0 is used with "-march=armv8.2-a+crc+crypto -mtune=neoverse-n1" compiler options.
- Multi-arch, arch-specific compiling and dynamic function selection patch is ready.
- Dual loop-unrolling seems to give better performance than quad loop-unrolling.
- The issue of iova_mode == VA not working is not yet root-caused
- DMA mapping between IOVA and PA: VPP and DPDK use the VA as the IOVA, and then do the DMA mapping.
- However, the IOMMU on N1SDP requires a limited address space (less than 40 bits?).
- Share L2/L3/ACL throughput with and without L3 cache - Govind
- Will try with L3 cache enabled to see whether the performance drop with increasing flow count is fixed. - Govind
- Trying to enable IPsec on the Arm platform. - Govind
- Create Confluence page to record all the performance benchmarking data - Lijian
- Plans
- N1SDP performance investigation and improvement - Planned - Lijian
- ACL plugin investigation - Planned - Govind & Lijian
- IPsec investigation - Indicative - Govind
- Lockless data-plane investigation by Govind in backlog
- Continue investigating CSIT and CI management script, and run CSIT script on local servers - Jieqiang
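The iova_mode == VA item above can be illustrated with a small range check. This is only a sketch under stated assumptions, not VPP or DPDK code: the helper name fits_in_iova_width and the sample addresses are hypothetical, and the 40-bit width is the unconfirmed figure from the note above.

```python
# Hypothetical helper for illustration only; not part of VPP or DPDK.
def fits_in_iova_width(start: int, length: int, width_bits: int = 40) -> bool:
    """Return True if the range [start, start + length) lies below 2**width_bits."""
    if length <= 0:
        raise ValueError("length must be positive")
    limit = 1 << width_bits
    return start >= 0 and start + length <= limit


if __name__ == "__main__":
    two_mb = 2 * 1024 * 1024

    # Illustrative user-space virtual address in the upper part of a 48-bit space.
    # If the VA is used directly as the IOVA, it exceeds an assumed 40-bit limit.
    high_va = 0x7F00_0000_0000
    print(fits_in_iova_width(high_va, two_mb))

    # Illustrative address below 2**40, which would fit the assumed limit.
    low_addr = 0x10_0000_0000
    print(fits_in_iova_width(low_addr, two_mb))
```

If the IOMMU address width on N1SDP turns out to be different, only the width_bits argument changes; the check itself stays the same.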
03/24/2020
- Attendees
- Govindarajan Mohandoss
- Honnappa Nagarahalli
- Lijian Zhang
- Juraj Linkes
- Tina Tsou
- Jieqiang Wang
- Michaela Tahiri
- General
- CSIT
- VPP Performance Test
- The kernel upgraded to version 72 cannot boot normally, so it has been reverted to the previous version.
- Please double-check whether there are any failures in the weekly test - Juraj
- https://docs.fd.io/csit/rls1908/report/
- https://jenkins.fd.io/view/csit/job/csit-vpp-perf-mrr-daily-master-3n-tsh
- Trending wiki: https://docs.fd.io/csit/master/trending/introduction/introduction.html
- Trending page: https://docs.fd.io/csit/master/trending/index.html
- VPP Path
- Investigate Ubuntu-20.04 on Arm servers - Juraj & Jieqiang
- Investigate adding CentOS on Arm to Jenkins jobs - Juraj & Jieqiang
- After fixing a software bug, VPP can boot normally with 16K/64K page size.
- There are about 4-5 test failures in 'make test' when the system is configured with 16K/64K page size - Lijian
- We are required to provide justification and use case for cross-compilation for VPP on Arm - Juraj
- Govind will lead the cross compilation justification discussion internally and externally with NXP.
- make build/build-release TARGET_PLATFORM=n1sdp // for n1sdp cross compiling
- make build/build-release // for generic vpp image
- make build/build-release TARGET_PLATFORM=native // for native vpp image
- VPP Device
- A total of 29 VPP device test cases were executed, and all passed on Arm servers.
- VPP device job is running now and will be triggered per VPP patch and CSIT patch
- https://jenkins.fd.io/view/csit/job/csit-vpp-device-master-ubuntu1804-1n-tx2/
- https://jenkins.fd.io/view/vpp/job/vpp-csit-verify-device-master-1n-tx2/
- https://jenkins.fd.io/view/csit/job/csit-vpp-device-master-ubuntu1804-1n-tx2-semiweekly/
- https://jenkins.fd.io/view/csit/job/csit-vpp-device-master-ubuntu1804-1n-tx2-weekly/
- FD.io lab
- Two more ThunderX2 servers have just been ordered and are expected to arrive in the Arm lab in April.
- We are about to purchase two official ThunderX2 servers on the market.
- Raise the budget requirement from CE-OSS - Dean & Honnappa
- Check the ThunderX2 configurations required - Govind & Juraj
- Two ThunderX2 servers are installed in Arm lab.
- All Intel and Mellanox NICs have been verified on ThunderX2-02.
- The Intel NIC cannot be enumerated on ThunderX2-01.
- Server info in CSIT docs:
- https://gerrit.fd.io/r/gitweb?p=csit.git;a=blob;f=docs/lab/testbed_specifications.md;h=afa36ff56c7be09621e85bae6a1498aadf3a1981;hb=HEAD
- https://gerrit.fd.io/r/gitweb?p=csit.git;a=blob;f=docs/lab/testbed_specifications.md;h=afa36ff56c7be09621e85bae6a1498aadf3a1981;hb=HEAD#l495
- VPP
- Vectorization
- https://gerrit.fd.io/r/c/vpp/+/22391 - vlib: vectorized buffer pointer to index with 128-bit SIMD
- https://gerrit.fd.io/r/c/vpp/+/22392 - ethernet: 128-bits vectorized next node selection
- https://gerrit.fd.io/r/c/vpp/+/20273 - vppinfra: vectorize eth_input_adv_and_flags_x4 with 128-bit SIMD
- Investigate the bihash operations that are hot spots in L2 throughput
- Confirm with Damjan whether he plans to rewrite the l2-nodes - Lijian
- Confirm whether the CRC32 calculation is compiled with the O2 or O3 option - Lijian
- Benchmarking, profiling and comparison between VPP and testpmd are done. Will review with the team. - Lijian
- N1SDP enablement. - Lijian
- GCC-9.2.0 is used with "-march=armv8.2-a+crc+crypto -mtune=neoverse-n1" compiler options.
- Multi-arch, arch-specific compiling and dynamic function selection patch is ready.
- Dual loop-unrolling seems to give better performance than quad loop-unrolling.
- The issue of iova_mode == VA not working is not yet root-caused
- DMA mapping between IOVA and PA: VPP and DPDK use the VA as the IOVA, and then do the DMA mapping.
- However, the IOMMU on N1SDP requires a limited address space (less than 40 bits?).
- Share L2/L3/ACL throughput with and without L3 cache - Govind
- Will try with L3 cache enabled to see whether the performance drop with increasing flow count is fixed. - Govind
- Create Confluence page to record all the performance benchmarking data - Lijian
- Plans
- N1SDP performance investigation and improvement - Planned - Lijian
- ACL plugin investigation - Planned - Govind & Lijian
- IPsec investigation - Indicative - Govind
- Lockless data-plane investigation by Govind in backlog
- Continue investigating CSIT and CI management script, and run CSIT script on local servers - Jieqiang
03/17/2020
- Attendees
- Govindarajan Mohandoss
- Honnappa Nagarahalli
- Lijian Zhang
- Juraj Linkes
- Tina Tsou
- Jieqiang Wang
- Michaela Tahiri
- General
- CSIT
- VPP Performance Test
- The kernel upgraded to version 72 cannot boot normally, so it has been reverted to the previous version.
- Please double-check whether there are any failures in the weekly test - Juraj
- https://docs.fd.io/csit/rls1908/report/
- https://jenkins.fd.io/view/csit/job/csit-vpp-perf-mrr-daily-master-3n-tsh
- Trending wiki: https://docs.fd.io/csit/master/trending/introduction/introduction.html
- Trending page: https://docs.fd.io/csit/master/trending/index.html
- VPP Path
- Investigate Ubuntu-20.04 on Arm servers - Juraj & Jieqiang
- Investigate adding CentOS on Arm to Jenkins jobs - Juraj & Jieqiang
- After fixing a software bug, VPP can boot normally with 16K/64K page size.
- There are about 4-5 test failures in 'make test' when the system is configured with 16K/64K page size - Lijian
- We are required to provide justification and use case for cross-compilation for VPP on Arm - Juraj
- Govind will lead the cross compilation justification discussion internally and externally with NXP.
- VPP Device
- Issue of huge pages running out has been resolved by resetting the servers.
- A total of 29 VPP device test cases were executed, and all passed on Arm servers.
- VPP device job is running now and will be triggered per VPP patch and CSIT patch
- https://jenkins.fd.io/view/csit/job/csit-vpp-device-master-ubuntu1804-1n-tx2/
- https://jenkins.fd.io/view/vpp/job/vpp-csit-verify-device-master-1n-tx2/
- https://jenkins.fd.io/view/csit/job/csit-vpp-device-master-ubuntu1804-1n-tx2-semiweekly/
- https://jenkins.fd.io/view/csit/job/csit-vpp-device-master-ubuntu1804-1n-tx2-weekly/
- FD.io lab
- Two more ThunderX2 servers have just been ordered and are expected to arrive in the Arm lab in April.
- We are about to purchase two official ThunderX2 servers on the market.
- Raise the budget requirement from CE-OSS - Dean & Honnappa
- Check the ThunderX2 configurations required - Govind & Juraj
- Two ThunderX2 servers are installed in Arm lab.
- All Intel and Mellanox NICs have been verified on ThunderX2-02.
- The Intel NIC cannot be enumerated on ThunderX2-01.
- Server info in CSIT docs:
- https://gerrit.fd.io/r/gitweb?p=csit.git;a=blob;f=docs/lab/testbed_specifications.md;h=afa36ff56c7be09621e85bae6a1498aadf3a1981;hb=HEAD
- https://gerrit.fd.io/r/gitweb?p=csit.git;a=blob;f=docs/lab/testbed_specifications.md;h=afa36ff56c7be09621e85bae6a1498aadf3a1981;hb=HEAD#l495
- VPP
- Vectorization
- https://gerrit.fd.io/r/c/vpp/+/22391 - vlib: vectorized buffer pointer to index with 128-bit SIMD
- https://gerrit.fd.io/r/c/vpp/+/22392 - ethernet: 128-bits vectorized next node selection
- https://gerrit.fd.io/r/c/vpp/+/20273 - vppinfra: vectorize eth_input_adv_and_flags_x4 with 128-bit SIMD
- MAP with VPP - Usage of MAP is recorded in confluence
- https://confluence.arm.com/display/BSGSoftware/An+introduction+to+using+MAP+with+VPP
- Patch is upstreamed for community review
- VPP+DPDK (5.18Mpps/5.21Mpps) vs VPP+AVF (8.39Mpps/8.38Mpps) on ThunderX2.
- Creating an AVF interface failed on SMP CPU on N1SDP/Qualcomm - Jieqiang
- Confirm if community agrees with patch - Lijian
- Check how DPDK is detecting numa-id for a specific NIC device - Lijian
- Investigate the bihash operations that are hot spots in L2 throughput
- Confirm with Damjan whether he plans to rewrite the l2-nodes - Lijian
- Confirm whether the CRC32 calculation is compiled with the O2 or O3 option - Lijian
- Benchmarking, profiling and comparison between VPP and testpmd are done. Will review with the team. - Lijian
- N1 SDP enablement. - Lijian
- GCC-9.2.0 is used with "-march=armv8.2-a+crc+crypto -mtune=neoverse-n1" compiler options.
- Sending Govind the steps on installing GCC-9.2.0
- Multi-arch, arch-specific compiling and dynamic function selection patch is ready.
- Dual loop-unrolling seems to give better performance than quad loop-unrolling.
- Will sync up with James Yang about cache line fill buffers
- Investigating mtrie data structure on control-plane. - Govind
- 3% gain in macro-benchmarking from adding prefetches to the adj table on ThunderX2
- The issue of iova_mode == VA not working is not yet root-caused
- DMA mapping between IOVA and PA: VPP and DPDK use the VA as the IOVA, and then do the DMA mapping.
- However, the IOMMU on N1SDP requires a limited address space (less than 40 bits?).
- Share L2/L3/ACL throughput with and without L3 cache - Govind
- Create Confluence page to record all the performance benchmarking data - Lijian
- Plans
- N1SDP performance investigation and improvement - Planned - Lijian
- ACL plugin investigation - Planned - Govind & Lijian
- IPsec investigation - Indicative - Govind
- Lockless data-plane investigation by Govind in backlog
- Continue investigating CSIT and CI management script, and run CSIT script on local servers - Jieqiang
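The VPP+DPDK and VPP+AVF throughput figures recorded above for ThunderX2 can be turned into a relative gain with a few lines of arithmetic. This is only a sketch: the function name relative_gain is hypothetical, and pairing the first DPDK value with the first AVF value (and the second with the second) is an assumption, since the minutes do not say what the two values per configuration represent.

```python
# Hypothetical helper for illustration; the Mpps values are the ones in the minutes.
def relative_gain(baseline_mpps: float, candidate_mpps: float) -> float:
    """Return the percentage by which candidate_mpps exceeds baseline_mpps."""
    if baseline_mpps <= 0:
        raise ValueError("baseline must be positive")
    return (candidate_mpps / baseline_mpps - 1.0) * 100.0


if __name__ == "__main__":
    # (VPP+DPDK, VPP+AVF) pairs; the pairing itself is an assumption.
    pairs = [(5.18, 8.39), (5.21, 8.38)]
    for dpdk, avf in pairs:
        gain = relative_gain(dpdk, avf)
        print(f"DPDK {dpdk} Mpps -> AVF {avf} Mpps: {gain:.1f}% higher")
```

Under that pairing, both measurements put AVF roughly 60% ahead of DPDK on this platform.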
03/10/2020
- Attendees
- Govindarajan Mohandoss
- Honnappa Nagarahalli
- Lijian Zhang
- Juraj Linkes
- Tina Tsou
- Jieqiang Wang
- Michaela Tahiri
- General
- CSIT
- VPP Performance Test
- The kernel upgraded to version 72 cannot boot normally, so it has been reverted to the previous version.
- Please double-check whether there are any failures in the weekly test - Juraj
- https://docs.fd.io/csit/rls1908/report/
- https://jenkins.fd.io/view/csit/job/csit-vpp-perf-mrr-daily-master-3n-tsh
- Trending wiki: https://docs.fd.io/csit/master/trending/introduction/introduction.html
- Trending page: https://docs.fd.io/csit/master/trending/index.html
- VPP Path
- Investigate Ubuntu-20.04 on Arm servers - Juraj & Jieqiang
- Investigate adding CentOS on Arm to Jenkins jobs - Juraj & Jieqiang
- After fixing a software bug, VPP can boot normally with 16K/64K page size.
- There are about 4-5 test failures in 'make test' when the system is configured with 16K/64K page size - Lijian
- We are required to provide justification and use case for cross-compilation for VPP on Arm - Juraj
- Govind will lead the cross compilation justification discussion internally and externally with NXP.
- VPP Device
- Issue of huge pages running out has been resolved by resetting the servers.
- A total of 29 VPP device test cases were executed, and all passed on Arm servers.
- VPP device job is running now and will be triggered per VPP patch and CSIT patch
- https://jenkins.fd.io/view/csit/job/csit-vpp-device-master-ubuntu1804-1n-tx2/
- https://jenkins.fd.io/view/vpp/job/vpp-csit-verify-device-master-1n-tx2/
- https://jenkins.fd.io/view/csit/job/csit-vpp-device-master-ubuntu1804-1n-tx2-semiweekly/
- https://jenkins.fd.io/view/csit/job/csit-vpp-device-master-ubuntu1804-1n-tx2-weekly/
- FD.io lab
- Two more ThunderX2 servers have just been ordered and are expected to arrive in the Arm lab in April.
- We are about to purchase two official ThunderX2 servers on the market.
- Raise the budget requirement from CE-OSS - Dean & Honnappa
- Check the ThunderX2 configurations required - Govind & Juraj
- Two ThunderX2 servers are installed in Arm lab.
- All Intel and Mellanox NICs have been verified on ThunderX2-02.
- The Intel NIC cannot be enumerated on ThunderX2-01.
- Server info in CSIT docs:
- https://gerrit.fd.io/r/gitweb?p=csit.git;a=blob;f=docs/lab/testbed_specifications.md;h=afa36ff56c7be09621e85bae6a1498aadf3a1981;hb=HEAD
- https://gerrit.fd.io/r/gitweb?p=csit.git;a=blob;f=docs/lab/testbed_specifications.md;h=afa36ff56c7be09621e85bae6a1498aadf3a1981;hb=HEAD#l495
- VPP
- Vectorization
- https://gerrit.fd.io/r/c/vpp/+/22391 - vlib: vectorized buffer pointer to index with 128-bit SIMD
- https://gerrit.fd.io/r/c/vpp/+/22392 - ethernet: 128-bits vectorized next node selection
- https://gerrit.fd.io/r/c/vpp/+/20273 - vppinfra: vectorize eth_input_adv_and_flags_x4 with 128-bit SIMD
- MAP with VPP - Usage of MAP is recorded in confluence
- https://confluence.arm.com/display/BSGSoftware/An+introduction+to+using+MAP+with+VPP
- Check if detecting the source of SIGPROF is possible - Govind
- Confirm with Community about the possible solutions to this issue - Lijian
- Benchmarking AVF drivers on Arm servers - Jieqiang
- VPP+DPDK (5.18Mpps/5.21Mpps) vs VPP+AVF (8.39Mpps/8.38Mpps) on ThunderX2.
- Creating an AVF interface failed on SMP CPU on N1SDP/Qualcomm - Jieqiang
- Confirm if community agrees with patch - Lijian
- Check how DPDK is detecting numa-id for a specific NIC device - Lijian
- Investigate the bihash operations that are hot spots in L2 throughput
- Confirm with Damjan whether he plans to rewrite the l2-nodes - Lijian
- Confirm whether the CRC32 calculation is compiled with the O2 or O3 option - Lijian
- Benchmarking, profiling and comparison between VPP and testpmd are done. Will review with the team. - Lijian
- Fix Makefile issue recently introduced on Arm machine - Jieqiang
- Patch is upstreamed for code review - https://gerrit.fd.io/r/c/vpp/+/25259
- N1 SDP enablement. - Lijian
- GCC-9.2.0 is used with "-march=armv8.2-a+crc+crypto -mtune=neoverse-n1" compiler options.
- Sending Govind the steps on installing GCC-9.2.0
- Multi-arch, arch-specific compiling and dynamic function selection patch is ready.
- Dual loop-unrolling seems to give better performance than quad loop-unrolling.
- Will sync up with James Yang about cache line fill buffers
- Investigating mtrie data structure on control-plane. - Govind
- 3% gain in macro-benchmarking from adding prefetches to the adj table on ThunderX2
- The issue of iova_mode == VA not working is not yet root-caused
- DMA mapping between IOVA and PA: VPP and DPDK use the VA as the IOVA, and then do the DMA mapping.
- However, the IOMMU on N1SDP requires a limited address space (less than 40 bits?).
- Share L2/L3/ACL throughput with and without L3 cache - Govind
- Create Confluence page to record all the performance benchmarking data - Lijian
03/03/2020
- Attendees
- Govindarajan Mohandoss
- Honnappa Nagarahalli
- Lijian Zhang
- Juraj Linkes
- Tina Tsou
- Jieqiang Wang
- Michaela Tahiri
- General
- CSIT
- VPP Performance Test
- Please double-check whether there are any failures in the weekly test - Juraj
- https://docs.fd.io/csit/rls1908/report/
- https://jenkins.fd.io/view/csit/job/csit-vpp-perf-mrr-daily-master-3n-tsh
- Trending wiki: https://docs.fd.io/csit/master/trending/introduction/introduction.html
- Trending page: https://docs.fd.io/csit/master/trending/index.html
- VPP Path
- Investigate Ubuntu-20.04 on Arm servers - Juraj & Jieqiang
- Investigate adding CentOS on Arm to Jenkins jobs - Juraj & Jieqiang
- After fixing a software bug, VPP can boot normally with 16K/64K page size.
- There are about 4-5 test failures in 'make test' when the system is configured with 16K/64K page size - Lijian
- We are required to provide justification and use case for cross-compilation for VPP on Arm - Juraj
- Govind will lead the cross compilation justification discussion internally and externally with NXP.
- VPP Device
- Issue of huge pages running out has been resolved by resetting the servers.
- A total of 29 VPP device test cases were executed, and all passed on Arm servers.
- VPP device job is running now and will be triggered per VPP patch and CSIT patch
- https://jenkins.fd.io/view/csit/job/csit-vpp-device-master-ubuntu1804-1n-tx2/
- https://jenkins.fd.io/view/vpp/job/vpp-csit-verify-device-master-1n-tx2/
- https://jenkins.fd.io/view/csit/job/csit-vpp-device-master-ubuntu1804-1n-tx2-semiweekly/
- https://jenkins.fd.io/view/csit/job/csit-vpp-device-master-ubuntu1804-1n-tx2-weekly/
- FD.io lab
- The current ThunderX2 in Arm lab are pre-production servers.
- We are about to purchase two official ThunderX2 servers on the market.
- Raise the budget requirement from CE-OSS - Dean & Honnappa
- Check the ThunderX2 configurations required - Govind & Juraj
- Two ThunderX2 servers are installed in Arm lab.
- All Intel and Mellanox NICs have been verified on ThunderX2-02.
- The Intel NIC cannot be enumerated on ThunderX2-01.
- Server info in CSIT docs:
- https://gerrit.fd.io/r/gitweb?p=csit.git;a=blob;f=docs/lab/testbed_specifications.md;h=afa36ff56c7be09621e85bae6a1498aadf3a1981;hb=HEAD
- https://gerrit.fd.io/r/gitweb?p=csit.git;a=blob;f=docs/lab/testbed_specifications.md;h=afa36ff56c7be09621e85bae6a1498aadf3a1981;hb=HEAD#l495
- VPP
- Vectorization
- https://gerrit.fd.io/r/c/vpp/+/22391 - vlib: vectorized buffer pointer to index with 128-bit SIMD
- https://gerrit.fd.io/r/c/vpp/+/22392 - ethernet: 128-bits vectorized next node selection
- https://gerrit.fd.io/r/c/vpp/+/20273 - vppinfra: vectorize eth_input_adv_and_flags_x4 with 128-bit SIMD
- MAP with VPP - Usage of MAP is recorded in confluence
- Benchmarking AVF drivers on Arm servers - Jieqiang
- VPP+DPDK (5.18Mpps/5.21Mpps) vs VPP+AVF (8.39Mpps/8.38Mpps) on ThunderX2.
- Creating an AVF interface failed on SMP CPU on N1SDP/Qualcomm - Jieqiang
- Investigate the bihash operations that are hot spots in L2 throughput
- Confirm with Damjan whether he plans to rewrite the l2-nodes - Lijian
- Confirm whether the CRC32 calculation is compiled with the O2 or O3 option - Lijian
- Benchmarking, profiling and comparison between VPP and testpmd are done. Will review with the team. - Lijian
- Fix Makefile issue recently introduced on Arm machine - Jieqiang
- Patch is upstreamed for code review - https://gerrit.fd.io/r/c/vpp/+/25259
- Investigating memory copy in ip4-rewrite on ThunderX2 - Govind
- Check the assembly code with other Arm CPU also.
- Send Govind the memory copy with fixed length.
- N1 SDP enablement. - Lijian
- GCC-9.2.0 is used with "-march=armv8.2-a+crc+crypto -mtune=neoverse-n1" compiler options.
- Sending Govind the steps on installing GCC-9.2.0
- Multi-arch, arch-specific compiling and dynamic function selection patch is ready.
- Dual loop-unrolling seems to give better performance than quad loop-unrolling.
- Will sync up with James Yang about cache line fill buffers
- Investigating mtrie data structure on control-plane. - Govind
- 3% gain in macro-benchmarking from adding prefetches to the adj table on ThunderX2
- The issue of iova_mode == VA not working is not yet root-caused
- DMA mapping between IOVA and PA: VPP and DPDK use the VA as the IOVA, and then do the DMA mapping.
- However, the IOMMU on N1SDP requires a limited address space (less than 40 bits?).
- Share L2/L3/ACL throughput with and without L3 cache - Govind
- Create Confluence page to record all the performance benchmarking data - Lijian
02/25/2020
- Attendees
- Govindarajan Mohandoss
- Honnappa Nagarahalli
- Lijian Zhang
- Juraj Linkes
- Tina Tsou
- Jieqiang Wang
- Michaela Tahiri
- General
- CSIT
- VPP Performance Test
- Govind will talk with George Zhao about which Taishan fw version supports the Meltdown issue.
- Huawei is investigating which fw version of the Taishan server supports the Meltdown issue and will update us soon.
- Please double-check whether there are any failures in the weekly test - Juraj
- https://docs.fd.io/csit/rls1908/report/
- https://jenkins.fd.io/view/csit/job/csit-vpp-perf-mrr-daily-master-3n-tsh
- Trending wiki: https://docs.fd.io/csit/master/trending/introduction/introduction.html
- Trending page: https://docs.fd.io/csit/master/trending/index.html
- VPP Path
- After fixing a software bug, VPP can boot normally with 16K/64K page size.
- There are about 4-5 test failures in 'make test' when the system is configured with 16K/64K page size - Lijian
- We are required to provide justification and use case for cross-compilation for VPP on Arm - Juraj
- Govind will lead the cross compilation justification discussion internally and externally with NXP.
- VPP Device
- Issue of huge pages running out has been resolved by resetting the servers.
- A total of 29 VPP device test cases were executed, and all passed on Arm servers.
- VPP device job is running now and will be triggered per VPP patch and CSIT patch
- https://jenkins.fd.io/view/csit/job/csit-vpp-device-master-ubuntu1804-1n-tx2/
- https://jenkins.fd.io/view/vpp/job/vpp-csit-verify-device-master-1n-tx2/
- https://jenkins.fd.io/view/csit/job/csit-vpp-device-master-ubuntu1804-1n-tx2-semiweekly/
- https://jenkins.fd.io/view/csit/job/csit-vpp-device-master-ubuntu1804-1n-tx2-weekly/
- FD.io lab
- The current ThunderX2 in Arm lab are pre-production servers.
- We are about to purchase two official ThunderX2 servers on the market.
- Raise the budget requirement from CE-OSS - Dean & Honnappa
- Check the ThunderX2 configurations required - Govind & Juraj
- Two ThunderX2 servers are installed in Arm lab.
- All Intel and Mellanox NICs have been verified on ThunderX2-02.
- The Intel NIC cannot be enumerated on ThunderX2-01.
- Server info in CSIT docs:
- https://gerrit.fd.io/r/gitweb?p=csit.git;a=blob;f=docs/lab/testbed_specifications.md;h=afa36ff56c7be09621e85bae6a1498aadf3a1981;hb=HEAD
- https://gerrit.fd.io/r/gitweb?p=csit.git;a=blob;f=docs/lab/testbed_specifications.md;h=afa36ff56c7be09621e85bae6a1498aadf3a1981;hb=HEAD#l495
- VPP
- Vectorization
- https://gerrit.fd.io/r/c/vpp/+/22391 - vlib: vectorized buffer pointer to index with 128-bit SIMD
- https://gerrit.fd.io/r/c/vpp/+/22392 - ethernet: 128-bits vectorized next node selection
- https://gerrit.fd.io/r/c/vpp/+/20273 - vppinfra: vectorize eth_input_adv_and_flags_x4 with 128-bit SIMD
- MAP with VPP - Usage of MAP is recorded in confluence
- Benchmarking AVF drivers on Arm servers - Jieqiang
- VPP+DPDK (5.18Mpps/5.21Mpps) vs VPP+AVF (8.39Mpps/8.38Mpps) on ThunderX2.
- Creating an AVF interface failed on SMP CPU on N1SDP/Qualcomm - Jieqiang
- Investigate the bihash operations that are hot spots in L2 throughput
- Confirm with Damjan whether he plans to rewrite the l2-nodes - Lijian
- Confirm whether the CRC32 calculation is compiled with the O2 or O3 option - Lijian
- Benchmarking, profiling and comparison between VPP and testpmd are done. Will review with the team. - Lijian
- Fix Makefile issue recently introduced on Arm machine - Jieqiang
- Patch is upstreamed for code review - https://gerrit.fd.io/r/c/vpp/+/25259
- Investigating memory copy in ip4-rewrite on ThunderX2 - Govind
- Check the assembly code with other Arm CPU also.
- Send Govind the memory copy with fixed length.
- N1 SDP enablement. - Lijian
- GCC-9.2.0 is used with "-march=armv8.2-a+crc+crypto -mtune=neoverse-n1" compiler options.
- Sending Govind the steps on installing GCC-9.2.0
- Multi-arch, arch-specific compiling and dynamic function selection patch is ready.
- Dual loop-unrolling seems to give better performance than quad loop-unrolling.
- Will sync up with James Yang about cache line fill buffers
- Investigating mtrie data structure on control-plane. - Govind
- 3% gain in macro-benchmarking from adding prefetches to the adj table on ThunderX2
02/18/2020
- Attendees
- Govindarajan Mohandoss
- Honnappa Nagarahalli
- Lijian Zhang
- Juraj Linkes
- Tina Tsou
- General
- CSIT
- VPP Performance Test
- VM-VHost test failing on 3n-tsh server.
- Issue with testpmd failure in VM has been resolved and merged.
- Govind will talk with George about which Taishan fw version supports the Meltdown issue.
- Please double-check whether there are any failures in the weekly test - Juraj
- https://docs.fd.io/csit/rls1908/report/
- https://jenkins.fd.io/view/csit/job/csit-vpp-perf-mrr-daily-master-3n-tsh
- Trending data recorded with https://docs.fd.io/csit/master/trending/introduction/introduction.html
- Trending page: https://docs.fd.io/csit/master/trending/index.html
- VPP Path
- After fixing a software bug, VPP can boot normally with 16K/64K page size.
- There are about 4-5 test failures in 'make test' when the system is configured with 16K/64K page size - Lijian
- Will discuss the cross-compilation with QEMU emulation solution in the monthly VPP call tomorrow - Juraj
- Govind will lead the cross compilation justification discussion internally and with NXP.
- VPP crash issue on Taishan server is resolved and patch is resolved.
- ThunderX2 has the same issue and has been resolved also.
- VPP Device
- Issue of huge pages running out has been resolved by resetting the servers.
- A total of 29 VPP device test cases were executed, and all passed on Arm servers.
- VPP device job is running now and will be triggered per VPP patch and CSIT patch
- https://jenkins.fd.io/view/csit/job/csit-vpp-device-master-ubuntu1804-1n-tx2/
- https://jenkins.fd.io/view/vpp/job/vpp-csit-verify-device-master-1n-tx2/
- https://jenkins.fd.io/view/csit/job/csit-vpp-device-master-ubuntu1804-1n-tx2-semiweekly/
- https://jenkins.fd.io/view/csit/job/csit-vpp-device-master-ubuntu1804-1n-tx2-weekly/
- FD.io lab
- Two ThunderX2 servers are installed in Arm lab.
- All Intel and Mellanox NICs have been verified on ThunderX2-02.
- The Intel NIC cannot be enumerated on ThunderX2-01.
- Marvell engineers claim ThunderX2 does not support the Intel i40e NIC, which does not seem correct.
- Server info in CSIT docs:
- https://gerrit.fd.io/r/gitweb?p=csit.git;a=blob;f=docs/lab/testbed_specifications.md;h=afa36ff56c7be09621e85bae6a1498aadf3a1981;hb=HEAD
- https://gerrit.fd.io/r/gitweb?p=csit.git;a=blob;f=docs/lab/testbed_specifications.md;h=afa36ff56c7be09621e85bae6a1498aadf3a1981;hb=HEAD#l495
- VPP
- Vectorization
- https://gerrit.fd.io/r/c/vpp/+/22391 - vlib: vectorized buffer pointer to index with 128-bit SIMD
- https://gerrit.fd.io/r/c/vpp/+/22392 - ethernet: 128-bits vectorized next node selection
- https://gerrit.fd.io/r/c/vpp/+/20273 - vppinfra: vectorize eth_input_adv_and_flags_x4 with 128-bit SIMD
- MAP with VPP - error is resolved. Sort of working. Record the details.
- Usage of MAP is recorded in confluence
- Will update the patch to ignore the SIGPROF signal - Jieqiang
- Patch is updated by adding more comments. - Jieqiang
- Benchmarking AVF drivers on Arm servers - Jieqiang
- VPP+DPDK (5.18Mpps/5.21Mpps) vs VPP+AVF (8.39Mpps/8.38Mpps) on ThunderX2.
- Creating an AVF interface failed on SMP CPU on N1SDP/Qualcomm - Jieqiang
- Investigate the bihash operations that are hot spots in L2 throughput
- Confirm with Damjan whether he plans to rewrite the l2-nodes - Lijian
- Confirm whether the CRC32 calculation is compiled with the O2 or O3 option - Lijian
- Benchmarking, profiling and comparison between VPP and testpmd are done. Will review with the team. - Lijian
- Fix Makefile issue recently introduced on Arm machine - Jieqiang
- Patch is ready for code review.
- Investigating memory in ip4-rewrite on ThunderX2 - Govind
- Check the assembly with other Arm CPU also.
- N1 SDP enablement. - Lijian
- GCC-9.2.0 is used with "-march=armv8.2-a+crc+crypto -mtune=neoverse-n1" compiler options.
- Multi-arch, arch-specific compiling and dynamic function selection patch is ready.
- Dual loop-unrolling seems to give better performance than quad loop-unrolling.
- Investigating mtrie data structure on control-plane. - Govind
- 3% gain in macro-benchmarking from adding prefetches to the adj table on ThunderX2
02/11/2020
- Attendees
- Govindarajan Mohandoss
- Honnappa Nagarahalli
- Lijian Zhang
- Juraj Linkes
- Tina Tsou
- General
- CSIT
- VPP Performance Test
- VM-VHost test failing on 3n-tsh server.
- Tina to confirm which BIOS version on the Taishan server supports Meltdown.
- NICs cannot be bound to the VFIO_PCI driver in the VM, which caused the failure.
- Will try iommu-passthrough=0/1 - Juraj
- Will confirm with Joyce about this issue - Lijian
- Please double-check whether there are any failures in the weekly test - Juraj
- https://docs.fd.io/csit/rls1908/report/
- https://jenkins.fd.io/view/csit/job/csit-vpp-perf-mrr-daily-master-3n-tsh
- Trending data recorded with https://docs.fd.io/csit/master/trending/introduction/introduction.html
- Trending page: https://docs.fd.io/csit/master/trending/index.html
- VPP Path
- After fixing a software bug, VPP can boot normally with 16K/64K page size.
- There are about 4-5 test failures in 'make test' when the system is configured with 16K/64K page size - Lijian
- Will discuss the cross-compilation with QEMU emulation solution in the monthly VPP call tomorrow - Juraj
- VPP Device
- A total of 29 VPP device test cases were executed, and all passed on Arm servers.
- VPP device job is running now and will be triggered per VPP patch and CSIT patch
- https://jenkins.fd.io/view/csit/job/csit-vpp-device-master-ubuntu1804-1n-tx2/
- https://jenkins.fd.io/view/vpp/job/vpp-csit-verify-device-master-1n-tx2/
- https://jenkins.fd.io/view/csit/job/csit-vpp-device-master-ubuntu1804-1n-tx2-semiweekly/
- https://jenkins.fd.io/view/csit/job/csit-vpp-device-master-ubuntu1804-1n-tx2-weekly/
- FD.io lab
- Two ThunderX2 servers are installed in Arm lab.
- All Intel and Mellanox NICs have been verified on ThunderX2-02.
- The Intel NIC cannot be enumerated on ThunderX2-01.
- Marvell engineers claim ThunderX2 does not support the Intel i40e NIC, which does not seem correct.
- Server info in CSIT docs:
- https://gerrit.fd.io/r/gitweb?p=csit.git;a=blob;f=docs/lab/testbed_specifications.md;h=afa36ff56c7be09621e85bae6a1498aadf3a1981;hb=HEAD
- https://gerrit.fd.io/r/gitweb?p=csit.git;a=blob;f=docs/lab/testbed_specifications.md;h=afa36ff56c7be09621e85bae6a1498aadf3a1981;hb=HEAD#l495
- VPP
- Vectorization
- https://gerrit.fd.io/r/c/vpp/+/22391 - vlib: vectorized buffer pointer to index with 128-bit SIMD
- https://gerrit.fd.io/r/c/vpp/+/22392 - ethernet: 128-bits vectorized next node selection
- https://gerrit.fd.io/r/c/vpp/+/20273 - vppinfra: vectorize eth_input_adv_and_flags_x4 with 128-bit SIMD
- MAP with VPP - error is resolved. Sort of working. Record the details.
- Usage of MAP is recorded in confluence
- Will update the patch to ignore the SIGPROF signal - Jieqiang
- Benchmarking AVF drivers on Arm servers - Jieqiang
- VPP+DPDK (5.18Mpps/5.21Mpps) vs VPP+AVF (8.39Mpps/8.38Mpps) on ThunderX2.
- AVF failed to create AVF interface on SMP CPU on N1SDP/Qualcomm - Jieqiang
- Investigate bihash operations, which are hot spots in L2 throughput
- Confirm with Damjan whether he plans to rewrite the l2 nodes - Lijian
- Confirm whether the CRC32 calculation is compiled with the O2 or O3 option - Lijian
- Benchmarking, profiling, and comparison between VPP and testpmd are done. Will review with the team. - Lijian
- Fix Makefile issue recently introduced on Arm machine - Jieqiang
- Investigating memory in ip4-rewrite on ThunderX2 - Govind
- Also check the assembly on other Arm CPUs.
02/04/2020
- Attendees
- Govindarajan Mohandoss
- Honnappa Nagarahalli
- Lijian Zhang
- Juraj Linkes
- Tina Tsou
- General
- CSIT
- VPP Performance Test
- VM-VHost test failing on 3n-tsh server.
- Govind to send background details about Taishan kernel upgrade to Tina to confirm with George Zhao.
- The VM-VHost test cases have never passed before as per the previous logs in Taishan server.
- Issue is not reproducible locally - VHost/Virtual Ethernet interface creation passes in Taishan server in local setup.
- Next Steps: Follow up with Peter Mikus to debug the issue in Taishan server in CSIT lab.
- Build a local test setup to run the Testpmd application in VM.
- Performance data on Arm in official release 19.08 is available.
- Performance job is merged. https://jenkins.fd.io/view/csit/job/csit-vpp-perf-mrr-daily-master-3n-tsh
- Trending data recorded with https://docs.fd.io/csit/master/trending/introduction/introduction.html
- Trending page: https://docs.fd.io/csit/master/trending/index.html
- The 'show pci' command causes a crash, which affects all performance tests on the Taishan server only.
- Issue is root-caused and the patch is in community review - https://gerrit.fd.io/r/c/vpp/+/23849
- 'show pci' is temporarily replaced with 'show ver'. Performance tests are now running fine.
- Will change the performance job from weekly to daily runs. - Juraj
- Patch to resolve the "show pci" crash issue is merged. Will ask the CSIT team to remove the workaround. - Lijian
- Have upgraded Python2 to Python3 successfully.
- Ask the CSIT community how to identify performance hold/stop issues as soon as possible - Juraj
- Will send an email to CSIT-dev on how to avoid similar issues.
- VPP Path
- Verifying VPP on Centos/Arm - Juraj
- After fixing a software bug, VPP boots normally with 16K/64K page sizes.
- Will investigate 4-5 test failures in 'make test' - Lijian
- Looks like it would be only build and unit test. VPP device and performance tests would run on the physical devices in the lab.
- cross compilation in VPP PATH. Jira(Juraj): https://jira.fd.io/browse/CTP-3
- Tried cross-compiling with DPDK only.
- Initial cross-compiling is working fine.
- Patch is under review. https://gerrit.fd.io/r/#/c/vpp/+/21035/
- There are issues building VPP distros.
- What is not finished is packaging. Currently we support Ubuntu's DEB packages for the aarch64 architecture (make PLATFORM=aarch64-generic pkg-deb); this is what I am currently trying to sort out ...
- We can put the cross-compiling knowledge into the 'for developers' section of the vpp-docs project.
- Doesn't work; seems to be caused by improper cross-building tools. https://wiki.fd.io/view/VPP/Build_System_Deep_Dive
- https://gerrit.fd.io/r/#/c/vpp/+/21035/, VPP path cross compilation
- setup proper platform specific folder names in DEB packages
- proper architecture string included in the DEB package name
- Loop Nitin, Sachin, Honnappa, Lijian in container cross-compilation discussion.
- VPP Device
- A total of 29 VPP device test cases were executed, and all passed on Arm servers.
- Currently VPP device is not executed per patch. Issue is still under investigation.
- VPP device job is running now and will be triggered per VPP patch and CSIT patch
- https://jenkins.fd.io/view/csit/job/csit-vpp-device-master-ubuntu1804-1n-tx2/
- https://jenkins.fd.io/view/vpp/job/vpp-csit-verify-device-master-1n-tx2/
- https://jenkins.fd.io/view/csit/job/csit-vpp-device-master-ubuntu1804-1n-tx2-semiweekly/
- https://jenkins.fd.io/view/csit/job/csit-vpp-device-master-ubuntu1804-1n-tx2-weekly/
- VPP Performance Test
- FD.io lab
- Two ThunderX2 servers are installed in Arm lab, but the Intel NIC cannot be enumerated on ThunderX2-01.
- Cables for Intel NICs have been ordered.
- Universal rails will be tried with ThunderX2 servers. If they work, will send the rails to FD.io lab.
- Script/commands to verify the NICs are ready. Will try with the Mellanox NIC on ThunderX2-02 first - Lijian
- Confirm the power cable requirements for the Vexxhost lab. Inform them about the 2 servers coming to the lab - Juraj
- Current Configurations:
- RAM: 256G
- Disk: 480G SSD
- The boxes are coming with Qlogic cards which are not supported in VPP.
- Changes required to the servers:
- The power cable specifications will be 4 x 6ft 14 AWG C13 to C14 Power Cables.
- Need 2 Intel NICs XL710-QDA2 for each server.
- If there is space for 2 more cards, we should add 2 Mellanox cards with SR-IOV capability.
- Disk size to 480G
- Both the servers should have the NICs in the same PCIe slots. It does not matter which PCIe slots the cards are in.
- Cables: N1, P1 to N2, P1 and so on
- Cables for IPMI and Management port: 2
- Will call a meeting once the ThunderX2 reaches Dean, inviting Juraj/Vexxhost people in FD.io lab to make sure the hardware is ready.
- Require and install 1G NIC for the management port because of the 1G management switch. - Lijian
- Require two ThunderX2 servers in the FD.io lab (currently only one ThunderX2 is in the lab)
- Require one more ThunderX2 to form a normal cluster (1xThunderX + 2xThunderX2), to enable voting rights for Arm servers in CSIT
- OS: Ubuntu 18.04
- Server info in CSIT docs:
- https://gerrit.fd.io/r/gitweb?p=csit.git;a=blob;f=docs/lab/testbed_specifications.md;h=afa36ff56c7be09621e85bae6a1498aadf3a1981;hb=HEAD
- https://gerrit.fd.io/r/gitweb?p=csit.git;a=blob;f=docs/lab/testbed_specifications.md;h=afa36ff56c7be09621e85bae6a1498aadf3a1981;hb=HEAD#l495
- ThunderX1
- VPP
- Align Arm patches with VPP release plan.
- F0 2020-01-08 APIs frozen. Only low-risk changes accepted on main branch.
- RC1 2020-01-15 (F0+7) Code complete, pull first release throttle branch, only bug fixes accepted on throttle train. After pull: main branch reopens for new feature / risky commits. First artifacts posted.
- RC2 2020-01-22 (RC1+7) Second artifacts posted.
- Formal Release 2020-01-29 (RC2+7) 20.01 release artifacts available
- Vectorization
- https://gerrit.fd.io/r/c/vpp/+/22391 - vlib: vectorized buffer pointer to index with 128-bit SIMD
- https://gerrit.fd.io/r/c/vpp/+/22392 - ethernet: 128-bits vectorized next node selection
- https://gerrit.fd.io/r/c/vpp/+/20273 - vppinfra: vectorize eth_input_adv_and_flags_x4 with 128-bit SIMD
- MAP with VPP - error is resolved. Sort of working. Record the details.
- Usage of MAP is recorded in confluence
- Will update the patch to ignore the SIGPROF signal - Jieqiang
- Benchmarking AVF drivers on Arm servers - Jieqiang
- VPP+DPDK (5.18Mpps/5.21Mpps) VS VPP+AVF (8.39Mpps/8.38Mpps) on ThunderX2.
- Check whether the performance tests include the AVF driver.
- AVF failed to create AVF interface on SMP CPU on N1SDP/Qualcomm - Jieqiang
- Current N1SDP does not support SRIOV, so cannot run AVF on N1SDP.
- Will try one patch to enable N1SDP board.
- Please try AVF with Mcbin if possible.
- Investigate bihash operations, which are hot spots in L2 throughput
- Applied prefetches and dual loop to the l2_fwd node, but failed to apply them to l2_learn
- Lock-free allocation/free gives 7%-11% improvement on ThunderX2, but no improvement on x86 and Cortex-A72.
- Cache misses and CRC32 calculation are possible opportunities.
- To check cycles by applying CRC32 calculation unrolling
- Bench-mark VPP on Dawn N1SDP board
- Finished single-flow benchmarking with L2/L3/input-ACL on the N1SDP board; will share the data.
- Investigating bi-hash lockless implementation - Jason
- First apply make_working_copy for all bihash update operations, and then apply RCU to make lookup lock-less
- Internal CI is not working due to Python3.6 upgrade in vpp code repository. - Jieqiang
- EPIC for next quarter:
- Align Arm patches with VPP release plan.
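For reference, the AVF versus DPDK throughput figures quoted above for ThunderX2 can be turned into a relative gain. The sketch below only does arithmetic on the four numbers in the minutes; the minutes do not say what the two values in each pair stand for, so pairing them positionally is an assumption, and the helper name `relative_gain` is hypothetical.

```python
# Throughput figures in Mpps, copied from the minutes (ThunderX2).
dpdk_mpps = (5.18, 5.21)   # VPP+DPDK
avf_mpps = (8.39, 8.38)    # VPP+AVF


def relative_gain(baseline, candidate):
    """Return the gain of candidate over baseline, in percent."""
    return (candidate - baseline) / baseline * 100.0


# Assumption: the first DPDK value is compared with the first AVF value,
# and the second with the second.
gains = [relative_gain(d, a) for d, a in zip(dpdk_mpps, avf_mpps)]

for d, a, g in zip(dpdk_mpps, avf_mpps, gains):
    print(f"DPDK {d:.2f} Mpps -> AVF {a:.2f} Mpps: {g:.1f}% higher")
```

Under that pairing, AVF comes out roughly 61-62% higher than DPDK in both measurements.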
01/28/2020
- Attendees
- Govindarajan Mohandoss
- Honnappa Nagarahalli
- General
- CSIT
- VPP Performance Test
- VM-VHost test failing on 3n-tsh server.
- The VM-VHost test cases have never passed before as per the previous logs in Taishan server.
- Issue is not reproducible locally - VHost/Virtual Ethernet interface creation passes in Taishan server in local setup.
- Next Steps: Follow up with Peter Mikus to debug the issue in Taishan server in CSIT lab.
- Build a local test setup to run the Testpmd application in VM.
- Performance data on Arm in official release 19.08 is available.
- Performance job is merged. https://jenkins.fd.io/view/csit/job/csit-vpp-perf-mrr-daily-master-3n-tsh
- Trending data recorded with https://docs.fd.io/csit/master/trending/introduction/introduction.html
- Trending page: https://docs.fd.io/csit/master/trending/index.html
- The 'show pci' command causes a crash, which affects all performance tests on the Taishan server only.
- Issue is root-caused and the patch is in community review - https://gerrit.fd.io/r/c/vpp/+/23849
- 'show pci' is temporarily replaced with 'show ver'. Performance tests are now running fine.
- Will change the performance job from weekly to daily runs. - Juraj
- Patch to resolve the "show pci" crash issue is merged. Will ask the CSIT team to remove the workaround. - Lijian
- Have upgraded Python2 to Python3 successfully.
- Ask the CSIT community how to identify performance hold/stop issues as soon as possible - Juraj
- Will send an email to CSIT-dev on how to avoid similar issues.
- VPP Path
- Verifying VPP on Centos/Arm - Juraj
- After fixing a software bug, VPP boots normally with 16K/64K page sizes.
- Will investigate 4-5 test failures in 'make test' - Lijian
- Looks like it would be only build and unit test. VPP device and performance tests would run on the physical devices in the lab.
- cross compilation in VPP PATH. Jira(Juraj): https://jira.fd.io/browse/CTP-3
- Tried cross-compiling with DPDK only.
- Initial cross-compiling is working fine.
- Patch is under review. https://gerrit.fd.io/r/#/c/vpp/+/21035/
- There are issues building VPP distros.
- What is not finished is packaging. Currently we support Ubuntu's DEB packages for the aarch64 architecture (make PLATFORM=aarch64-generic pkg-deb); this is what I am currently trying to sort out ...
- We can put the cross-compiling knowledge into the 'for developers' section of the vpp-docs project.
- Doesn't work; seems to be caused by improper cross-building tools. https://wiki.fd.io/view/VPP/Build_System_Deep_Dive
- https://gerrit.fd.io/r/#/c/vpp/+/21035/, VPP path cross compilation
- setup proper platform specific folder names in DEB packages
- proper architecture string included in the DEB package name
- Loop Nitin, Sachin, Honnappa, Lijian in container cross-compilation discussion.
- VPP Device
- A total of 29 VPP device test cases were executed, and all passed on Arm servers.
- Currently VPP device is not executed per patch. Issue is still under investigation.
- VPP device job is running now and will be triggered per VPP patch and CSIT patch
- https://jenkins.fd.io/view/csit/job/csit-vpp-device-master-ubuntu1804-1n-tx2/
- https://jenkins.fd.io/view/vpp/job/vpp-csit-verify-device-master-1n-tx2/
- https://jenkins.fd.io/view/csit/job/csit-vpp-device-master-ubuntu1804-1n-tx2-semiweekly/
- https://jenkins.fd.io/view/csit/job/csit-vpp-device-master-ubuntu1804-1n-tx2-weekly/
- VPP Performance Test
- FD.io lab
- Two ThunderX2 servers are installed in Arm lab, but the Intel NIC cannot be enumerated on ThunderX2-01.
- Cables for Intel NICs have been ordered.
- Universal rails will be tried with ThunderX2 servers. If they work, will send the rails to FD.io lab.
- Script/commands to verify the NICs are ready. Will try with the Mellanox NIC on ThunderX2-02 first - Lijian
- Confirm the power cable requirements for the Vexxhost lab. Inform them about the 2 servers coming to the lab - Juraj
- Current Configurations:
- RAM: 256G
- Disk: 480G SSD
- The boxes are coming with Qlogic cards which are not supported in VPP.
- Changes required to the servers:
- The power cable specifications will be 4 x 6ft 14 AWG C13 to C14 Power Cables.
- Need 2 Intel NICs XL710-QDA2 for each server.
- If there is space for 2 more cards, we should add 2 Mellanox cards with SR-IOV capability.
- Disk size to 480G
- Both the servers should have the NICs in the same PCIe slots. It does not matter which PCIe slots the cards are in.
- Cables: N1, P1 to N2, P1 and so on
- Cables for IPMI and Management port: 2
- Will call a meeting once the ThunderX2 reaches Dean, inviting Juraj/Vexxhost people in FD.io lab to make sure the hardware is ready.
- Require and install 1G NIC for the management port because of the 1G management switch. - Lijian
- Require two ThunderX2 servers in the FD.io lab (currently only one ThunderX2 is in the lab)
- Require one more ThunderX2 to form a normal cluster (1xThunderX + 2xThunderX2), to enable voting rights for Arm servers in CSIT
- OS: Ubuntu 18.04
- Server info in CSIT docs:
- https://gerrit.fd.io/r/gitweb?p=csit.git;a=blob;f=docs/lab/testbed_specifications.md;h=afa36ff56c7be09621e85bae6a1498aadf3a1981;hb=HEAD
- https://gerrit.fd.io/r/gitweb?p=csit.git;a=blob;f=docs/lab/testbed_specifications.md;h=afa36ff56c7be09621e85bae6a1498aadf3a1981;hb=HEAD#l495
- ThunderX1
- VPP
- Align Arm patches with VPP release plan.
- F0 2020-01-08 APIs frozen. Only low-risk changes accepted on main branch.
- RC1 2020-01-15 (F0+7) Code complete, pull first release throttle branch, only bug fixes accepted on throttle train. After pull: main branch reopens for new feature / risky commits. First artifacts posted.
- RC2 2020-01-22 (RC1+7) Second artifacts posted.
- Formal Release 2020-01-29 (RC2+7) 20.01 release artifacts available
- Vectorization
- https://gerrit.fd.io/r/c/vpp/+/22391 - vlib: vectorized buffer pointer to index with 128-bit SIMD
- https://gerrit.fd.io/r/c/vpp/+/22392 - ethernet: 128-bits vectorized next node selection
- https://gerrit.fd.io/r/c/vpp/+/20273 - vppinfra: vectorize eth_input_adv_and_flags_x4 with 128-bit SIMD
- MAP with VPP - error is resolved. Sort of working. Record the details.
- Usage of MAP is recorded in confluence
- Will update the patch to ignore the SIGPROF signal - Jieqiang
- Benchmarking AVF drivers on Arm servers - Jieqiang
- VPP+DPDK (5.18Mpps/5.21Mpps) VS VPP+AVF (8.39Mpps/8.38Mpps) on ThunderX2.
- Check whether the performance tests include the AVF driver.
- AVF failed to create AVF interface on SMP CPU on N1SDP/Qualcomm - Jieqiang
- Current N1SDP does not support SRIOV, so cannot run AVF on N1SDP.
- Will try one patch to enable N1SDP board.
- Please try AVF with Mcbin if possible.
- Investigate bihash operations, which are hot spots in L2 throughput
- Applied prefetches and dual loop to the l2_fwd node, but failed to apply them to l2_learn
- Lock-free allocation/free gives 7%-11% improvement on ThunderX2, but no improvement on x86 and Cortex-A72.
- Cache misses and CRC32 calculation are possible opportunities.
- To check cycles by applying CRC32 calculation unrolling
- Bench-mark VPP on Dawn N1SDP board
- Finished single-flow benchmarking with L2/L3/input-ACL on the N1SDP board; will share the data.
- Investigating bi-hash lockless implementation - Jason
- First apply make_working_copy for all bihash update operations, and then apply RCU to make lookup lock-less
- Internal CI is not working due to Python3.6 upgrade in vpp code repository. - Jieqiang
- EPIC for next quarter:
- Align Arm patches with VPP release plan.
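The release-plan milestones listed above are each seven days apart (F0+7, RC1+7, RC2+7). A minimal sketch of that date arithmetic follows; the F0 date and the offsets come from the minutes, while the dictionary layout and variable names are purely illustrative.

```python
from datetime import date, timedelta

# F0 (API freeze) date as given in the release plan.
f0 = date(2020, 1, 8)
step = timedelta(days=7)   # each later milestone is the previous one + 7 days

milestones = {
    "F0": f0,
    "RC1": f0 + step,          # F0+7
    "RC2": f0 + 2 * step,      # RC1+7
    "Release": f0 + 3 * step,  # RC2+7
}

for name, day in milestones.items():
    print(f"{name}: {day.isoformat()}")
```

The computed dates match the ones recorded in the plan, so Arm patches that are not low-risk would need to land before the F0 date to be included in 20.01.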
01/21/2020
- Attendees
- Tina Tsou
- Govindarajan Mohandoss
- Honnappa Nagarahalli
- Michaela Tahiri
- General
- CSIT
- VPP Performance Test
- VM-VHost test failing on 3n-tsh server.
- Performance data on Arm in official release 19.08 is available.
- Performance job is merged. https://jenkins.fd.io/view/csit/job/csit-vpp-perf-mrr-daily-master-3n-tsh
- Trending data recorded with https://docs.fd.io/csit/master/trending/introduction/introduction.html
- Trending page: https://docs.fd.io/csit/master/trending/index.html
- The 'show pci' command causes a crash, which affects all performance tests on the Taishan server only.
- Issue is root-caused and the patch is in community review - https://gerrit.fd.io/r/c/vpp/+/23849
- 'show pci' is temporarily replaced with 'show ver'. Performance tests are now running fine.
- Will change the performance job from weekly to daily runs. - Juraj
- Patch to resolve the "show pci" crash issue is merged. Will ask the CSIT team to remove the workaround. - Lijian
- Have upgraded Python2 to Python3 successfully.
- Ask the CSIT community how to identify performance hold/stop issues as soon as possible - Juraj
- Will send an email to CSIT-dev on how to avoid similar issues.
- VPP Path
- Verifying VPP on Centos/Arm - Juraj
- After fixing a software bug, VPP boots normally with 16K/64K page sizes.
- Will investigate 4-5 test failures in 'make test' - Lijian
- Looks like it would be only build and unit test. VPP device and performance tests would run on the physical devices in the lab.
- cross compilation in VPP PATH. Jira(Juraj): https://jira.fd.io/browse/CTP-3
- Tried cross-compiling with DPDK only.
- Initial cross-compiling is working fine.
- Patch is under review. https://gerrit.fd.io/r/#/c/vpp/+/21035/
- There are issues building VPP distros.
- What is not finished is packaging. Currently we support Ubuntu's DEB packages for the aarch64 architecture (make PLATFORM=aarch64-generic pkg-deb); this is what I am currently trying to sort out ...
- We can put the cross-compiling knowledge into the 'for developers' section of the vpp-docs project.
- Doesn't work; seems to be caused by improper cross-building tools. https://wiki.fd.io/view/VPP/Build_System_Deep_Dive
- https://gerrit.fd.io/r/#/c/vpp/+/21035/, VPP path cross compilation
- setup proper platform specific folder names in DEB packages
- proper architecture string included in the DEB package name
- Loop Nitin, Sachin, Honnappa, Lijian in container cross-compilation discussion.
- VPP Device
- A total of 29 VPP device test cases were executed, and all passed on Arm servers.
- Currently VPP device is not executed per patch. Issue is still under investigation.
- VPP device job is running now and will be triggered per VPP patch and CSIT patch
- https://jenkins.fd.io/view/csit/job/csit-vpp-device-master-ubuntu1804-1n-tx2/
- https://jenkins.fd.io/view/vpp/job/vpp-csit-verify-device-master-1n-tx2/
- https://jenkins.fd.io/view/csit/job/csit-vpp-device-master-ubuntu1804-1n-tx2-semiweekly/
- https://jenkins.fd.io/view/csit/job/csit-vpp-device-master-ubuntu1804-1n-tx2-weekly/
- VPP Performance Test
- FD.io lab
- Two ThunderX2 servers are installed in Arm lab, but the Intel NIC cannot be enumerated on ThunderX2-01.
- Cables for Intel NICs have been ordered.
- Universal rails will be tried with ThunderX2 servers. If they work, will send the rails to FD.io lab.
- Script/commands to verify the NICs are ready. Will try with the Mellanox NIC on ThunderX2-02 first - Lijian
- Confirm the power cable requirements for the Vexxhost lab. Inform them about the 2 servers coming to the lab - Juraj
- Current Configurations:
- RAM: 256G
- Disk: 480G SSD
- The boxes are coming with Qlogic cards which are not supported in VPP.
- Changes required to the servers:
- The power cable specifications will be 4 x 6ft 14 AWG C13 to C14 Power Cables.
- Need 2 Intel NICs XL710-QDA2 for each server.
- If there is space for 2 more cards, we should add 2 Mellanox cards with SR-IOV capability.
- Disk size to 480G
- Both the servers should have the NICs in the same PCIe slots. It does not matter which PCIe slots the cards are in.
- Cables: N1, P1 to N2, P1 and so on
- Cables for IPMI and Management port: 2
- Will call a meeting once the ThunderX2 reaches Dean, inviting Juraj/Vexxhost people in FD.io lab to make sure the hardware is ready.
- Require and install 1G NIC for the management port because of the 1G management switch. - Lijian
- Require two ThunderX2 servers in the FD.io lab (currently only one ThunderX2 is in the lab)
- Require one more ThunderX2 to form a normal cluster (1xThunderX + 2xThunderX2), to enable voting rights for Arm servers in CSIT
- OS: Ubuntu 18.04
- Server info in CSIT docs:
- https://gerrit.fd.io/r/gitweb?p=csit.git;a=blob;f=docs/lab/testbed_specifications.md;h=afa36ff56c7be09621e85bae6a1498aadf3a1981;hb=HEAD
- https://gerrit.fd.io/r/gitweb?p=csit.git;a=blob;f=docs/lab/testbed_specifications.md;h=afa36ff56c7be09621e85bae6a1498aadf3a1981;hb=HEAD#l495
- ThunderX1
- VPP
- Align Arm patches with VPP release plan.
- F0 2020-01-08 APIs frozen. Only low-risk changes accepted on main branch.
- RC1 2020-01-15 (F0+7) Code complete, pull first release throttle branch, only bug fixes accepted on throttle train. After pull: main branch reopens for new feature / risky commits. First artifacts posted.
- RC2 2020-01-22 (RC1+7) Second artifacts posted.
- Formal Release 2020-01-29 (RC2+7) 20.01 release artifacts available
- Vectorization
- https://gerrit.fd.io/r/c/vpp/+/22391 - vlib: vectorized buffer pointer to index with 128-bit SIMD
- https://gerrit.fd.io/r/c/vpp/+/22392 - ethernet: 128-bits vectorized next node selection
- https://gerrit.fd.io/r/c/vpp/+/20273 - vppinfra: vectorize eth_input_adv_and_flags_x4 with 128-bit SIMD
- MAP with VPP - error is resolved. Sort of working. Record the details.
- Usage of MAP is recorded in confluence
- Will update the patch to ignore the SIGPROF signal - Jieqiang
- Benchmarking AVF drivers on Arm servers - Jieqiang
- VPP+DPDK (5.18Mpps/5.21Mpps) VS VPP+AVF (8.39Mpps/8.38Mpps) on ThunderX2.
- Check whether the performance tests include the AVF driver.
- AVF failed to create AVF interface on SMP CPU on N1SDP/Qualcomm - Jieqiang
- Current N1SDP does not support SRIOV, so cannot run AVF on N1SDP.
- Will try one patch to enable N1SDP board.
- Please try AVF with Mcbin if possible.
- Investigate bihash operations, which are hot spots in L2 throughput
- Applied prefetches and dual loop to the l2_fwd node, but failed to apply them to l2_learn
- Lock-free allocation/free gives 7%-11% improvement on ThunderX2, but no improvement on x86 and Cortex-A72.
- Cache misses and CRC32 calculation are possible opportunities.
- To check cycles by applying CRC32 calculation unrolling
- Bench-mark VPP on Dawn N1SDP board
- Finished single-flow benchmarking with L2/L3/input-ACL on the N1SDP board; will share the data.
- Investigating bi-hash lockless implementation - Jason
- First apply make_working_copy for all bihash update operations, and then apply RCU to make lookup lock-less
- Internal CI is not working due to Python3.6 upgrade in vpp code repository. - Jieqiang
- EPIC for next quarter:
- Align Arm patches with VPP release plan.
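The lockless bihash idea noted above (update a working copy, then publish it so that lookups need no lock) can be illustrated with a small copy-on-write table. This is only a conceptual sketch in Python, not VPP's bihash code: the class and method names are hypothetical, a plain dict stands in for the hash table, and Python's garbage collector stands in for the RCU grace period that a C implementation would need before freeing the old copy.

```python
import threading


class CopyOnWriteTable:
    """Readers take no lock; writers copy, modify, then publish."""

    def __init__(self):
        self._table = {}                      # currently published snapshot
        self._writer_lock = threading.Lock()  # serializes writers only

    def lookup(self, key, default=None):
        # Read the published snapshot once. A snapshot is never mutated
        # after publication, so no lock is needed here.
        snapshot = self._table
        return snapshot.get(key, default)

    def update(self, key, value):
        with self._writer_lock:
            working_copy = dict(self._table)  # private copy for the writer
            working_copy[key] = value
            self._table = working_copy        # publish by swapping the reference

    def delete(self, key):
        with self._writer_lock:
            working_copy = dict(self._table)
            working_copy.pop(key, None)
            self._table = working_copy


table = CopyOnWriteTable()
table.update("mac-a", 1)
old_snapshot = table._table   # what a concurrent reader might still hold
table.update("mac-b", 2)
print(table.lookup("mac-b"), "mac-b" in old_snapshot)
```

A reader that picked up the old snapshot keeps seeing a consistent table while the writer publishes a new one, which is the property the working-copy-plus-RCU approach is after. Copying the whole table on every update is a simplification chosen for brevity.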
01/14/2020
- Attendees
- Tina Tsou
- Honnappa Nagarahalli
- Lijian Zhang
- Jieqiang Wang
- Jason Zhang
- Juraj Linkes
- Christian Hopps
- Dean Arnold
- Michaela Tahiri
- General
- CSIT
- VPP Performance Test
- Performance data on Arm in official release 19.08 is available.
- Performance job is merged. https://jenkins.fd.io/view/csit/job/csit-vpp-perf-mrr-daily-master-3n-tsh
- Trending data recorded with https://docs.fd.io/csit/master/trending/introduction/introduction.html
- Trending page: https://docs.fd.io/csit/master/trending/index.html
- The 'show pci' command causes a crash, which affects all performance tests on the Taishan server only.
- Issue is root-caused and the patch is in community review - https://gerrit.fd.io/r/c/vpp/+/23849
- 'show pci' is temporarily replaced with 'show ver'. Performance tests are now running fine.
- Will change the performance job from weekly to daily runs. - Juraj
- Patch to resolve the "show pci" crash issue is merged. Will ask the CSIT team to remove the workaround. - Lijian
- Have upgraded Python2 to Python3 successfully.
- Ask the CSIT community how to identify performance hold/stop issues as soon as possible - Juraj
- Will send an email to CSIT-dev on how to avoid similar issues.
- VPP Path
- Verifying VPP on Centos/Arm - Juraj
- After fixing a software bug, VPP boots normally with 16K/64K page sizes.
- Will investigate 4-5 test failures in 'make test' - Lijian
- Looks like it would be only build and unit test. VPP device and performance tests would run on the physical devices in the lab.
- cross compilation in VPP PATH. Jira(Juraj): https://jira.fd.io/browse/CTP-3
- Tried cross-compiling with DPDK only.
- Initial cross-compiling is working fine.
- Patch is under review. https://gerrit.fd.io/r/#/c/vpp/+/21035/
- There are issues building VPP distros.
- What is not finished is packaging. Currently we support Ubuntu's DEB packages for the aarch64 architecture (make PLATFORM=aarch64-generic pkg-deb); this is what I am currently trying to sort out ...
- We can put the cross-compiling knowledge into the 'for developers' section of the vpp-docs project.
- Doesn't work; seems to be caused by improper cross-building tools. https://wiki.fd.io/view/VPP/Build_System_Deep_Dive
- https://gerrit.fd.io/r/#/c/vpp/+/21035/, VPP path cross compilation
- setup proper platform specific folder names in DEB packages
- proper architecture string included in the DEB package name
- Loop Nitin, Sachin, Honnappa, Lijian in container cross-compilation discussion.
- VPP Device
- A total of 29 VPP device test cases were executed, and all passed on Arm servers.
- Currently VPP device is not executed per patch. Issue is still under investigation.
- VPP device job is running now and will be triggered per VPP patch and CSIT patch
- https://jenkins.fd.io/view/csit/job/csit-vpp-device-master-ubuntu1804-1n-tx2/
- https://jenkins.fd.io/view/vpp/job/vpp-csit-verify-device-master-1n-tx2/
- https://jenkins.fd.io/view/csit/job/csit-vpp-device-master-ubuntu1804-1n-tx2-semiweekly/
- https://jenkins.fd.io/view/csit/job/csit-vpp-device-master-ubuntu1804-1n-tx2-weekly/
- VPP Performance Test
- FD.io lab
- Two ThunderX2 servers are installed in Arm lab, but the Intel NIC cannot be enumerated on ThunderX2-01.
- Cables for Intel NICs have been ordered.
- Universal rails will be tried with ThunderX2 servers. If they work, will send the rails to FD.io lab.
- Script/commands to verify the NICs are ready. Will try with the Mellanox NIC on ThunderX2-02 first - Lijian
- Confirm the power cable requirements for the Vexxhost lab. Inform them about the 2 servers coming to the lab - Juraj
- Current Configurations:
- RAM: 256G
- Disk: 480G SSD
- The boxes are coming with Qlogic cards which are not supported in VPP.
- Changes required to the servers:
- The power cable specifications will be 4 x 6ft 14 AWG C13 to C14 Power Cables.
- Need 2 Intel NICs XL710-QDA2 for each server.
- If there is space for 2 more cards, we should add 2 Mellanox cards with SR-IOV capability.
- Disk size to 480G
- Both the servers should have the NICs in the same PCIe slots. It does not matter which PCIe slots the cards are in.
- Cables: N1, P1 to N2, P1 and so on
- Cables for IPMI and Management port: 2
- Will call a meeting once the ThunderX2 reaches Dean, inviting Juraj/Vexxhost people in FD.io lab to make sure the hardware is ready.
- Require and install 1G NIC for the management port because of the 1G management switch. - Lijian
- Require two ThunderX2 servers in the FD.io lab (currently only one ThunderX2 is in the lab)
- Require one more ThunderX2 to form a normal cluster (1xThunderX + 2xThunderX2), to enable voting rights for Arm servers in CSIT
- OS: Ubuntu 18.04
- Server info in CSIT docs:
- https://gerrit.fd.io/r/gitweb?p=csit.git;a=blob;f=docs/lab/testbed_specifications.md;h=afa36ff56c7be09621e85bae6a1498aadf3a1981;hb=HEAD
- https://gerrit.fd.io/r/gitweb?p=csit.git;a=blob;f=docs/lab/testbed_specifications.md;h=afa36ff56c7be09621e85bae6a1498aadf3a1981;hb=HEAD#l495
- ThunderX1
- VPP
- Align Arm patches with VPP release plan.
- F0 2020-01-08 APIs frozen. Only low-risk changes accepted on main branch.
- RC1 2020-01-15 (F0+7) Code complete, pull first release throttle branch, only bug fixes accepted on throttle train. After pull: main branch reopens for new feature / risky commits. First artifacts posted.
- RC2 2020-01-22 (RC1+7) Second artifacts posted.
- Formal Release 2020-01-29 (RC2+7) 20.01 release artifacts available
- Vectorization
- https://gerrit.fd.io/r/c/vpp/+/22391 - vlib: vectorized buffer pointer to index with 128-bit SIMD
- https://gerrit.fd.io/r/c/vpp/+/22392 - ethernet: 128-bits vectorized next node selection
- https://gerrit.fd.io/r/c/vpp/+/20273 - vppinfra: vectorize eth_input_adv_and_flags_x4 with 128-bit SIMD
- MAP with VPP - error is resolved. Sort of working. Record the details.
- Usage of MAP is recorded in confluence
- Will update the patch to ignore the SIGPROF signal - Jieqiang
- Benchmarking AVF drivers on Arm servers - Jieqiang
- VPP+DPDK (5.18Mpps/5.21Mpps) VS VPP+AVF (8.39Mpps/8.38Mpps) on ThunderX2.
- Check whether the performance tests include the AVF driver.
- AVF failed to create AVF interface on SMP CPU on N1SDP/Qualcomm - Jieqiang
- Current N1SDP does not support SRIOV, so cannot run AVF on N1SDP.
- Will try one patch to enable N1SDP board.
- Please try AVF with Mcbin if possible.
- Investigate bihash operations, which are hot spots in L2 throughput
- Applied prefetches and dual loop to the l2_fwd node, but failed to apply them to l2_learn
- Lock-free allocation/free gives 7%-11% improvement on ThunderX2, but no improvement on x86 and Cortex-A72.
- Cache misses and CRC32 calculation are possible opportunities.
- To check cycles by applying CRC32 calculation unrolling
- Bench-mark VPP on Dawn N1SDP board
- Finished single-flow benchmarking with L2/L3/input-ACL on the N1SDP board; will share the data.
- Investigating bi-hash lockless implementation - Jason
- First apply make_working_copy for all bihash update operations, and then apply RCU to make lookup lock-less
- Internal CI is not working due to Python3.6 upgrade in vpp code repository. - Jieqiang
- EPIC for next quarter:
- Align Arm patches with VPP release plan.
01/07/2020
- Attendees
- Tina Tsou
- Honnappa Nagarahalli
- Lijian Zhang
- Jieqiang Wang
- Jason Zhang
- Juraj Linkes
- Christian Hopps
- Dean Arnold
- Michaela Tahiri
- General
- CSIT
- VPP Performance Test
- Performance data on Arm in official release 19.08 is available.
- Performance job is merged. https://jenkins.fd.io/view/csit/job/csit-vpp-perf-mrr-daily-master-3n-tsh
- Trending data recorded with https://docs.fd.io/csit/master/trending/introduction/introduction.html
- Trending page: https://docs.fd.io/csit/master/trending/index.html
- The 'show pci' command causes a crash, which affects all performance tests on the Taishan server only.
- Issue is root-caused and the patch is in community review - https://gerrit.fd.io/r/c/vpp/+/23849
- 'show pci' is temporarily replaced with 'show ver'. Performance tests are now running fine.
- Will change the performance job from weekly to daily runs. - Juraj
- Patch to resolve the "show pci" crash issue is merged. Will ask the CSIT team to remove the workaround. - Lijian
- Have upgraded Python2 to Python3 successfully.
- Ask the CSIT community how to identify performance hold/stop issues as soon as possible - Juraj
- Will send an email to CSIT-dev on how to avoid similar issues.
- VPP Path
- Verifying VPP on Centos/Arm - Juraj
- Trying to update kernel to 64K page size on CentOS - No update - Lijian
- VPP can boot up normally with 16K/64K page size. Will investigate 4-5 test failures in 'make test' - Lijian
- Will try with CentOS 8 which seems to be working fine with 64K page size.
- Looks like it would be only build and unit test. VPP device and performance tests would run on the physical devices in the lab.
- cross compilation in VPP PATH. Jira(Juraj): https://jira.fd.io/browse/CTP-3
- Tried cross-compiling with DPDK only.
- Initial cross-compiling is working fine.
- Patch is under review. https://gerrit.fd.io/r/#/c/vpp/+/21035/
- There are issues building VPP distro packages.
- What is not finished is packaging. Currently we support Ubuntu's DEB packages for the aarch64 architecture (make PLATFORM=aarch64-generic pkg-deb); this is what I am currently trying to sort out ...
- We can put the cross-compiling knowledge into the 'for developers' section of the vpp-docs project.
- Doesn't work; seems to be caused by improper cross-building tools. https://wiki.fd.io/view/VPP/Build_System_Deep_Dive
- https://gerrit.fd.io/r/#/c/vpp/+/21035/, VPP path cross compilation
- setup proper platform specific folder names in DEB packages
- proper architecture string included in the DEB package name
- Loop Nitin, Sachin, Honnappa, Lijian in container cross-compilation discussion.
- VPP Device
- A total of 29 VPP device test cases were executed, and all passed on Arm servers.
- Currently VPP device is not executed per patch. Issue is still under investigation.
- VPP device job is running now and will be triggered per VPP patch and CSIT patch
- https://jenkins.fd.io/view/csit/job/csit-vpp-device-master-ubuntu1804-1n-tx2/
- https://jenkins.fd.io/view/vpp/job/vpp-csit-verify-device-master-1n-tx2/
- https://jenkins.fd.io/view/csit/job/csit-vpp-device-master-ubuntu1804-1n-tx2-semiweekly/
- https://jenkins.fd.io/view/csit/job/csit-vpp-device-master-ubuntu1804-1n-tx2-weekly/
- VPP Performance Test
- FD.io lab
- Two ThunderX2 servers are installed in the Arm lab, but the Intel NIC cannot be enumerated on ThunderX2-01.
- Cables for Intel NICs have been ordered.
- Universal rails will be tried with the ThunderX2 servers. If they work, will send the rails to the FD.io lab.
- Script/commands to verify the NICs are ready. Will try with the Mellanox NIC on ThunderX2-02 first. - Lijian
- Confirm the power cable requirements for the Vexxhost lab, and inform them about the 2 servers coming to the lab. - Juraj
- Current Configurations:
- RAM: 256G
- Disk: 480G SSD
- The boxes are coming with Qlogic cards which are not supported in VPP.
- Changes required to the servers:
- The power cable specifications will be 4 x 6ft 14 AWG C13 to C14 Power Cables.
- Need 2 Intel NICs XL710-QDA2 for each server.
- If there is space for 2 more cards, we should add 2 Mellanox cards with SR-IOV capability.
- Disk size to 480G
- Both the servers should have the NICs in the same PCIe slots. It does not matter which PCIe slots the cards are in.
- Cables: N1, P1 to N2, P1 and so on
- Cables for IPMI and Management port: 2
- Will call a meeting once the ThunderX2 reaches Dean, inviting Juraj and the Vexxhost people in the FD.io lab to make sure the hardware is ready.
- Require and install a 1G NIC for the management port because of the 1G management switch. - Lijian
- Require two ThunderX2 servers in the FD.io lab (currently there is only one).
- Require one more ThunderX2 to form a normal cluster (1xThunderX + 2xThunderX2), to enable voting right for Arm servers in CSIT
- OS: Ubuntu 18.04
- Server info in CSIT docs:
- https://gerrit.fd.io/r/gitweb?p=csit.git;a=blob;f=docs/lab/testbed_specifications.md;h=afa36ff56c7be09621e85bae6a1498aadf3a1981;hb=HEAD
- https://gerrit.fd.io/r/gitweb?p=csit.git;a=blob;f=docs/lab/testbed_specifications.md;h=afa36ff56c7be09621e85bae6a1498aadf3a1981;hb=HEAD#l495
- ThunderX1
- Two ThunderX2 servers are installed in the Arm lab, but the Intel NIC cannot be enumerated on ThunderX2-01.
- VPP
- Align Arm patches with VPP release plan.
- F0 2020-01-08 APIs frozen. Only low-risk changes accepted on main branch.
- RC1 2020-01-15 (F0+7) Code complete, pull first release throttle branch, only bug fixes accepted on throttle train. After pull: main branch reopens for new feature / risky commits. First artifacts posted.
- RC2 2020-01-22 (RC1+7) Second artifacts posted.
- Formal Release 2020-01-29 (RC2+7) 20.01 release artifacts available
- Vectorization
- https://gerrit.fd.io/r/c/vpp/+/22391 - vlib: vectorized buffer pointer to index with 128-bit SIMD
- https://gerrit.fd.io/r/c/vpp/+/22392 - ethernet: 128-bits vectorized next node selection
- https://gerrit.fd.io/r/c/vpp/+/20273 - vppinfra: vectorize eth_input_adv_and_flags_x4 with 128-bit SIMD
- MAP with VPP - error is resolved. Sort of working. Record the details.
- Usage of MAP is recorded in confluence
- Will update the patch to ignore the SIGPROF signal - Jieqiang
- Benchmarking AVF drivers on Arm servers - Jieqiang
- VPP+DPDK (5.18Mpps/5.21Mpps) VS VPP+AVF (8.39Mpps/8.38Mpps) on ThunderX2.
- Check whether the performance tests include the AVF driver.
- Failed to create AVF interface on SMP CPU on N1SDP/Qualcomm - Jieqiang
- Investigate bihash operations as hot spots in L2 throughput
- Apply prefetches and dual loop with l2_fwd node, and failed to l2_learn
- Lock-free allocation/free gives a 7%-11% improvement on ThunderX2, but no improvement on x86 or Cortex-A72.
- Cache misses and CRC32 calculation are possible opportunities.
- To check cycles by applying CRC32 calculation unrolling
- Benchmark VPP on Dawn N1SDP board
- Finished single-flow benchmarking with L2/L3/input-ACL on the N1SDP board; will share the data.
- Investigating bihash lockless implementation - Jason
- First apply make_working_copy for all bihash update operations, then apply RCU to make lookups lock-free
- EPIC for next quarter:
- Align Arm patches with VPP release plan.
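For the AVF benchmarking item in this entry, the relative gain of VPP+AVF over VPP+DPDK on ThunderX2 can be worked out from the quoted figures. This is a minimal sketch: pairing the first DPDK figure with the first AVF figure (and the second with the second) is an assumption, since the minutes do not say what distinguishes the two values in each pair, and the function name is illustrative only.

```python
# Throughput figures (Mpps) quoted in the minutes for ThunderX2.
# Assumption: values are paired positionally (first with first, second with second).
dpdk_mpps = (5.18, 5.21)
avf_mpps = (8.39, 8.38)


def relative_gain(baseline, candidate):
    """Return the fractional gain of candidate over baseline."""
    return (candidate - baseline) / baseline


gains = [relative_gain(d, a) for d, a in zip(dpdk_mpps, avf_mpps)]

for d, a, g in zip(dpdk_mpps, avf_mpps, gains):
    print(f"DPDK {d:.2f} Mpps -> AVF {a:.2f} Mpps: {g:+.1%}")
```

Under that pairing, VPP+AVF comes out a little over 60% ahead of VPP+DPDK in both cases.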
12/17/2019
- Attendees
- Tina Tsou
- Honnappa Nagarahalli
- Lijian Zhang
- Jieqiang Wang
- Jason Zhang
- Juraj Linkes
- Christian Hopps
- Dean Arnold
- Michaela Tahiri
- Xiaoming Jiang
- General
- CSIT
- VPP Performance Test
- Performance data on Arm in official release 19.08 is available.
- Performance job is merged. https://jenkins.fd.io/view/csit/job/csit-vpp-perf-mrr-daily-master-3n-tsh
- Trending data recorded with https://docs.fd.io/csit/master/trending/introduction/introduction.html
- Trending page: https://docs.fd.io/csit/master/trending/index.html
- The 'show pci' command causes a crash, which affects all performance tests on Taishan servers only.
- Issue is root-caused and the patch is in community review - https://gerrit.fd.io/r/c/vpp/+/23849
- 'show pci' is replaced with 'show ver' temporarily. Now performance test is running fine.
- Will change the performance job from weekly to daily runs. - Juraj
- Have upgraded Python2 to Python3 successfully.
- Ask CSIT community how to identify performance hold/stop issue asap - Juraj
- VPP Path
- Verifying VPP on CentOS/Arm - Juraj
- Trying to update kernel to 64K page size on CentOS - No update - Lijian
- Will try with CentOS 8 which seems to be working fine with 64K page size.
- Looks like it would be only build and unit test. VPP device and performance tests would run on the physical devices in the lab.
- cross compilation in VPP PATH. Jira(Juraj): https://jira.fd.io/browse/CTP-3
- Tried cross-compiling with DPDK only.
- Initial cross-compiling is working fine.
- Patch is under review. https://gerrit.fd.io/r/#/c/vpp/+/21035/
- There are issues building VPP distro packages.
- What is not finished is packaging. Currently we support Ubuntu's DEB packages for the aarch64 architecture (make PLATFORM=aarch64-generic pkg-deb); this is what I am currently trying to sort out ...
- We can put the cross-compiling knowledge into the 'for developers' section of the vpp-docs project.
- Doesn't work; seems to be caused by improper cross-building tools. https://wiki.fd.io/view/VPP/Build_System_Deep_Dive
- https://gerrit.fd.io/r/#/c/vpp/+/21035/, VPP path cross compilation
- setup proper platform specific folder names in DEB packages
- proper architecture string included in the DEB package name
- Loop Nitin, Sachin, Honnappa, Lijian in container cross-compilation discussion.
- VPP Device
- A total of 29 VPP device test cases were executed, and all passed on Arm servers.
- Currently VPP device is not executed per patch. Issue is still under investigation.
- VPP device job is running now and will be triggered per VPP patch and CSIT patch
- https://jenkins.fd.io/view/csit/job/csit-vpp-device-master-ubuntu1804-1n-tx2/
- https://jenkins.fd.io/view/vpp/job/vpp-csit-verify-device-master-1n-tx2/
- https://jenkins.fd.io/view/csit/job/csit-vpp-device-master-ubuntu1804-1n-tx2-semiweekly/
- https://jenkins.fd.io/view/csit/job/csit-vpp-device-master-ubuntu1804-1n-tx2-weekly/
- VPP Performance Test
- FD.io lab
- Two ThunderX2 servers are installed in the Arm lab, but the Intel NIC cannot be enumerated on ThunderX2-01.
- Cables for Intel NICs have been ordered.
- Universal rails will be tried with the ThunderX2 servers. If they work, will send the rails to the FD.io lab.
- Script/commands to verify the NICs are ready. Will try with the Mellanox NIC on ThunderX2-02 first. - Lijian
- Confirm the power cable requirements for the Vexxhost lab, and inform them about the 2 servers coming to the lab. - Juraj
- Current Configurations:
- RAM: 256G
- Disk: 480G SSD
- The boxes are coming with Qlogic cards which are not supported in VPP.
- Changes required to the servers:
- The power cable specifications will be 4 x 6ft 14 AWG C13 to C14 Power Cables.
- Need 2 Intel NICs XL710-QDA2 for each server.
- If there is space for 2 more cards, we should add 2 Mellanox cards with SR-IOV capability.
- Disk size to 480G
- Both the servers should have the NICs in the same PCIe slots. It does not matter which PCIe slots the cards are in.
- Cables: N1, P1 to N2, P1 and so on
- Cables for IPMI and Management port: 2
- Will call a meeting once the ThunderX2 reaches Dean, inviting Juraj and the Vexxhost people in the FD.io lab to make sure the hardware is ready.
- Require and install a 1G NIC for the management port because of the 1G management switch. - Lijian
- Require two ThunderX2 servers in the FD.io lab (currently there is only one).
- Require one more ThunderX2 to form a normal cluster (1xThunderX + 2xThunderX2), to enable voting right for Arm servers in CSIT
- OS: Ubuntu 18.04
- Server info in CSIT docs:
- https://gerrit.fd.io/r/gitweb?p=csit.git;a=blob;f=docs/lab/testbed_specifications.md;h=afa36ff56c7be09621e85bae6a1498aadf3a1981;hb=HEAD
- https://gerrit.fd.io/r/gitweb?p=csit.git;a=blob;f=docs/lab/testbed_specifications.md;h=afa36ff56c7be09621e85bae6a1498aadf3a1981;hb=HEAD#l495
- ThunderX1
- Two ThunderX2 servers are installed in the Arm lab, but the Intel NIC cannot be enumerated on ThunderX2-01.
- VPP
- Align Arm patches with VPP release plan.
- F0 2020-01-08 APIs frozen. Only low-risk changes accepted on main branch.
- RC1 2020-01-15 (F0+7) Code complete, pull first release throttle branch, only bug fixes accepted on throttle train. After pull: main branch reopens for new feature / risky commits. First artifacts posted.
- RC2 2020-01-22 (RC1+7) Second artifacts posted.
- Formal Release 2020-01-29 (RC2+7) 20.01 release artifacts available
- Vectorization
- https://gerrit.fd.io/r/c/vpp/+/22391 - vlib: vectorized buffer pointer to index with 128-bit SIMD
- https://gerrit.fd.io/r/c/vpp/+/22392 - ethernet: 128-bits vectorized next node selection
- https://gerrit.fd.io/r/c/vpp/+/20273 - vppinfra: vectorize eth_input_adv_and_flags_x4 with 128-bit SIMD
- MAP with VPP - error is resolved. Sort of working. Record the details.
- Usage of MAP is recorded in confluence
- Benchmarking AVF drivers on Arm servers - Jieqiang
- Add max-size parameter to pmalloc module. - Jieqiang
- Patches are upstreamed, but not reviewed yet.
- Investigate bihash operations as hot spots in L2 throughput
- Apply prefetches and dual loop with l2_fwd node, and failed to l2_learn
- Lock-free allocation/free gives a 7%-11% improvement on ThunderX2, but no improvement on x86 or Cortex-A72.
- Cache misses and CRC32 calculation are possible opportunities.
- To check cycles by applying CRC32 calculation unrolling
- Benchmark VPP on Dawn N1SDP board
- Use rte_mbuf_sanity_check to check metadata.
- Investigating bihash lockless implementation - Jason
- First apply make_working_copy for all bihash update operations, then apply RCU to make lookups lock-free
- EPIC for next quarter:
- Align Arm patches with VPP release plan.
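The 20.01 release milestones listed in this entry are each spaced 7 days apart, starting from F0. As a small worked example (the variable names are illustrative only), the later dates can be derived from the F0 date:

```python
from datetime import date, timedelta

# F0 date as given in the release plan; each later milestone is +7 days.
f0 = date(2020, 1, 8)
step = timedelta(days=7)

milestones = {
    "F0": f0,
    "RC1": f0 + step,           # F0+7
    "RC2": f0 + 2 * step,       # RC1+7
    "Release": f0 + 3 * step,   # RC2+7
}

for name, day in milestones.items():
    print(f"{name}: {day.isoformat()}")
```

The computed dates agree with the ones recorded in the plan (2020-01-15, 2020-01-22 and 2020-01-29).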
12/10/2019
- Attendees
- Tina Tsou
- Honnappa Nagarahalli
- Lijian Zhang
- Jieqiang Wang
- Jason Zhang
- Juraj Linkes
- Christian Hopps
- Dean Arnold
- Michaela Tahiri
- Xiaoming Jiang
- General
- CSIT
- VPP Performance Test
- Performance data on Arm in official release 19.08 is available.
- Performance job is merged. https://jenkins.fd.io/view/csit/job/csit-vpp-perf-mrr-daily-master-3n-tsh
- Trending data recorded with https://docs.fd.io/csit/master/trending/introduction/introduction.html
- Trending page: https://docs.fd.io/csit/master/trending/index.html
- The 'show pci' command causes a crash, which affects all performance tests on Taishan servers only.
- Issue is root-caused and the patch is in community review - https://gerrit.fd.io/r/c/vpp/+/23849
- 'show pci' is replaced with 'show ver' temporarily. Now performance test is running fine.
- Will change the performance job from weekly to daily runs. - Juraj
- Have upgraded Python2 to Python3 successfully.
- VPP Path
- Verifying VPP on CentOS/Arm - Juraj
- Trying to update kernel to 64K page size on CentOS - Lijian
- Will try with CentOS 8 which seems to be working fine with 64K page size.
- Looks like it would be only build and unit test. VPP device and performance tests would run on the physical devices in the lab.
- cross compilation in VPP PATH. Jira(Juraj): https://jira.fd.io/browse/CTP-3
- Tried cross-compiling with DPDK only.
- Initial cross-compiling is working fine.
- Patch is under review. https://gerrit.fd.io/r/#/c/vpp/+/21035/
- There are issues building VPP distro packages.
- What is not finished is packaging. Currently we support Ubuntu's DEB packages for the aarch64 architecture (make PLATFORM=aarch64-generic pkg-deb); this is what I am currently trying to sort out ...
- We can put the cross-compiling knowledge into the 'for developers' section of the vpp-docs project.
- Doesn't work; seems to be caused by improper cross-building tools. https://wiki.fd.io/view/VPP/Build_System_Deep_Dive
- https://gerrit.fd.io/r/#/c/vpp/+/21035/, VPP path cross compilation
- setup proper platform specific folder names in DEB packages
- proper architecture string included in the DEB package name
- Loop Nitin, Sachin, Honnappa, Lijian in container cross-compilation discussion.
- VPP Device
- A total of 29 VPP device test cases were executed, and all passed on Arm servers.
- Currently VPP device is not executed per patch. Issue is still under investigation.
- VPP device job is running now and will be triggered per VPP patch and CSIT patch
- https://jenkins.fd.io/view/csit/job/csit-vpp-device-master-ubuntu1804-1n-tx2/
- https://jenkins.fd.io/view/vpp/job/vpp-csit-verify-device-master-1n-tx2/
- https://jenkins.fd.io/view/csit/job/csit-vpp-device-master-ubuntu1804-1n-tx2-semiweekly/
- https://jenkins.fd.io/view/csit/job/csit-vpp-device-master-ubuntu1804-1n-tx2-weekly/
- VPP Performance Test
- FD.io lab
- Two ThunderX2 servers are installed in the Arm lab, but the Intel NIC cannot be enumerated on one of them.
- What's the preferred work method with Mellanox NIC, using DPDK pmd or RDMA? - Juraj
- Check BIOS version - Lijian
- Make sure all NICs are plugged into the same PCI slot number - Lijian
- Verify Intel i40e driver/firmware version - Lijian
- Confirm the power cable requirements for the Vexxhost lab, and inform them about the 2 servers coming to the lab. - Juraj
- Current Configurations:
- RAM: 256G
- Disk: 480G SSD
- The boxes are coming with Qlogic cards which are not supported in VPP.
- Changes required to the servers:
- The power cable specifications will be 4 x 6ft 14 AWG C13 to C14 Power Cables.
- Need 2 Intel NICs XL710-QDA2 for each server.
- If there is space for 2 more cards, we should add 2 Mellanox cards with SR-IOV capability.
- Disk size to 480G
- Both the servers should have the NICs in the same PCIe slots. It does not matter which PCIe slots the cards are in.
- Cables: N1, P1 to N2, P1 and so on
- Cables for IPMI and Management port: 2
- Will call a meeting once the ThunderX2 reaches Dean, inviting Juraj and the Vexxhost people in the FD.io lab to make sure the hardware is ready.
- Require and install a 1G NIC for the management port because of the 1G management switch. - Lijian
- Require two ThunderX2 servers in the FD.io lab (currently there is only one).
- Require one more ThunderX2 to form a normal cluster (1xThunderX + 2xThunderX2), to enable voting right for Arm servers in CSIT
- OS: Ubuntu 18.04
- Server info in CSIT docs:
- https://gerrit.fd.io/r/gitweb?p=csit.git;a=blob;f=docs/lab/testbed_specifications.md;h=afa36ff56c7be09621e85bae6a1498aadf3a1981;hb=HEAD
- https://gerrit.fd.io/r/gitweb?p=csit.git;a=blob;f=docs/lab/testbed_specifications.md;h=afa36ff56c7be09621e85bae6a1498aadf3a1981;hb=HEAD#l495
- ThunderX1
- Two ThunderX2 servers are installed in the Arm lab, but the Intel NIC cannot be enumerated on one of them.
- VPP
- Align Arm patches with VPP release plan.
- F0 2020-01-08 APIs frozen. Only low-risk changes accepted on main branch.
- RC1 2020-01-15 (F0+7) Code complete, pull first release throttle branch, only bug fixes accepted on throttle train. After pull: main branch reopens for new feature / risky commits. First artifacts posted.
- RC2 2020-01-22 (RC1+7) Second artifacts posted.
- Formal Release 2020-01-29 (RC2+7) 20.01 release artifacts available
- Vectorization
- https://gerrit.fd.io/r/c/vpp/+/22391 - vlib: vectorized buffer pointer to index with 128-bit SIMD
- https://gerrit.fd.io/r/c/vpp/+/22392 - ethernet: 128-bits vectorized next node selection
- https://gerrit.fd.io/r/c/vpp/+/20273 - vppinfra: vectorize eth_input_adv_and_flags_x4 with 128-bit SIMD
- MAP with VPP - error is resolved. Sort of working. Record the details.
- Usage of MAP is recorded in confluence
- MAP can give profiling data at specific points on the timeline
- MAP cannot profile specific CPU cores, and cannot show assembly views
- Add max-size parameter to pmalloc module. - Jieqiang
- Investigate bihash operations as hot spots in L2 throughput
- Apply prefetches and dual loop with l2_fwd node, and failed to l2_learn
- Make ACL entries cache-line aligned, and benchmark the result.
- Cache misses and CRC32 calculation are possible opportunities.
- To check cycles by applying CRC32 calculation unrolling
- Benchmark VPP on Dawn N1SDP board
- Use rte_mbuf_sanity_check to check metadata.
- Investigating bihash lockless implementation - Jason
- First apply make_working_copy for all bihash update operations, then apply RCU to make lookups lock-free
- EPIC for next quarter:
- Align Arm patches with VPP release plan.
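The bihash plan noted in this entry (update through a working copy, then RCU-style lock-free lookups) can be illustrated with a toy copy-then-publish table. This is a conceptual sketch only, not VPP's bihash code: the class and method names are hypothetical, a Python dict stands in for the hash buckets, and Python's garbage collector stands in for the RCU grace period that would reclaim the old copy.

```python
import threading


class CopyOnWriteTable:
    """Toy key/value table: lock-free reads, copy-then-publish writes."""

    def __init__(self):
        self._live = {}                       # snapshot that readers see
        self._writer_lock = threading.Lock()  # serialises writers only

    def lookup(self, key, default=None):
        # Readers take no lock: they grab the current snapshot reference once
        # and search it, even if a writer publishes a new one meanwhile.
        snapshot = self._live
        return snapshot.get(key, default)

    def update(self, key, value):
        with self._writer_lock:
            working_copy = dict(self._live)   # modify a private working copy
            working_copy[key] = value
            self._live = working_copy         # publish by swapping the reference

    def delete(self, key):
        with self._writer_lock:
            working_copy = dict(self._live)
            working_copy.pop(key, None)
            self._live = working_copy


table = CopyOnWriteTable()
table.update("aa:bb:cc:dd:ee:01", 1)
reader_view = table._live                     # a reader that started earlier
table.update("aa:bb:cc:dd:ee:02", 2)

print(table.lookup("aa:bb:cc:dd:ee:02"))      # new lookups see the new entry
print("aa:bb:cc:dd:ee:02" in reader_view)     # the old snapshot is untouched
```

The point of the design is that a reader never observes a half-updated table: it sees either the old snapshot or the new one, at the cost of copying on every update.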
12/03/2019
- Attendees
- Tina Tsou
- Honnappa Nagarahalli
- Lijian Zhang
- Jieqiang Wang
- Jason Zhang
- Juraj Linkes
- Christian Hopps
- Dean Arnold
- Michaela Tahiri
- Xiaoming Jiang
- General
- CSIT
- VPP Performance Test
- Performance data on Arm in official release 19.08 is available.
- Performance job is merged. https://jenkins.fd.io/view/csit/job/csit-vpp-perf-mrr-daily-master-3n-tsh
- Trending data recorded with https://docs.fd.io/csit/master/trending/introduction/introduction.html
- Trending page: https://docs.fd.io/csit/master/trending/index.html
- There's a Python API issue which affects all performance tests on Taishan server only.
- The failure turns out to be caused by PCI show with Mellanox NICs on Taishan servers.
- Talk to Peter to temporarily remove 'PCI dump' for Taishan servers - Juraj
- Could you try a debug build of VPP with this setup and capture the traceback log? - Juraj
- Will try to root cause the problem with Taishan + Mellanox NIC - Lijian
- VPP Path
- Verifying VPP on CentOS/Arm - Juraj
- Trying to update kernel to 64K page size on CentOS - Lijian
- Looks like it would be only build and unit test. VPP device and performance tests would run on the physical devices in the lab.
- cross compilation in VPP PATH. Jira(Juraj): https://jira.fd.io/browse/CTP-3
- Tried cross-compiling with DPDK only.
- Initial cross-compiling is working fine.
- Patch is under review. https://gerrit.fd.io/r/#/c/vpp/+/21035/
- There are issues building VPP distro packages.
- What is not finished is packaging. Currently we support Ubuntu's DEB packages for the aarch64 architecture (make PLATFORM=aarch64-generic pkg-deb); this is what I am currently trying to sort out ...
- We can put the cross-compiling knowledge into the 'for developers' section of the vpp-docs project.
- Doesn't work; seems to be caused by improper cross-building tools. https://wiki.fd.io/view/VPP/Build_System_Deep_Dive
- https://gerrit.fd.io/r/#/c/vpp/+/21035/, VPP path cross compilation
- setup proper platform specific folder names in DEB packages
- proper architecture string included in the DEB package name
- Loop Nitin, Sachin, Honnappa, Lijian in container cross-compilation discussion.
- VPP Device
- VPP device failed after Python3 upgrade
- A total of 29 VPP device test cases were executed, and all passed on Arm servers.
- Currently VPP device is not executed per patch. Issue is still under investigation.
- VPP device job is running now and will be triggered per VPP patch and CSIT patch
- https://jenkins.fd.io/view/csit/job/csit-vpp-device-master-ubuntu1804-1n-tx2/
- https://jenkins.fd.io/view/vpp/job/vpp-csit-verify-device-master-1n-tx2/
- https://jenkins.fd.io/view/csit/job/csit-vpp-device-master-ubuntu1804-1n-tx2-semiweekly/
- https://jenkins.fd.io/view/csit/job/csit-vpp-device-master-ubuntu1804-1n-tx2-weekly/
- VPP Performance Test
- FD.io lab
- Confirm the power cable requirements for the Vexxhost lab, and inform them about the 2 servers coming to the lab. - Juraj
- Current Configurations:
- RAM: 256G
- Disk: 480G SSD
- The boxes are coming with Qlogic cards which are not supported in VPP.
- Changes required to the servers:
- The power cable specifications will be 4 x 6ft 14 AWG C13 to C14 Power Cables.
- Need 2 Intel NICs XL710-QDA2 for each server.
- If there is space for 2 more cards, we should add 2 Mellanox cards with SR-IOV capability.
- Disk size to 480G
- Both the servers should have the NICs in the same PCIe slots. It does not matter which PCIe slots the cards are in.
- Cables: N1, P1 to N2, P1 and so on
- Cables for IPMI and Management port: 2
- Will call a meeting once the ThunderX2 reaches Dean, inviting Juraj and the Vexxhost people in the FD.io lab to make sure the hardware is ready.
- Require and install a 1G NIC for the management port because of the 1G management switch. - Lijian
- Require two ThunderX2 servers in the FD.io lab (currently there is only one).
- Require one more ThunderX2 to form a normal cluster (1xThunderX + 2xThunderX2), to enable voting right for Arm servers in CSIT
- OS: Ubuntu 18.04
- Server info in CSIT docs:
- https://gerrit.fd.io/r/gitweb?p=csit.git;a=blob;f=docs/lab/testbed_specifications.md;h=afa36ff56c7be09621e85bae6a1498aadf3a1981;hb=HEAD
- https://gerrit.fd.io/r/gitweb?p=csit.git;a=blob;f=docs/lab/testbed_specifications.md;h=afa36ff56c7be09621e85bae6a1498aadf3a1981;hb=HEAD#l495
- ThunderX1
- VPP
- Align Arm patches with VPP release plan.
- F0 2020-01-08 APIs frozen. Only low-risk changes accepted on main branch.
- RC1 2020-01-15 (F0+7) Code complete, pull first release throttle branch, only bug fixes accepted on throttle train. After pull: main branch reopens for new feature / risky commits. First artifacts posted.
- RC2 2020-01-22 (RC1+7) Second artifacts posted.
- Formal Release 2020-01-29 (RC2+7) 20.01 release artifacts available
- Vectorization
- https://gerrit.fd.io/r/c/vpp/+/22391 - vlib: vectorized buffer pointer to index with 128-bit SIMD
- https://gerrit.fd.io/r/c/vpp/+/22392 - ethernet: 128-bits vectorized next node selection
- https://gerrit.fd.io/r/c/vpp/+/20273 - vppinfra: vectorize eth_input_adv_and_flags_x4 with 128-bit SIMD
- MAP with VPP - error is resolved. Sort of working. Record the details.
- Usage of MAP is recorded in confluence
- MAP can give profiling data at specific points on the timeline
- MAP cannot profile specific CPU cores, and cannot show assembly views
- Add max-size parameter to pmalloc module. - Jieqiang
- Investigate bihash operations as hot spots in L2 throughput
- Apply prefetches and dual loop with l2_fwd node, and failed to l2_learn
- Make ACL entries cache-line aligned, and benchmark the result.
- Cache misses and CRC32 calculation are possible opportunities.
- To check cycles by applying CRC32 calculation unrolling
- Benchmark VPP on Dawn N1SDP board
- Use rte_mbuf_sanity_check to check metadata.
- Investigating bihash lockless implementation - Jason
- First apply make_working_copy for all bihash update operations, then apply RCU to make lookups lock-free
- EPIC for next quarter:
- Align Arm patches with VPP release plan.
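For the 64K page size work on CentOS noted in this entry, one quick way to confirm which base page size a booted kernel is actually using is to query it from user space. A minimal sketch, assuming a Linux/Unix host with Python 3 (the helper name is illustrative):

```python
import os


def page_size_kib():
    """Return the running kernel's base page size in KiB."""
    return os.sysconf("SC_PAGE_SIZE") // 1024


size = page_size_kib()
print(f"kernel page size: {size} KiB")
```

On a kernel built with 64K pages this reports 64; on a default 4K kernel it reports 4.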
11/26/2019
- Attendees
- Tina Tsou
- Honnappa Nagarahalli
- Lijian Zhang
- Jieqiang Wang
- Jason Zhang
- Juraj Linkes
- Christian Hopps
- Dean Arnold
- Michaela Tahiri
- Xiaoming Jiang
- General
- CSIT
- VPP Performance Test
- Performance data on Arm in official release 19.08 is available.
- Performance job is merged. https://jenkins.fd.io/view/csit/job/csit-vpp-perf-mrr-daily-master-3n-tsh
- Trending data recorded with https://docs.fd.io/csit/master/trending/introduction/introduction.html
- Trending page: https://docs.fd.io/csit/master/trending/index.html
- There's a Python API issue which affects all performance tests on Taishan server only.
- VPP Path
- Looks like it would be only build and unit test. VPP device and performance tests would run on the physical devices in the lab.
- cross compilation in VPP PATH. Jira(Juraj): https://jira.fd.io/browse/CTP-3
- Tried cross-compiling with DPDK only.
- Initial cross-compiling is working fine.
- Patch is under review. https://gerrit.fd.io/r/#/c/vpp/+/21035/
- There are issues building VPP distro packages.
- What is not finished is packaging. Currently we support Ubuntu's DEB packages for the aarch64 architecture (make PLATFORM=aarch64-generic pkg-deb); this is what I am currently trying to sort out ...
- We can put the cross-compiling knowledge into the 'for developers' section of the vpp-docs project.
- Doesn't work; seems to be caused by improper cross-building tools. https://wiki.fd.io/view/VPP/Build_System_Deep_Dive
- https://gerrit.fd.io/r/#/c/vpp/+/21035/, VPP path cross compilation
- setup proper platform specific folder names in DEB packages
- proper architecture string included in the DEB package name
- Loop Nitin, Sachin, Honnappa, Lijian in container cross-compilation discussion.
- VPP Device
- A total of 29 VPP device test cases were executed, and all passed on Arm servers.
- Currently VPP device is not executed per patch. Issue is still under investigation.
- VPP device job is running now and will be triggered per VPP patch and CSIT patch
- https://jenkins.fd.io/view/csit/job/csit-vpp-device-master-ubuntu1804-1n-tx2/
- https://jenkins.fd.io/view/vpp/job/vpp-csit-verify-device-master-1n-tx2/
- https://jenkins.fd.io/view/csit/job/csit-vpp-device-master-ubuntu1804-1n-tx2-semiweekly/
- https://jenkins.fd.io/view/csit/job/csit-vpp-device-master-ubuntu1804-1n-tx2-weekly/
- VPP Performance Test
- FD.io lab
- Confirm the power cable requirements for the Vexxhost lab, and inform them about the 2 servers coming to the lab. - Juraj
- Current Configurations:
- RAM: 256G
- Disk: 480G SSD
- The boxes are coming with Qlogic cards which are not supported in VPP.
- Changes required to the servers:
- The power cable specifications will be 4 x 6ft 14 AWG C13 to C14 Power Cables.
- Need 2 Intel NICs XL710-QDA2 for each server.
- If there is space for 2 more cards, we should add 2 Mellanox cards with SR-IOV capability.
- Disk size to 480G
- Both the servers should have the NICs in the same PCIe slots. It does not matter which PCIe slots the cards are in.
- Cables: N1, P1 to N2, P1 and so on
- Cables for IPMI and Management port: 2
- Will call a meeting once the ThunderX2 reaches Dean, inviting Juraj and the Vexxhost people in the FD.io lab to make sure the hardware is ready.
- Require and install a 1G NIC for the management port because of the 1G management switch. - Lijian
- Require two ThunderX2 servers in the FD.io lab (currently there is only one).
- Require one more ThunderX2 to form a normal cluster (1xThunderX + 2xThunderX2), to enable voting right for Arm servers in CSIT
- OS: Ubuntu 18.04
- Server info in CSIT docs:
- https://gerrit.fd.io/r/gitweb?p=csit.git;a=blob;f=docs/lab/testbed_specifications.md;h=afa36ff56c7be09621e85bae6a1498aadf3a1981;hb=HEAD
- https://gerrit.fd.io/r/gitweb?p=csit.git;a=blob;f=docs/lab/testbed_specifications.md;h=afa36ff56c7be09621e85bae6a1498aadf3a1981;hb=HEAD#l495
- ThunderX1
- VPP
- Align Arm patches with VPP release plan.
- F0 2020-01-08 APIs frozen. Only low-risk changes accepted on main branch.
- RC1 2020-01-15 (F0+7) Code complete, pull first release throttle branch, only bug fixes accepted on throttle train. After pull: main branch reopens for new feature / risky commits. First artifacts posted.
- RC2 2020-01-22 (RC1+7) Second artifacts posted.
- Formal Release 2020-01-29 (RC2+7) 20.01 release artifacts available
- Vectorization
- https://gerrit.fd.io/r/c/vpp/+/22391 - vlib: vectorized buffer pointer to index with 128-bit SIMD
- https://gerrit.fd.io/r/c/vpp/+/22392 - ethernet: 128-bits vectorized next node selection
- https://gerrit.fd.io/r/c/vpp/+/20273 - vppinfra: vectorize eth_input_adv_and_flags_x4 with 128-bit SIMD
- MAP with VPP - error is resolved. Sort of working. Record the details.
- There's no crash issue with latest VPP code.
- Will do software profiling with MAP on VPP.
- Add max-size parameter to pmalloc module. - Jieqiang
- Investigate bihash operations as hot spots in L2 throughput
- Apply prefetches and dual loop with l2_fwd node, and failed to l2_learn
- Make ACL entries cache-line aligned, and benchmark the result.
- Cache misses and CRC32 calculation are possible opportunities.
- To check cycles by applying CRC32 calculation unrolling
- Benchmark VPP on Dawn N1SDP board
- Use rte_mbuf_sanity_check to check metadata.
- Investigating bihash lockless implementation - Jason
- First apply make_working_copy for all bihash update operations, then apply RCU to make lookups lock-free
- EPIC for next quarter:
- Align Arm patches with VPP release plan.
11/19/2019
- Attendees
- Tina Tsou
- Honnappa Nagarahalli
- Lijian Zhang
- Jieqiang Wang
- Jason Zhang
- Juraj Linkes
- Christian Hopps
- Dean Arnold
- Michaela Tahiri
- General
- CSIT
- VPP Performance Test
- Performance data on Arm in official release 19.08 is available.
- Performance job is merged. https://jenkins.fd.io/view/csit/job/csit-vpp-perf-mrr-daily-master-3n-tsh
- Trending data recorded with https://docs.fd.io/csit/master/trending/introduction/introduction.html
- Trending page: https://docs.fd.io/csit/master/trending/index.html
- There's a Python API issue which affects all performance tests on Taishan server only.
- VPP Path
- Looks like it would be only build and unit test. VPP device and performance tests would run on the physical devices in the lab.
- cross compilation in VPP PATH. Jira(Juraj): https://jira.fd.io/browse/CTP-3
- Tried cross-compiling with DPDK only.
- Initial cross-compiling is working fine.
- Patch is under review. https://gerrit.fd.io/r/#/c/vpp/+/21035/
- There are issues building VPP distro packages.
- What is not finished is packaging. Currently we support Ubuntu's DEB packages for the aarch64 architecture (make PLATFORM=aarch64-generic pkg-deb); this is what I am currently trying to sort out ...
- We can put the cross-compiling knowledge into the 'for developers' section of the vpp-docs project.
- Doesn't work; seems to be caused by improper cross-building tools. https://wiki.fd.io/view/VPP/Build_System_Deep_Dive
- https://gerrit.fd.io/r/#/c/vpp/+/21035/, VPP path cross compilation
- setup proper platform specific folder names in DEB packages
- proper architecture string included in the DEB package name
- Loop Nitin, Sachin, Honnappa, Lijian in container cross-compilation discussion.
- VPP Device
- A total of 29 VPP device test cases were executed, and all passed on Arm servers.
- Currently VPP device is not executed per patch. Issue is still under investigation.
- VPP device job is running now and will be triggered per VPP patch and CSIT patch
- https://jenkins.fd.io/view/csit/job/csit-vpp-device-master-ubuntu1804-1n-tx2/
- https://jenkins.fd.io/view/vpp/job/vpp-csit-verify-device-master-1n-tx2/
- https://jenkins.fd.io/view/csit/job/csit-vpp-device-master-ubuntu1804-1n-tx2-semiweekly/
- https://jenkins.fd.io/view/csit/job/csit-vpp-device-master-ubuntu1804-1n-tx2-weekly/
- VPP Performance Test
- FD.io lab
- Confirm the power cable requirements for the Vexxhost lab. Inform about the 2 servers coming to the lab. - Juraj
- Current Configurations:
- RAM: 256G
- Disk: 480G SSD
- The boxes are coming with Qlogic cards which are not supported in VPP.
- Changes required to the servers:
- The power cable specifications will be 4 x 6ft 14 AWG C13 to C14 Power Cables.
- Need 2 Intel NICs XL710-QDA2 for each server.
- If there is space for 2 more cards, we should add 2 Mellanox cards with SR-IOV capability.
- Disk size to 480G
- Both the servers should have the NICs in the same PCIe slots. It does not matter which PCIe slots the cards are in.
- Cables: N1, P1 to N2, P1 and so on
- Cables for IPMI and Management port: 2
- Will call a meeting once Dean receives the ThunderX2, inviting Juraj/Vexxhost people in FD.io lab to make sure the hardware is ready.
- Require and install 1G NIC for the management port because of the 1G management switch. - Lijian
- Require two ThunderX2 (currently only one ThunderX2 in the lab) in FD.io lab
- Require one more ThunderX2 to form a normal cluster (1xThunderX + 2xThunderX2), to enable voting rights for Arm servers in CSIT
- OS: Ubuntu 18.04
- Server info in CSIT docs:
- https://gerrit.fd.io/r/gitweb?p=csit.git;a=blob;f=docs/lab/testbed_specifications.md;h=afa36ff56c7be09621e85bae6a1498aadf3a1981;hb=HEAD
- https://gerrit.fd.io/r/gitweb?p=csit.git;a=blob;f=docs/lab/testbed_specifications.md;h=afa36ff56c7be09621e85bae6a1498aadf3a1981;hb=HEAD#l495
- ThunderX1
- VPP
- Align Arm patches with VPP release plan.
- F0 2020-01-08 APIs frozen. Only low-risk changes accepted on main branch.
- RC1 2020-01-15 (F0+7) Code complete, pull first release throttle branch, only bug fixes accepted on throttle train. After pull: main branch reopens for new feature / risky commits. First artifacts posted.
- RC2 2020-01-22 (RC1+7) Second artifacts posted.
- Formal Release 2020-01-29 (RC2+7) 20.01 release artifacts available
- Vectorization
- https://gerrit.fd.io/r/c/vpp/+/22391 - vlib: vectorized buffer pointer to index with 128-bit SIMD
- https://gerrit.fd.io/r/c/vpp/+/22392 - ethernet: 128-bits vectorized next node selection
- https://gerrit.fd.io/r/c/vpp/+/20273 - vppinfra: vectorize eth_input_adv_and_flags_x4 with 128-bit SIMD
- MAP with VPP - error is resolved. Sort of working. Record the details.
- There's no crash issue with latest VPP code.
- Will do software profiling with MAP on VPP.
- Add max-size parameter to pmalloc module. - Jieqiang
- Investigate whether bihash operations are hot-spots in L2 throughput
- Applied prefetches and dual loop to the l2_fwd node, but failed with l2_learn
- Get ACL entries cache-line aligned, and benchmark the result.
- Cache misses and CRC32 calculation are possible opportunities.
- Check the cycle count after unrolling the CRC32 calculation.
- Benchmark VPP on Dawn N1SDP board
- Use rte_mbuf_sanity_check to check metadata.
- Investigating bi-hash lockless implementation - Jason
- First apply make_working_copy for all bihash update operations, and then apply RCU to make look-up lock-less
- EPIC for next quarter:
- Align Arm patches with VPP release plan.
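The 20.01 milestones in the release plan above are each spaced seven days apart, starting from the F0 API freeze on 2020-01-08. A minimal Python sketch of that arithmetic follows. The function name `release_milestones` and the dictionary layout are illustrative assumptions, not part of any FD.io tooling.

```python
from datetime import date, timedelta


def release_milestones(f0: date, step_days: int = 7) -> dict:
    """Derive RC1, RC2 and the formal release date from the F0 date.

    Each milestone is assumed to follow the previous one by step_days,
    matching the (F0+7), (RC1+7), (RC2+7) notes in the release plan.
    """
    names = ["F0", "RC1", "RC2", "Release"]
    return {name: f0 + timedelta(days=i * step_days) for i, name in enumerate(names)}


if __name__ == "__main__":
    for name, day in release_milestones(date(2020, 1, 8)).items():
        print(f"{name}: {day.isoformat()}")
```

Knowing the F0 date is enough to work out how long Arm patches that touch APIs have before the freeze.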
11/12/2019
- Attendees
- Tina Tsou
- Honnappa Nagarahalli
- Lijian Zhang
- Jieqiang Wang
- Jason Zhang
- Juraj Linkes
- Christian Hopps
- Stanislav Clebec
- General
- CSIT
- VPP Performance Test
- Performance data on Arm in official release 19.08 is available.
- Performance job is merged. https://jenkins.fd.io/view/csit/job/csit-vpp-perf-mrr-daily-master-3n-tsh
- Trending data recorded with https://docs.fd.io/csit/master/trending/introduction/introduction.html
- Trending page: https://docs.fd.io/csit/master/trending/index.html
- VPP Path
- Looks like it would be only build and unit test. VPP device and performance tests would run on the physical devices in the lab.
- cross compilation in VPP PATH. Jira(Juraj): https://jira.fd.io/browse/CTP-3
- Tried cross-compiling with DPDK only.
- Initial cross-compiling is working fine.
- Patch is under review. https://gerrit.fd.io/r/#/c/vpp/+/21035/
- There are issues with building VPP distro packages.
- What is not finished is packaging. Currently we support Ubuntu's DEB packages for the aarch64 architecture (make PLATFORM=aarch64-generic pkg-deb); this is what I am currently trying to sort out ...
- We can put the cross-compiling knowledge into section 'for developers', vpp-docs project.
- Doesn't work; seems to be caused by improper cross-building tools. https://wiki.fd.io/view/VPP/Build_System_Deep_Dive
- https://gerrit.fd.io/r/#/c/vpp/+/21035/, VPP path cross compilation
- setup proper platform specific folder names in DEB packages
- proper architecture string included in the DEB package name
- VPP Device
- https://gerrit.fd.io/r/#/c/vpp/+/21152/ - to fix occasional VPP device failure - Merged.
- A total of 29 VPP device test cases were executed, and all passed on Arm servers.
- Currently VPP device is not executed per patch. Issue is still under investigation.
- VPP device job is running now and will be triggered per VPP patch and CSIT patch
- https://jenkins.fd.io/view/csit/job/csit-vpp-device-master-ubuntu1804-1n-tx2/
- https://jenkins.fd.io/view/vpp/job/vpp-csit-verify-device-master-1n-tx2/
- https://jenkins.fd.io/view/csit/job/csit-vpp-device-master-ubuntu1804-1n-tx2-semiweekly/
- https://jenkins.fd.io/view/csit/job/csit-vpp-device-master-ubuntu1804-1n-tx2-weekly/
- VPP Performance Test
- FD.io lab
- Confirm the power cable requirements for the Vexxhost lab. Inform about the 2 servers coming to the lab. - Juraj
- Current Configurations:
- RAM: 256G
- Disk: 480G SSD
- The boxes are coming with Qlogic cards which are not supported in VPP.
- Changes required to the servers:
- The power cable specifications will be 4 x 6ft 14 AWG C13 to C14 Power Cables.
- Need 2 Intel NICs XL710-QDA2 for each server.
- If there is space for 2 more cards, we should add 2 Mellanox cards with SR-IOV capability.
- Disk size to 480G
- Both the servers should have the NICs in the same PCIe slots. It does not matter which PCIe slots the cards are in.
- Cables: N1, P1 to N2, P1 and so on
- Cables for IPMI and Management port: 2
- Will call a meeting once Dean receives the ThunderX2, inviting Juraj/Vexxhost people in FD.io lab to make sure the hardware is ready.
- Require and install 1G NIC for the management port because of the 1G management switch. - Lijian
- Require two ThunderX2 (currently only one ThunderX2 in the lab) in FD.io lab
- Require one more ThunderX2 to form a normal cluster (1xThunderX + 2xThunderX2), to enable voting rights for Arm servers in CSIT
- OS: Ubuntu 18.04
- Server info in CSIT docs:
- https://gerrit.fd.io/r/gitweb?p=csit.git;a=blob;f=docs/lab/testbed_specifications.md;h=afa36ff56c7be09621e85bae6a1498aadf3a1981;hb=HEAD
- https://gerrit.fd.io/r/gitweb?p=csit.git;a=blob;f=docs/lab/testbed_specifications.md;h=afa36ff56c7be09621e85bae6a1498aadf3a1981;hb=HEAD#l495
- ThunderX1
- VPP
- Align Arm patches with VPP release plan.
- F0 2020-01-08 APIs frozen. Only low-risk changes accepted on main branch.
- RC1 2020-01-15 (F0+7) Code complete, pull first release throttle branch, only bug fixes accepted on throttle train. After pull: main branch reopens for new feature / risky commits. First artifacts posted.
- RC2 2020-01-22 (RC1+7) Second artifacts posted.
- Formal Release 2020-01-29 (RC2+7) 20.01 release artifacts available
- Vectorization
- https://gerrit.fd.io/r/c/vpp/+/22391 - vlib: vectorized buffer pointer to index with 128-bit SIMD
- https://gerrit.fd.io/r/c/vpp/+/22392 - ethernet: 128-bits vectorized next node selection
- https://gerrit.fd.io/r/c/vpp/+/20273 - vppinfra: vectorize eth_input_adv_and_flags_x4 with 128-bit SIMD
- MAP with VPP - error is resolved. Sort of working. Record the details.
- There's no crash issue with latest VPP code.
- Will do software profiling with MAP on VPP.
- Add max-size parameter to pmalloc module. - Jieqiang
- Investigate whether bihash operations are hot-spots in L2 throughput
- Applied prefetches and dual loop to the l2_fwd node, but failed with l2_learn
- Get ACL entries cache-line aligned, and benchmark the result.
- Cache misses and CRC32 calculation are possible opportunities.
- Check the cycle count after unrolling the CRC32 calculation.
- Investigating bi-hash lockless implementation - Jason
- First apply make_working_copy for all bihash update operations, and then apply RCU to make look-up lockless
- EPIC for next quarter:
- Align Arm patches with VPP release plan.
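The bihash item above describes a two-step idea: writers modify a working copy, and readers then look up without taking a lock. The Python sketch below illustrates only the general copy-then-publish pattern. It is not VPP's bihash code, and the class `SnapshotTable` and its methods are hypothetical names. It assumes that rebinding an attribute is atomic in CPython, and it leaves reclamation of old snapshots (the part RCU handles in C) to the garbage collector.

```python
import threading


class SnapshotTable:
    """Copy-then-publish table: writers serialize, readers never lock."""

    def __init__(self):
        self._snapshot = {}                  # published dict, treated as read-only
        self._write_lock = threading.Lock()  # serializes writers only

    def lookup(self, key, default=None):
        # A reader takes the current snapshot reference once and uses it.
        # A concurrent update publishes a new dict and leaves this one intact.
        return self._snapshot.get(key, default)

    def update(self, key, value):
        with self._write_lock:
            working_copy = dict(self._snapshot)  # modify a private copy
            working_copy[key] = value
            self._snapshot = working_copy        # publish with one reference swap

    def delete(self, key):
        with self._write_lock:
            working_copy = dict(self._snapshot)
            working_copy.pop(key, None)
            self._snapshot = working_copy
```

The cost of this pattern is a full copy on every update, so it only suits tables where look-ups greatly outnumber updates.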
10/29/2019
- Attendees
- Tina Tsou
- Honnappa Nagarahalli
- Lijian Zhang
- Jieqiang Wang
- Jason Zhang
- Juraj Linkes
- Christian Hopps
- Stanislav Clebec
- General
- CSIT
- VPP Performance Test
- Performance data on Arm in official release 19.08 is available.
- Performance job is merged. https://jenkins.fd.io/view/csit/job/csit-vpp-perf-mrr-daily-master-3n-tsh
- Trending data recorded with https://docs.fd.io/csit/master/trending/introduction/introduction.html
- Trending page: https://docs.fd.io/csit/master/trending/index.html
- VPP Path
- Looks like it would be only build and unit test. VPP device and performance tests would run on the physical devices in the lab.
- cross compilation in VPP PATH. Jira(Juraj): https://jira.fd.io/browse/CTP-3
- Tried cross-compiling with DPDK only.
- Initial cross-compiling is working fine.
- Patch is under review. https://gerrit.fd.io/r/#/c/vpp/+/21035/
- There are issues with building VPP distro packages.
- What is not finished is packaging. Currently we support Ubuntu's DEB packages for the aarch64 architecture (make PLATFORM=aarch64-generic pkg-deb); this is what I am currently trying to sort out ...
- We can put the cross-compiling knowledge into section 'for developers', vpp-docs project.
- Doesn't work; seems to be caused by improper cross-building tools. https://wiki.fd.io/view/VPP/Build_System_Deep_Dive
- https://gerrit.fd.io/r/#/c/vpp/+/21035/, VPP path cross compilation
- setup proper platform specific folder names in DEB packages
- proper architecture string included in the DEB package name
- VPP Device
- https://gerrit.fd.io/r/#/c/vpp/+/21152/ - to fix occasional VPP device failure - Merged.
- A total of 29 VPP device test cases were executed, and all passed on Arm servers.
- Currently VPP device is not executed per patch. Issue is still under investigation.
- VPP device job is running now and will be triggered per VPP patch and CSIT patch
- https://jenkins.fd.io/view/csit/job/csit-vpp-device-master-ubuntu1804-1n-tx2/
- https://jenkins.fd.io/view/vpp/job/vpp-csit-verify-device-master-1n-tx2/
- https://jenkins.fd.io/view/csit/job/csit-vpp-device-master-ubuntu1804-1n-tx2-semiweekly/
- https://jenkins.fd.io/view/csit/job/csit-vpp-device-master-ubuntu1804-1n-tx2-weekly/
- VPP Performance Test
- FD.io lab
- Confirm the power cable requirements for the Vexxhost lab. Inform about the 2 servers coming to the lab. - Juraj
- Current Configurations:
- RAM: 256G
- Disk: 480G SSD
- The boxes are coming with Qlogic cards which are not supported in VPP.
- Changes required to the servers:
- The power cable specifications will be 4 x 6ft 14 AWG C13 to C14 Power Cables.
- Need 2 Intel NICs XL710-QDA2 for each server.
- If there is space for 2 more cards, we should add 2 Mellanox cards with SR-IOV capability.
- Disk size to 480G
- Both the servers should have the NICs in the same PCIe slots. It does not matter which PCIe slots the cards are in.
- Cables: N1, P1 to N2, P1 and so on
- Cables for IPMI and Management port: 2
- Will call a meeting once Dean receives the ThunderX2, inviting Juraj/Vexxhost people in FD.io lab to make sure the hardware is ready.
- Require and install 1G NIC for the management port because of the 1G management switch. - Lijian
- Require two ThunderX2 (currently only one ThunderX2 in the lab) in FD.io lab
- Require one more ThunderX2 to form a normal cluster (1xThunderX + 2xThunderX2), to enable voting rights for Arm servers in CSIT
- OS: Ubuntu 18.04
- Server info in CSIT docs:
- https://gerrit.fd.io/r/gitweb?p=csit.git;a=blob;f=docs/lab/testbed_specifications.md;h=afa36ff56c7be09621e85bae6a1498aadf3a1981;hb=HEAD
- https://gerrit.fd.io/r/gitweb?p=csit.git;a=blob;f=docs/lab/testbed_specifications.md;h=afa36ff56c7be09621e85bae6a1498aadf3a1981;hb=HEAD#l495
- ThunderX1
- VPP
- Align Arm patches with VPP release plan.
- F0 2020-01-08 APIs frozen. Only low-risk changes accepted on main branch.
- RC1 2020-01-15 (F0+7) Code complete, pull first release throttle branch, only bug fixes accepted on throttle train. After pull: main branch reopens for new feature / risky commits. First artifacts posted.
- RC2 2020-01-22 (RC1+7) Second artifacts posted.
- Formal Release 2020-01-29 (RC2+7) 20.01 release artifacts available
- Vectorization
- https://gerrit.fd.io/r/c/vpp/+/22391 - vlib: vectorized buffer pointer to index with 128-bit SIMD
- https://gerrit.fd.io/r/c/vpp/+/22392 - ethernet: 128-bits vectorized next node selection
- https://gerrit.fd.io/r/c/vpp/+/20273 - vppinfra: vectorize eth_input_adv_and_flags_x4 with 128-bit SIMD
- MAP with VPP - error is resolved. Sort of working. Record the details.
- There's no crash issue with latest VPP code.
- Will do software profiling with MAP on VPP.
- Add max-size parameter to pmalloc module. - Jieqiang
- Investigate whether bihash operations are hot-spots in L2 throughput
- Applied prefetches and dual loop to the l2_fwd node, but failed with l2_learn
- Get ACL entries cache-line aligned, and benchmark the result.
- Cache misses and CRC32 calculation are possible opportunities.
- Check the cycle count after unrolling the CRC32 calculation.
- Investigating bi-hash lockless implementation - Jason
- First apply make_working_copy for all bihash update operations, and then apply RCU to make look-up lockless
- EPIC for next quarter:
- Align Arm patches with VPP release plan.
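The ACL item above concerns keeping each entry within cache-line boundaries so that one look-up touches one line. As a rough illustration, the sketch below counts how many cache lines an entry covers at a given offset and computes the padded stride that avoids straddling. The 64-byte line size and the 48-byte entry size are assumptions chosen for the example, not values taken from the VPP ACL plugin.

```python
CACHE_LINE_BYTES = 64  # assumed line size; confirm for the target CPU


def lines_touched(offset: int, size: int, line: int = CACHE_LINE_BYTES) -> int:
    """Number of cache lines covered by the byte range [offset, offset + size)."""
    first = offset // line
    last = (offset + size - 1) // line
    return last - first + 1


def aligned_stride(size: int, line: int = CACHE_LINE_BYTES) -> int:
    """Round an entry size up to the next multiple of the cache line."""
    return (size + line - 1) // line * line


if __name__ == "__main__":
    entry = 48  # hypothetical entry size in bytes
    packed = [lines_touched(i * entry, entry) for i in range(4)]
    padded = [lines_touched(i * aligned_stride(entry), entry) for i in range(4)]
    print("packed:", packed)
    print("padded:", padded)
```

Padding trades memory for fewer lines touched per entry, which is why the minutes pair the alignment change with a benchmark run.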
10/22/2019
- Attendees
- Tina Tsou
- Honnappa Nagarahalli
- Lijian Zhang
- Jieqiang Wang
- Jason Zhang
- Juraj Linkes
- Christian Hopps
- Stanislav Clebec
- General
- CSIT
- VPP Performance Test
- Performance data on Arm in official release 19.08 is available.
- Performance job is merged. https://jenkins.fd.io/view/csit/job/csit-vpp-perf-mrr-daily-master-3n-tsh
- Trending data recorded with https://docs.fd.io/csit/master/trending/introduction/introduction.html
- Trending page: https://docs.fd.io/csit/master/trending/index.html
- VPP Path
- Looks like it would be only build and unit test. VPP device and performance tests would run on the physical devices in the lab.
- cross compilation in VPP PATH. Jira(Juraj): https://jira.fd.io/browse/CTP-3
- Tried cross-compiling with DPDK only.
- Initial cross-compiling is working fine.
- Patch is under review. https://gerrit.fd.io/r/#/c/vpp/+/21035/
- There are issues with building VPP distro packages.
- What is not finished is packaging. Currently we support Ubuntu's DEB packages for the aarch64 architecture (make PLATFORM=aarch64-generic pkg-deb); this is what I am currently trying to sort out ...
- We can put the cross-compiling knowledge into section 'for developers', vpp-docs project.
- Doesn't work; seems to be caused by improper cross-building tools. https://wiki.fd.io/view/VPP/Build_System_Deep_Dive
- https://gerrit.fd.io/r/#/c/vpp/+/21035/, VPP path cross compilation
- setup proper platform specific folder names in DEB packages
- proper architecture string included in the DEB package name
- VPP Device
- https://gerrit.fd.io/r/#/c/vpp/+/21152/ - to fix occasional VPP device failure - Merged.
- A total of 29 VPP device test cases were executed, and all passed on Arm servers.
- Currently VPP device is not executed per patch. Issue is still under investigation.
- VPP device job is running now and will be triggered per VPP patch and CSIT patch
- https://jenkins.fd.io/view/csit/job/csit-vpp-device-master-ubuntu1804-1n-tx2/
- https://jenkins.fd.io/view/vpp/job/vpp-csit-verify-device-master-1n-tx2/
- https://jenkins.fd.io/view/csit/job/csit-vpp-device-master-ubuntu1804-1n-tx2-semiweekly/
- https://jenkins.fd.io/view/csit/job/csit-vpp-device-master-ubuntu1804-1n-tx2-weekly/
- VPP Performance Test
- FD.io lab
- Confirm the power cable requirements for the Vexxhost lab. Inform about the 2 servers coming to the lab. - Juraj
- Current Configurations:
- RAM: 256G
- Disk: 480G SSD
- The boxes are coming with Qlogic cards which are not supported in VPP.
- Changes required to the servers:
- The power cable specifications will be 4 x 6ft 14 AWG C13 to C14 Power Cables.
- Need 2 Intel NICs XL710-QDA2 for each server.
- If there is space for 2 more cards, we should add 2 Mellanox cards with SR-IOV capability.
- Disk size to 480G
- Both the servers should have the NICs in the same PCIe slots. It does not matter which PCIe slots the cards are in.
- Cables: N1, P1 to N2, P1 and so on
- Cables for IPMI and Management port: 2
- Will call a meeting once Dean receives the ThunderX2, inviting Juraj/Vexxhost people in FD.io lab to make sure the hardware is ready.
- Require and install 1G NIC for the management port because of the 1G management switch. - Lijian
- Require two ThunderX2 (currently only one ThunderX2 in the lab) in FD.io lab
- Require one more ThunderX2 to form a normal cluster (1xThunderX + 2xThunderX2), to enable voting rights for Arm servers in CSIT
- OS: Ubuntu 18.04
- Server info in CSIT docs:
- https://gerrit.fd.io/r/gitweb?p=csit.git;a=blob;f=docs/lab/testbed_specifications.md;h=afa36ff56c7be09621e85bae6a1498aadf3a1981;hb=HEAD
- https://gerrit.fd.io/r/gitweb?p=csit.git;a=blob;f=docs/lab/testbed_specifications.md;h=afa36ff56c7be09621e85bae6a1498aadf3a1981;hb=HEAD#l495
- ThunderX1
- VPP
- Align Arm patches with VPP release plan.
- F0 2020-01-08 APIs frozen. Only low-risk changes accepted on main branch.
- RC1 2020-01-15 (F0+7) Code complete, pull first release throttle branch, only bug fixes accepted on throttle train. After pull: main branch reopens for new feature / risky commits. First artifacts posted.
- RC2 2020-01-22 (RC1+7) Second artifacts posted.
- Formal Release 2020-01-29 (RC2+7) 20.01 release artifacts available
- Vectorization
- https://gerrit.fd.io/r/c/vpp/+/22391 - vlib: vectorized buffer pointer to index with 128-bit SIMD
- https://gerrit.fd.io/r/c/vpp/+/22392 - ethernet: 128-bits vectorized next node selection
- https://gerrit.fd.io/r/c/vpp/+/20273 - vppinfra: vectorize eth_input_adv_and_flags_x4 with 128-bit SIMD
- MAP with VPP - error is resolved. Sort of working. Record the details.
- There's no crash issue with latest VPP code.
- Will do software profiling with MAP on VPP.
- Add max-size parameter to pmalloc module. - Lijian
- Investigate whether bihash operations are hot-spots in L2 throughput
- Applied prefetches and dual loop to the l2_fwd node, but failed with l2_learn
- Get ACL entries cache-line aligned, and benchmark the result.
- Cache misses and CRC32 calculation are possible opportunities.
- Check the cycle count after unrolling the CRC32 calculation.
- Investigating bi-hash lockless implementation - Jason
- First apply make_working_copy for all bihash update operations, and then apply RCU to make look-up lockless
- EPIC for next quarter:
- Align Arm patches with VPP release plan.
10/15/2019
- Attendees
- Tina Tsou
- Honnappa Nagarahalli
- Lijian Zhang
- Jieqiang Wang
- Jason Zhang
- Juraj Linkes
- Christian Hopps
- Stanislav Clebec
- General
- CSIT
- VPP Performance Test
- Performance data on Arm in official release 19.08 is available.
- Performance job is merged. https://jenkins.fd.io/view/csit/job/csit-vpp-perf-mrr-daily-master-3n-tsh
- Trending data recorded with https://docs.fd.io/csit/master/trending/introduction/introduction.html
- Trending page: https://docs.fd.io/csit/master/trending/index.html
- VPP Path
- Looks like it would be only build and unit test. VPP device and performance tests would run on the physical devices in the lab.
- cross compilation in VPP PATH. Jira(Juraj): https://jira.fd.io/browse/CTP-3
- Tried cross-compiling with DPDK only.
- Initial cross-compiling is working fine.
- Patch is under review. https://gerrit.fd.io/r/#/c/vpp/+/21035/
- There are issues with building VPP distro packages.
- What is not finished is packaging. Currently we support Ubuntu's DEB packages for the aarch64 architecture (make PLATFORM=aarch64-generic pkg-deb); this is what I am currently trying to sort out ...
- We can put the cross-compiling knowledge into section 'for developers', vpp-docs project.
- Doesn't work; seems to be caused by improper cross-building tools. https://wiki.fd.io/view/VPP/Build_System_Deep_Dive
- https://gerrit.fd.io/r/#/c/vpp/+/21035/, VPP path cross compilation
- setup proper platform specific folder names in DEB packages
- proper architecture string included in the DEB package name
- VPP Device
- https://gerrit.fd.io/r/#/c/vpp/+/21152/ - to fix occasional VPP device failure - Merged.
- A total of 29 VPP device test cases were executed, and all passed on Arm servers.
- Currently VPP device is not executed per patch. Issue is still under investigation.
- VPP device job is running now and will be triggered per VPP patch and CSIT patch
- https://jenkins.fd.io/view/csit/job/csit-vpp-device-master-ubuntu1804-1n-tx2/
- https://jenkins.fd.io/view/vpp/job/vpp-csit-verify-device-master-1n-tx2/
- https://jenkins.fd.io/view/csit/job/csit-vpp-device-master-ubuntu1804-1n-tx2-semiweekly/
- https://jenkins.fd.io/view/csit/job/csit-vpp-device-master-ubuntu1804-1n-tx2-weekly/
- VPP Performance Test
- FD.io lab
- Confirm the power cable requirements for the Vexxhost lab. Inform about the 2 servers coming to the lab. - Juraj
- Current Configurations:
- RAM: 256G
- Disk: 240G SSD
- The boxes are coming with Qlogic cards which are not supported in VPP.
- Changes required to the servers:
- The power cable specifications will be 4 x 6ft 14 AWG C13 to C14 Power Cables.
- Need 2 Intel NICs XL710-QDA2 for each server.
- If there is space for 2 more cards, we should add 2 Mellanox cards with SR-IOV capability.
- Disk size to 480G
- Both the servers should have the NICs in the same PCIe slots. It does not matter which PCIe slots the cards are in.
- Cables: N1, P1 to N2, P1 and so on
- Cables for IPMI and Management port: 2
- Will call a meeting once Dean receives the ThunderX2, inviting Juraj/Vexxhost people in FD.io lab to make sure the hardware is ready.
- Require and install 1G NIC for the management port because of the 1G management switch. - Lijian
- Require two ThunderX2 (currently only one ThunderX2 in the lab) in FD.io lab
- Require one more ThunderX2 to form a normal cluster (1xThunderX + 2xThunderX2), to enable voting rights for Arm servers in CSIT
- OS: Ubuntu 18.04
- Server info in CSIT docs:
- https://gerrit.fd.io/r/gitweb?p=csit.git;a=blob;f=docs/lab/testbed_specifications.md;h=afa36ff56c7be09621e85bae6a1498aadf3a1981;hb=HEAD
- https://gerrit.fd.io/r/gitweb?p=csit.git;a=blob;f=docs/lab/testbed_specifications.md;h=afa36ff56c7be09621e85bae6a1498aadf3a1981;hb=HEAD#l495
- ThunderX1
- VPP
- Align Arm patches with VPP release plan.
- F0 2020-01-08 APIs frozen. Only low-risk changes accepted on main branch.
- RC1 2020-01-15 (F0+7) Code complete, pull first release throttle branch, only bug fixes accepted on throttle train. After pull: main branch reopens for new feature / risky commits. First artifacts posted.
- RC2 2020-01-22 (RC1+7) Second artifacts posted.
- Formal Release 2020-01-29 (RC2+7) 20.01 release artifacts available
- Vectorization
- https://gerrit.fd.io/r/c/vpp/+/22391 - vlib: vectorized buffer pointer to index with 128-bit SIMD
- https://gerrit.fd.io/r/c/vpp/+/22392 - ethernet: 128-bits vectorized next node selection
- https://gerrit.fd.io/r/c/vpp/+/20273 - vppinfra: vectorize eth_input_adv_and_flags_x4 with 128-bit SIMD
- MAP with VPP - error is resolved. Sort of working. Record the details.
- There's no crash issue with latest VPP code.
- Will do software profiling with MAP on VPP.
- Investigate whether bihash operations are hot-spots in L2 throughput
- Cache misses and CRC32 calculation are possible opportunities.
- Check the cycle count after unrolling the CRC32 calculation.
- Investigating bi-hash lockless implementation - Jason
- EPIC for next quarter:
- Align Arm patches with VPP release plan.
10/08/2019
- Attendees
- Tina Tsou
- Honnappa Nagarahalli
- Lijian Zhang
- Jieqiang Wang
- Jason Zhang
- Juraj Linkes
- Christian Hopps
- Stanislav Clebec
- General
- CSIT
- VPP Performance Test
- Performance data on Arm in official release 19.08 is available.
- Performance job is merged. https://jenkins.fd.io/view/csit/job/csit-vpp-perf-mrr-daily-master-3n-tsh
- Trending data recorded with https://docs.fd.io/csit/master/trending/introduction/introduction.html
- Trending page: https://docs.fd.io/csit/master/trending/index.html
- VPP Path
- Looks like it would be only build and unit test. VPP device and performance tests would run on the physical devices in the lab.
- cross compilation in VPP PATH. Jira(Juraj): https://jira.fd.io/browse/CTP-3
- Tried cross-compiling with DPDK only.
- Initial cross-compiling is working fine.
- Patch is under review. https://gerrit.fd.io/r/#/c/vpp/+/21035/
- There are issues with building VPP distro packages.
- What is not finished is packaging. Currently we support Ubuntu's DEB packages for the aarch64 architecture (make PLATFORM=aarch64-generic pkg-deb); this is what I am currently trying to sort out ...
- We can put the cross-compiling knowledge into section 'for developers', vpp-docs project.
- Doesn't work; seems to be caused by improper cross-building tools. https://wiki.fd.io/view/VPP/Build_System_Deep_Dive
- https://gerrit.fd.io/r/#/c/vpp/+/21035/, VPP path cross compilation
- setup proper platform specific folder names in DEB packages
- proper architecture string included in the DEB package name
- VPP Device
- https://gerrit.fd.io/r/#/c/vpp/+/21152/ - to fix occasional VPP device failure - Merged.
- A total of 29 VPP device test cases were executed, and all passed on Arm servers.
- Currently VPP device is not executed per patch. Issue is still under investigation.
- VPP device job is running now and will be triggered per VPP patch and CSIT patch
- https://jenkins.fd.io/view/csit/job/csit-vpp-device-master-ubuntu1804-1n-tx2/
- https://jenkins.fd.io/view/vpp/job/vpp-csit-verify-device-master-1n-tx2/
- https://jenkins.fd.io/view/csit/job/csit-vpp-device-master-ubuntu1804-1n-tx2-semiweekly/
- https://jenkins.fd.io/view/csit/job/csit-vpp-device-master-ubuntu1804-1n-tx2-weekly/
- VPP Performance Test
- FD.io lab
- Confirm the power cable requirements for the Vexxhost lab. Inform about the 2 servers coming to the lab. - Juraj
- Current Configurations:
- RAM: 256G
- Disk: 240G SSD
- The boxes are coming with Qlogic cards which are not supported in VPP.
- Changes required to the servers:
- The power cable specifications will be 4 x 6ft 14 AWG C13 to C14 Power Cables.
- Need 2 Intel NICs XL710-QDA2 for each server.
- If there is space for 2 more cards, we should add 2 Mellanox cards with SR-IOV capability.
- Disk size to 480G
- Both the servers should have the NICs in the same PCIe slots. It does not matter which PCIe slots the cards are in.
- Cables: N1, P1 to N2, P1 and so on
- Cables for IPMI and Management port: 2
- Will call a meeting once Dean receives the ThunderX2, inviting Juraj/Vexxhost people in FD.io lab to make sure the hardware is ready.
- Require and install 1G NIC for the management port because of the 1G management switch. - Lijian
- Require two ThunderX2 (currently only one ThunderX2 in the lab) in FD.io lab
- Require one more ThunderX2 to form a normal cluster (1xThunderX + 2xThunderX2), to enable voting rights for Arm servers in CSIT
- OS: Ubuntu 18.04
- Server info in CSIT docs:
- https://gerrit.fd.io/r/gitweb?p=csit.git;a=blob;f=docs/lab/testbed_specifications.md;h=afa36ff56c7be09621e85bae6a1498aadf3a1981;hb=HEAD
- https://gerrit.fd.io/r/gitweb?p=csit.git;a=blob;f=docs/lab/testbed_specifications.md;h=afa36ff56c7be09621e85bae6a1498aadf3a1981;hb=HEAD#l495
- ThunderX1
- VPP
- Align Arm patches with VPP release plan.
- F0 2020-01-08 APIs frozen. Only low-risk changes accepted on main branch.
- RC1 2020-01-15 (F0+7) Code complete, pull first release throttle branch, only bug fixes accepted on throttle train. After pull: main branch reopens for new feature / risky commits. First artifacts posted.
- RC2 2020-01-22 (RC1+7) Second artifacts posted.
- Formal Release 2020-01-29 (RC2+7) 20.01 release artifacts available
- Vectorization
- https://gerrit.fd.io/r/c/vpp/+/22391 - vlib: vectorized buffer pointer to index with 128-bit SIMD
- https://gerrit.fd.io/r/c/vpp/+/22392 - ethernet: 128-bits vectorized next node selection
- https://gerrit.fd.io/r/c/vpp/+/20273 - vppinfra: vectorize eth_input_adv_and_flags_x4 with 128-bit SIMD
- MAP with VPP - error is resolved. Sort of working. Record the details.
- There's no crash issue with latest VPP code.
- Will do software profiling with MAP on VPP.
- Investigate whether bihash operations are hot-spots in L2 throughput
- Cache misses and CRC32 calculation are possible opportunities.
- Check the cycle count after unrolling the CRC32 calculation.
- Investigating bi-hash lockless implementation - Jason
- EPIC for next quarter:
- Align Arm patches with VPP release plan.
10/01/2019
- Attendees
- Tina Tsou
- Honnappa Nagarahalli
- Lijian Zhang
- Jieqiang Wang
- Jason Zhang
- Juraj Linkes
- Christian Hopps
- Stanislav Clebec
- General
- CSIT
- VPP Performance Test
- Performance data on Arm in official release 19.08 is available.
- Performance job is merged. https://jenkins.fd.io/view/csit/job/csit-vpp-perf-mrr-daily-master-3n-tsh
- Trending data recorded with https://docs.fd.io/csit/master/trending/introduction/introduction.html
- Trending page: https://docs.fd.io/csit/master/trending/index.html
- VPP Path
- Looks like it would be only build and unit test. VPP device and performance tests would run on the physical devices in the lab.
- cross compilation in VPP PATH. Jira(Juraj): https://jira.fd.io/browse/CTP-3
- Tried cross-compiling with DPDK only.
- Initial cross-compiling is working fine.
- Patch is under review. https://gerrit.fd.io/r/#/c/vpp/+/21035/
- There are issues with building VPP distro packages.
- What is not finished is packaging. Currently we support Ubuntu's DEB packages for the aarch64 architecture (make PLATFORM=aarch64-generic pkg-deb); this is what I am currently trying to sort out ...
- We can put the cross-compiling knowledge into section 'for developers', vpp-docs project.
- Doesn't work; seems to be caused by improper cross-building tools. https://wiki.fd.io/view/VPP/Build_System_Deep_Dive
- https://gerrit.fd.io/r/#/c/vpp/+/21035/, VPP path cross compilation
- setup proper platform specific folder names in DEB packages
- proper architecture string included in the DEB package name
- VPP Device
- https://gerrit.fd.io/r/#/c/vpp/+/21152/ - to fix occasional VPP device failure
- A total of 29 VPP device test cases were executed, and all passed on Arm servers.
- Currently VPP device is not executed per patch. Issue is still under investigation.
- VPP device job is running now and will be triggered per VPP patch and CSIT patch
- https://jenkins.fd.io/view/csit/job/csit-vpp-device-master-ubuntu1804-1n-tx2/
- https://jenkins.fd.io/view/vpp/job/vpp-csit-verify-device-master-1n-tx2/
- https://jenkins.fd.io/view/csit/job/csit-vpp-device-master-ubuntu1804-1n-tx2-semiweekly/
- https://jenkins.fd.io/view/csit/job/csit-vpp-device-master-ubuntu1804-1n-tx2-weekly/
- VPP Performance Test
- FD.io lab
- Confirm the power cable requirements for the Vexxhost lab. Inform about the 2 servers coming to the lab - Juraj
- Current Configurations:
- RAM: 256G
- Disk: 240G SSD
- The boxes are coming with Qlogic cards which are not supported in VPP.
- Changes required to the servers:
- Need 2 Intel NICs XL710-QDA2 for each server.
- If there is space for 2 more cards, we should add 2 Mellanox cards with SR-IOV capability.
- Disk size to 480G
- Both the servers should have the NICs in the same PCIe slots. It does not matter which PCIe slots the cards are in.
- Cables: N1, P1 to N2, P1 and so on
- Cables for IPMI and Management port: 2
- Will call a meeting once the ThunderX2 reaches Dean, inviting Juraj/Vexxhost people in FD.io lab to make sure the hardware is ready.
- Require and install 1G NIC for the management port because of the 1G management switch. - Lijian
- Require two ThunderX2 (currently only one ThunderX2 in the lab) in FD.io lab
- Require one more ThunderX2 to form a normal cluster (1xThunderX + 2xThunderX2), to enable voting right for Arm servers in CSIT
- OS: Ubuntu 18.04
- Server info in CSIT docs:
- https://gerrit.fd.io/r/gitweb?p=csit.git;a=blob;f=docs/lab/testbed_specifications.md;h=afa36ff56c7be09621e85bae6a1498aadf3a1981;hb=HEAD
- https://gerrit.fd.io/r/gitweb?p=csit.git;a=blob;f=docs/lab/testbed_specifications.md;h=afa36ff56c7be09621e85bae6a1498aadf3a1981;hb=HEAD#l495
- ThunderX1
- VPP
- Align Arm patches with VPP release plan. - Lijian
- Vectorization
- https://gerrit.fd.io/r/c/vpp/+/22391 - vlib: vectorized buffer pointer to index with 128-bit SIMD
- https://gerrit.fd.io/r/c/vpp/+/22392 - ethernet: 128-bits vectorized next node selection
- https://gerrit.fd.io/r/c/vpp/+/20273 - vppinfra: vectorize eth_input_adv_and_flags_x4 with 128-bit SIMD
- MAP with VPP - error is resolved. Sort of working. Record the details.
- There's no crash issue with latest VPP code.
- Investigate whether bihash operations are hot-spots in L2 throughput
- Cache misses and CRC32 calculation are possible opportunities.
- Investigating bi-hash lockless implementation - Jason
- EPIC for next quarter:
09/24/2019
- Attendees
- Tina Tsou
- Honnappa Nagarahalli
- Lijian Zhang
- Jieqiang Wang
- Jason Zhang
- Juraj Linkes
- Christian Hopps
- Khemendra Kumar
- Greeshma Katarki
- General
- Share VPN application and FD.io server access info to Greeshma and Khem.
- Finished PPT and demo to Pravin - Will share with Juraj and Honnappa.
- CSIT
- VPP Performance Test
- Investigate DPDK performance job - Juraj
- Performance data on Arm in official release 19.08 is available.
- Performance job is merged. https://jenkins.fd.io/view/csit/job/csit-vpp-perf-mrr-daily-master-3n-tsh
- Trending data recorded with https://docs.fd.io/csit/master/trending/introduction/introduction.html
- Trending page: https://docs.fd.io/csit/master/trending/index.html
- If there's performance drop in CSIT performance testing, what action will be taken and who will take care of the drop?
- Currently trending data could be monitored manually only.
- https://jenkins.fd.io/job/csit-vpp-perf-verify-1908-3n-tsh/ takes a lot of time: 3 days, 61 hours (28 hours on x86)
- Daily job is running twice a day on x86; on Arm, it takes 16 hours and will run one time each day.
- Trying to fix all the failures with daily test in performance test. Basically almost all the tests passed locally.
- Some common failures due to Python bindings or something inside VPP image.
- VPP Path
- cross compilation in VPP PATH. Jira(Juraj): https://jira.fd.io/browse/CTP-3
- Tried cross-compiling with DPDK only.
- Initial cross-compiling is working fine.
- Patch is under review. https://gerrit.fd.io/r/#/c/vpp/+/21035/
- There are issues with building VPP distros.
- What is not finished is packaging. Currently we support Ubuntu's DEB packages for the aarch64 architecture (make PLATFORM=aarch64-generic pkg-deb); this is what I am currently trying to sort out ...
- We can put the cross-compiling knowledge into section 'for developers', vpp-docs project.
- Doesn't work; seems to be caused by improper cross-building tools. https://wiki.fd.io/view/VPP/Build_System_Deep_Dive
- https://gerrit.fd.io/r/#/c/vpp/+/21035/, VPP path cross compilation
- setup proper platform specific folder names in DEB packages
- proper architecture string included in the DEB package name
- VPP Device
- https://gerrit.fd.io/r/#/c/vpp/+/21152/ - to fix occasional VPP device failure
- In total, 29 VPP device test cases were executed, and all passed on Arm servers.
- Currently VPP device is not executed per patch. Issue is still under investigation.
- VPP device job is running now and will be triggered per VPP patch and CSIT patch
- https://jenkins.fd.io/view/csit/job/csit-vpp-device-master-ubuntu1804-1n-tx2/
- https://jenkins.fd.io/view/vpp/job/vpp-csit-verify-device-master-1n-tx2/
- https://jenkins.fd.io/view/csit/job/csit-vpp-device-master-ubuntu1804-1n-tx2-semiweekly/
- https://jenkins.fd.io/view/csit/job/csit-vpp-device-master-ubuntu1804-1n-tx2-weekly/
- VPP Performance Test
- FD.io lab
- Will call a meeting once the ThunderX2 reaches Dean, inviting Juraj/Vexxhost people in FD.io lab to make sure the hardware is ready.
- Inform Prashant in FD.io lab for the incoming ThunderX2 - Lijian
- Require and install 1G NIC for the management port because of the 1G management switch. - Lijian
- Require two ThunderX2 (currently only one ThunderX2 in the lab) in FD.io lab
- Require one more ThunderX2 to form a normal cluster (1xThunderX + 2xThunderX2), to enable voting right for Arm servers in CSIT
- It’s 1RU blade ThunderX2.
- The machine will be handled by Dean’s team. Cambridge folk will set up the machine before sending it to FD.io lab.
- The machine should have a large amount of RAM: more than 120G, with 256G preferred.
- The machine should have three NICs (XL710-QDA2, 2x40G).
- The script assumes the two ThunderX2 have the same NIC type, same fiber SFP type, and NICs are plugged into same PCI slots.
- Server info in CSIT docs:
- https://gerrit.fd.io/r/gitweb?p=csit.git;a=blob;f=docs/lab/testbed_specifications.md;h=afa36ff56c7be09621e85bae6a1498aadf3a1981;hb=HEAD
- https://gerrit.fd.io/r/gitweb?p=csit.git;a=blob;f=docs/lab/testbed_specifications.md;h=afa36ff56c7be09621e85bae6a1498aadf3a1981;hb=HEAD#l495
- ThunderX1
- VPP
- https://tools.ietf.org/html/draft-hopps-ipsecme-iptfs-01 - From Christian
- Align Arm patches with VPP release plan. - Lijian
- Once our work items are added to release plan, the community is forced to review the patches and provide the feedback in a timely manner.
- Will check VPP release schedule and map with Arm Quarterly plan.
- Note down patches in community review and align them to VPP release plan.
- It has been challenging to do that in VPP.
- Vectorization
- Optimize eth_input_adv_and_flags_x4 is upstreamed and under community review.
- The patch is also enabled for x86. Will ask maintainer to review.
- Vectorize the data buffer index to data buffer pointer function.
- Jieqiang has finished code reviewing. Honnappa to review the patches.
- MAP with VPP - error is resolved. Sort of working. Record the details.
- There's no crash issue with latest VPP code.
- Investigate whether bihash operations are hot-spots in L2 throughput
- Cache misses and CRC32 calculation are possible opportunities.
- Lockless patch with IPv4 mtrie - Jason
- Finished reviewing the patches. Honnappa to review the patches.
- Investigating bi-hash lockless implementation - Jason
- Think of memory usage and optimization for smaller device/memory
- Think about running VPP on big-endian CPU, as there's mixed usage of gcc vector extensions and vector intrinsics
- To confirm the firmware, vpp side to enable mcbin cache stashing - Honnappa
- EPIC for next quarter:
- ethernet-input - Planned (Lijian). will implement for aarch64 128bits only
- Message Queue - Planned (Lijian)
- VPP svm_fifo patch performance optimization on A72 cores – Planned (Sirshak)
- TAS patch (Sirshak)
- MAP with VPP - Planned (Sirshak)
- Roadmap for TCP optimization
- Timer implementation - (Sirshak) - Indicative
- perf analysis - Planned (Sirshak)
- TCP state machine from weak memory model perspective
09/17/2019
- Attendees
- Tina Tsou
- Honnappa Nagarahalli
- Lijian Zhang
- Jieqiang Wang
- Jason Zhang
- Juraj Linkes
- Christian Hopps
- Khemendra Kumar
- Greeshma Katarki
- General
- Share VPN application and FD.io server access info to Greeshma and Khem.
- Show CSIT CI/CD, Jenkins status, log and the voting right if there's any failure - Juraj & Lijian
- Will sync up with Juraj/Stan on Thursday on CSIT demo to Arm product manager.
- CSIT
- VPP Performance Test
- Performance data on Arm in official release 19.08 is available.
- Performance job is merged. https://jenkins.fd.io/view/csit/job/csit-vpp-perf-mrr-daily-master-3n-tsh
- Trending data recorded with https://docs.fd.io/csit/master/trending/introduction/introduction.html
- Trending page: https://docs.fd.io/csit/master/trending/index.html
- If there's performance drop in CSIT performance testing, what action will be taken and who will take care of the drop?
- Currently trending data could be monitored manually only.
- https://jenkins.fd.io/job/csit-vpp-perf-verify-1908-3n-tsh/ takes a lot of time: 3 days, 61 hours (28 hours on x86)
- Daily job is running twice a day on x86; on Arm, it takes 16 hours and will run one time each day.
- Trying to fix all the failures with daily test in performance test. Basically almost all the tests passed locally.
- Some common failures due to Python bindings or something inside VPP image.
- VPP Path
- cross compilation in VPP PATH. Jira(Juraj): https://jira.fd.io/browse/CTP-3
- Tried cross-compiling with DPDK only.
- Initial cross-compiling is working fine.
- Patch is under review. https://gerrit.fd.io/r/#/c/vpp/+/21035/
- There are issues with building VPP distros.
- What is not finished is packaging. Currently we support Ubuntu's DEB packages for the aarch64 architecture (make PLATFORM=aarch64-generic pkg-deb); this is what I am currently trying to sort out ...
- We can put the cross-compiling knowledge into section 'for developers', vpp-docs project.
- Doesn't work; seems to be caused by improper cross-building tools. https://wiki.fd.io/view/VPP/Build_System_Deep_Dive
- https://gerrit.fd.io/r/#/c/vpp/+/21035/, VPP path cross compilation
- setup proper platform specific folder names in DEB packages
- proper architecture string included in the DEB package name
- VPP Device
- In total, 29 VPP device test cases were executed, and all passed on Arm servers.
- Currently VPP device is not executed per patch. Issue is still under investigation.
- VPP device job is running now and will be triggered per VPP patch and CSIT patch
- https://jenkins.fd.io/view/csit/job/csit-vpp-device-master-ubuntu1804-1n-tx2/
- https://jenkins.fd.io/view/vpp/job/vpp-csit-verify-device-master-1n-tx2/
- https://jenkins.fd.io/view/csit/job/csit-vpp-device-master-ubuntu1804-1n-tx2-semiweekly/
- https://jenkins.fd.io/view/csit/job/csit-vpp-device-master-ubuntu1804-1n-tx2-weekly/
- VPP Performance Test
- FD.io lab
- Inform Prashant in FD.io lab for the incoming ThunderX2 - Lijian
- Require and install 1G NIC for the management port because of the 1G management switch. - Lijian
- Require two ThunderX2 (currently only one ThunderX2 in the lab) in FD.io lab
- Require one more ThunderX2 to form a normal cluster (1xThunderX + 2xThunderX2), to enable voting right for Arm servers in CSIT
- It’s 1RU blade ThunderX2.
- The machine will be handled by Dean’s team. Cambridge folk will set up the machine before sending it to FD.io lab.
- The machine should have a large amount of RAM: more than 120G, with 256G preferred.
- The machine should have three NICs (XL710-QDA2, 2x40G).
- The script assumes the two ThunderX2 have the same NIC type, same fiber SFP type, and NICs are plugged into same PCI slots.
- Server info in CSIT docs:
- https://gerrit.fd.io/r/gitweb?p=csit.git;a=blob;f=docs/lab/testbed_specifications.md;h=afa36ff56c7be09621e85bae6a1498aadf3a1981;hb=HEAD
- https://gerrit.fd.io/r/gitweb?p=csit.git;a=blob;f=docs/lab/testbed_specifications.md;h=afa36ff56c7be09621e85bae6a1498aadf3a1981;hb=HEAD#l495
- ThunderX1
- VPP
- https://tools.ietf.org/html/draft-hopps-ipsecme-iptfs-01 - From Christian
- Align Arm patches with VPP release plan. - Lijian
- Once our work items are added to release plan, the community is forced to review the patches and provide the feedback in a timely manner.
- Will check VPP release schedule and map with Arm Quarterly plan.
- Note down patches in community review and align them to VPP release plan.
- It has been challenging to do that in VPP.
- Vectorization
- Optimize eth_input_adv_and_flags_x4 is upstreamed and under community review.
- The patch is also enabled for x86. Will ask maintainer to review.
- Vectorize the data buffer index to data buffer pointer function.
- Jieqiang has finished code reviewing. Honnappa to review the patches.
- MAP with VPP - error is resolved. Sort of working. Record the details.
- Crash issue is reproduced - Jieqiang
- Crash is gone after applying the patch.
- There's crash issue when executing 'show hardwares'
- https://gerrit.oss.arm.com/#/c/131831/
- Investigate whether bihash operations are hot-spots in L2 throughput
- Cache misses and CRC32 calculation are possible opportunities.
- Lockless patch with IPv4 mtrie - Jason
- Finished reviewing the patches. Honnappa to review the patches.
- Investigating bi-hash lockless implementation - Jason
- Think of memory usage and optimization for smaller device/memory
- Think about running VPP on big-endian CPU, as there's mixed usage of gcc vector extensions and vector intrinsics
- To confirm the firmware, vpp side to enable mcbin cache stashing - Honnappa
- EPIC for next quarter:
- ethernet-input - Planned (Lijian). will implement for aarch64 128bits only
- Message Queue - Planned (Lijian)
- VPP svm_fifo patch performance optimization on A72 cores – Planned (Sirshak)
- TAS patch (Sirshak)
- MAP with VPP - Planned (Sirshak)
- Roadmap for TCP optimization
- Timer implementation - (Sirshak) - Indicative
- perf analysis - Planned (Sirshak)
- TCP state machine from weak memory model perspective
09/10/2019
- Attendees
- Tina Tsou
- Honnappa Nagarahalli
- Lijian Zhang
- Jieqiang Wang
- Jason Zhang
- Juraj Linkes
- Christian Hopps
- Khemendra Kumar
- Greeshma Katarki
- General
- Share VPN application and FD.io server access info to Greeshma and Khem.
- Show CSIT CI/CD, Jenkins status, log and the voting right if there's any failure - Juraj & Lijian
- Talk to Song about it.
- CSIT
- VPP Performance Test
- Performance data on Arm in official release 19.08 is available.
- Performance job is merged. https://jenkins.fd.io/view/csit/job/csit-vpp-perf-mrr-daily-master-3n-tsh
- Trending data recorded with https://docs.fd.io/csit/master/trending/introduction/introduction.html
- Trending page: https://docs.fd.io/csit/master/trending/index.html
- If there's performance drop in CSIT performance testing, what action will be taken and who will take care of the drop?
- Currently trending data could be monitored manually only.
- https://jenkins.fd.io/job/csit-vpp-perf-verify-1908-3n-tsh/ takes a lot of time: 3 days, 61 hours (28 hours on x86)
- Daily job is running twice a day on x86; on Arm, it takes 16 hours and will run one time each day.
- Trying to fix all the failures with daily test in performance test. Basically almost all the tests passed locally.
- Some common failures due to Python bindings or something inside VPP image.
- VPP Path
- cross compilation in VPP PATH. Jira(Juraj): https://jira.fd.io/browse/CTP-3
- Tried cross-compiling with DPDK only.
- Initial cross-compiling is working fine.
- Patch is under review. https://gerrit.fd.io/r/#/c/vpp/+/21035/
- There are issues with building VPP distros.
- What is not finished is packaging. Currently we support Ubuntu's DEB packages for the aarch64 architecture (make PLATFORM=aarch64-generic pkg-deb); this is what I am currently trying to sort out ...
- We can put the cross-compiling knowledge into section 'for developers', vpp-docs project.
- Doesn't work; seems to be caused by improper cross-building tools. https://wiki.fd.io/view/VPP/Build_System_Deep_Dive
- https://gerrit.fd.io/r/#/c/vpp/+/21035/, VPP path cross compilation
- setup proper platform specific folder names in DEB packages
- proper architecture string included in the DEB package name
- VPP Device
- In total, 29 VPP device test cases were executed, and all passed on Arm servers.
- Currently VPP device is not executed per patch. Issue is still under investigation.
- VPP device job is running now and will be triggered per VPP patch and CSIT patch
- https://jenkins.fd.io/view/csit/job/csit-vpp-device-master-ubuntu1804-1n-tx2/
- https://jenkins.fd.io/view/vpp/job/vpp-csit-verify-device-master-1n-tx2/
- https://jenkins.fd.io/view/csit/job/csit-vpp-device-master-ubuntu1804-1n-tx2-semiweekly/
- https://jenkins.fd.io/view/csit/job/csit-vpp-device-master-ubuntu1804-1n-tx2-weekly/
- VPP Performance Test
- FD.io lab
- Inform Prashant in FD.io lab for the incoming ThunderX2 - Lijian
- Require and install 1G NIC for the management port because of the 1G management switch. - Lijian
- Require two ThunderX2 (currently only one ThunderX2 in the lab) in FD.io lab
- Require one more ThunderX2 to form a normal cluster (1xThunderX + 2xThunderX2), to enable voting right for Arm servers in CSIT
- It’s 1RU blade ThunderX2.
- The machine will be handled by Dean’s team. Cambridge folk will set up the machine before sending it to FD.io lab.
- The machine should have a large amount of RAM: more than 120G, with 256G preferred.
- The machine should have three NICs (XL710-QDA2, 2x40G).
- The script assumes the two ThunderX2 have the same NIC type, same fiber SFP type, and NICs are plugged into same PCI slots.
- Server info in CSIT docs:
- https://gerrit.fd.io/r/gitweb?p=csit.git;a=blob;f=docs/lab/testbed_specifications.md;h=afa36ff56c7be09621e85bae6a1498aadf3a1981;hb=HEAD
- https://gerrit.fd.io/r/gitweb?p=csit.git;a=blob;f=docs/lab/testbed_specifications.md;h=afa36ff56c7be09621e85bae6a1498aadf3a1981;hb=HEAD#l495
- ThunderX1
- VPP
- https://tools.ietf.org/html/draft-hopps-ipsecme-iptfs-01 - From Christian
- Align Arm patches with VPP release plan. - Lijian
- Once our work items are added to release plan, the community is forced to review the patches and provide the feedback in a timely manner.
- Will check VPP release schedule and map with Arm Quarterly plan.
- Note down patches in community review and align them to VPP release plan.
- It has been challenging to do that in VPP.
- Vectorization
- Optimize eth_input_adv_and_flags_x4 is upstreamed and under community review.
- The patch is also enabled for x86. Will ask maintainer to review.
- MAP with VPP - error is resolved. Sort of working. Record the details.
- Crash issue is reproduced - Jieqiang
- Crash is gone after applying the patch
- There's crash issue when executing 'show hardwares'
- https://gerrit.oss.arm.com/#/c/131831/
- Apply dual/quad optimization - Lijian
- Benchmarking on Cortex-A72 with dpdk, ethernet-input, ip4 rewrite, tx-output nodes
- Submitted patches on applying on dpdk-input, ethernet-input, ip4-input, ip4-rewrite nodes
- Will do bench-marking profiling on mcbin/Bluefield.
- Lockless patch with IPv4 mtrie - Jason
- Investigating bi-hash lockless implementation - Jason
- Think of memory usage and optimization for smaller device/memory
- Think about running VPP on big-endian CPU, as there's mixed usage of gcc vector extensions and vector intrinsics
- To confirm the firmware, vpp side to enable mcbin cache stashing - Honnappa
- EPIC for next quarter:
- ethernet-input - Planned (Lijian). will implement for aarch64 128bits only
- Message Queue - Planned (Lijian)
- VPP svm_fifo patch performance optimization on A72 cores – Planned (Sirshak)
- TAS patch (Sirshak)
- MAP with VPP - Planned (Sirshak)
- Roadmap for TCP optimization
- Timer implementation - (Sirshak) - Indicative
- perf analysis - Planned (Sirshak)
- TCP state machine from weak memory model perspective
09/03/2019
- Attendees
- Tina Tsou
- Honnappa Nagarahalli
- Lijian Zhang
- Jieqiang Wang
- Jason Zhang
- Juraj Linkes
- Christian Hopps
- Khemendra Kumar
- Greeshma Katarki
- General
- Share VPN application and FD.io server access info to Greeshma and Khem.
- Show CSIT CI/CD, Jenkins status, log and the voting right if there's any failure - Juraj & Lijian
- Talk to Song about it.
- CSIT
- VPP Performance Test
- Performance job is merged. https://jenkins.fd.io/view/csit/job/csit-vpp-perf-mrr-daily-master-3n-tsh
- Trending data recorded with https://docs.fd.io/csit/master/trending/introduction/introduction.html
- Trending page: https://docs.fd.io/csit/master/trending/index.html
- If there's performance drop in CSIT performance testing, what action will be taken and who will take care of the drop?
- Currently trending data could be monitored manually only.
- https://jenkins.fd.io/job/csit-vpp-perf-verify-1908-3n-tsh/ takes a lot of time: 3 days, 61 hours (28 hours on x86)
- Daily job is running twice a day on x86; on Arm, it takes 16 hours and will run one time each day.
- Trying to fix all the failures with daily test in performance test. Basically almost all the tests passed locally.
- Some common failures due to Python bindings or something inside VPP image.
- VPP Path
- cross compilation in VPP PATH. Jira(Juraj): https://jira.fd.io/browse/CTP-3
- Tried cross-compiling with DPDK only.
- Initial cross-compiling is working fine.
- Patch is under review. https://gerrit.fd.io/r/#/c/vpp/+/21035/
- There are issues with building VPP distros.
- What is not finished is packaging. Currently we support Ubuntu's DEB packages for the aarch64 architecture (make PLATFORM=aarch64-generic pkg-deb); this is what I am currently trying to sort out ...
- We can put the cross-compiling knowledge into section 'for developers', vpp-docs project.
- Doesn't work; seems to be caused by improper cross-building tools. https://wiki.fd.io/view/VPP/Build_System_Deep_Dive
- VPP Device
- VPP tap interface is not working on all Arm servers. Works on stable/1810, but not working on stable/1901.
- Issue is root-caused. Patch is in community review - https://gerrit.fd.io/r/c/vpp/+/21469
- In total, 29 VPP device test cases were executed; 26 passed, and only the 3 tap-related tests failed.
- Currently VPP device is not executed per patch. Issue is still under investigation.
- VPP device job is running now and will be triggered per VPP patch and CSIT patch
- https://jenkins.fd.io/view/csit/job/csit-vpp-device-master-ubuntu1804-1n-tx2/
- https://jenkins.fd.io/view/vpp/job/vpp-csit-verify-device-master-1n-tx2/
- https://jenkins.fd.io/view/csit/job/csit-vpp-device-master-ubuntu1804-1n-tx2-semiweekly/
- https://jenkins.fd.io/view/csit/job/csit-vpp-device-master-ubuntu1804-1n-tx2-weekly/
- VPP Performance Test
- FD.io lab
- Inform Prashant in FD.io lab for the incoming ThunderX2 - Lijian
- Require and install 1G NIC for the management port because of the 1G management switch. - Lijian
- Require two ThunderX2 (currently only one ThunderX2 in the lab) in FD.io lab
- Require one more ThunderX2 to form a normal cluster (1xThunderX + 2xThunderX2), to enable voting right for Arm servers in CSIT
- It’s 1RU blade ThunderX2.
- The machine will be handled by Dean’s team. Cambridge folk will set up the machine before sending it to FD.io lab.
- The machine should have a large amount of RAM: more than 120G, with 256G preferred.
- The machine should have three NICs (XL710-QDA2, 2x40G).
- The script assumes the two ThunderX2 have the same NIC type, same fiber SFP type, and NICs are plugged into same PCI slots.
- Server info in CSIT docs:
- https://gerrit.fd.io/r/gitweb?p=csit.git;a=blob;f=docs/lab/testbed_specifications.md;h=afa36ff56c7be09621e85bae6a1498aadf3a1981;hb=HEAD
- https://gerrit.fd.io/r/gitweb?p=csit.git;a=blob;f=docs/lab/testbed_specifications.md;h=afa36ff56c7be09621e85bae6a1498aadf3a1981;hb=HEAD#l495
- ThunderX1
- VPP
- https://tools.ietf.org/html/draft-hopps-ipsecme-iptfs-01 - From Christian
- Align Arm patches with VPP release plan. - Lijian
- Once our work items are added to release plan, the community is forced to review the patches and provide the feedback in a timely manner.
- Will check VPP release schedule and map with Arm Quarterly plan.
- Note down patches in community review and align them to VPP release plan.
- It has been challenging to do that in VPP.
- Vectorization
- Optimize eth_input_adv_and_flags_x4 is upstreamed and under community review.
- The patch is also enabled for x86. Will ask maintainer to review.
- MAP with VPP - error is resolved. Sort of working. Record the details.
- Crash issue is reproduced - Jieqiang
- Apply dual/quad optimization - Lijian
- Benchmarking on Cortex-A72 with dpdk, ethernet-input, ip4 rewrite, tx-output nodes
- Submitted patches on applying on dpdk-input, ethernet-input, ip4-input, ip4-rewrite nodes
- Will do bench-marking profiling on mcbin/Bluefield.
- Lockless patch with IPv4 mtrie - Jason
- Investigating bi-hash lockless implementation - Jason
- Think of memory usage and optimization for smaller device/memory
- Think about running VPP on big-endian CPU, as there's mixed usage of gcc vector extensions and vector intrinsics
- To confirm the firmware, vpp side to enable mcbin cache stashing - Honnappa
- EPIC for next quarter:
- ethernet-input - Planned (Lijian). will implement for aarch64 128bits only
- Message Queue - Planned (Lijian)
- VPP svm_fifo patch performance optimization on A72 cores – Planned (Sirshak)
- TAS patch (Sirshak)
- MAP with VPP - Planned (Sirshak)
- Roadmap for TCP optimization
- Timer implementation - (Sirshak) - Indicative
- perf analysis - Planned (Sirshak)
- TCP state machine from weak memory model perspective
08/27/2019
- Attendees
- Tina Tsou
- Honnappa Nagarahalli
- Lijian Zhang
- Jieqiang Wang
- Jason Zhang
- Juraj Linkes
- Christian Hopps
- Khemendra Kumar
- Greeshma Katarki
- General
- Share VPN application and FD.io server access info to Greeshma and Khem.
- CSIT
- VPP Performance Test
- Performance job is merged. https://jenkins.fd.io/view/csit/job/csit-vpp-perf-mrr-daily-master-3n-tsh
- Trending data recorded with https://docs.fd.io/csit/master/trending/introduction/introduction.html
- Trending page: https://docs.fd.io/csit/master/trending/index.html
- If there's performance drop in CSIT performance testing, what action will be taken and who will take care of the drop?
- Currently trending data could be monitored manually only.
- https://jenkins.fd.io/job/csit-vpp-perf-verify-1908-3n-tsh/ takes a lot of time: 3 days, 61 hours (28 hours on x86)
- Daily job is running twice a day on x86; on Arm, it takes 16 hours and will run one time each day.
- Trying to fix all the failures with daily test in performance test. Basically almost all the tests passed locally.
- Some common failures due to Python bindings or something inside VPP image.
- VPP Path
- cross compilation in VPP PATH. Jira(Juraj): https://jira.fd.io/browse/CTP-3
- Tried cross-compiling with DPDK only.
- Initial cross-compiling is working fine.
- Patch is under review. https://gerrit.fd.io/r/#/c/vpp/+/21035/
- There are issues with building VPP distros.
- What is not finished is packaging. Currently we support Ubuntu's DEB packages for the aarch64 architecture (make PLATFORM=aarch64-generic pkg-deb); this is what I am currently trying to sort out ...
- We can put the cross-compiling knowledge into section 'for developers', vpp-docs project.
- Doesn't work; seems to be caused by improper cross-building tools. https://wiki.fd.io/view/VPP/Build_System_Deep_Dive
- VPP Device
- VPP tap interface is not working on all Arm servers. Works on stable/1810, but not working on stable/1901.
- Issue is root-caused. Patch is in community review - https://gerrit.fd.io/r/c/vpp/+/21469
- In total, 29 VPP device test cases were executed; 26 passed, and only the 3 tap-related tests failed.
- Currently VPP device is not executed per patch. Issue is still under investigation.
- VPP device job is running now and will be triggered per VPP patch and CSIT patch
- https://jenkins.fd.io/view/csit/job/csit-vpp-device-master-ubuntu1804-1n-tx2/
- https://jenkins.fd.io/view/vpp/job/vpp-csit-verify-device-master-1n-tx2/
- https://jenkins.fd.io/view/csit/job/csit-vpp-device-master-ubuntu1804-1n-tx2-semiweekly/
- https://jenkins.fd.io/view/csit/job/csit-vpp-device-master-ubuntu1804-1n-tx2-weekly/
- VPP Performance Test
- FD.io lab
- Inform Prashant in FD.io lab for the incoming ThunderX2 - Lijian
- Require two ThunderX2 (currently only one ThunderX2 in the lab) in FD.io lab
- Require one more ThunderX2 to form a normal cluster (1xThunderX + 2xThunderX2), to enable voting right for Arm servers in CSIT
- It’s 1RU blade ThunderX2.
- The machine will be handled by Dean’s team. Cambridge folk will set up the machine before sending it to FD.io lab.
- The machine should have a large amount of RAM: more than 120G, with 256G preferred.
- The machine should have three NICs (XL710-QDA2, 2x40G).
- The script assumes the two ThunderX2 have the same NIC type, same fiber SFP type, and NICs are plugged into same PCI slots.
- Server info in CSIT docs:
- https://gerrit.fd.io/r/gitweb?p=csit.git;a=blob;f=docs/lab/testbed_specifications.md;h=afa36ff56c7be09621e85bae6a1498aadf3a1981;hb=HEAD
- https://gerrit.fd.io/r/gitweb?p=csit.git;a=blob;f=docs/lab/testbed_specifications.md;h=afa36ff56c7be09621e85bae6a1498aadf3a1981;hb=HEAD#l495
- ThunderX1
- VPP
- https://tools.ietf.org/html/draft-hopps-ipsecme-iptfs-01 - From Christian
- Align Arm patches with VPP release plan. - Lijian
- Once our work items are added to release plan, the community is forced to review the patches and provide the feedback in a timely manner.
- Will check VPP release schedule and map with Arm Quarterly plan.
- Note down patches in community review and align them to VPP release plan.
- It has been challenging to do that in VPP.
- Vectorization
- Optimize eth_input_adv_and_flags_x4 is upstreamed and under community review.
- The patch is also enabled for x86. Will ask maintainer to review.
- MAP with VPP - error is resolved. Sort of working. Record the details.
- Run VPP with MAP and reproduce the previous crash/failures - Jieqiang
- Got latest license to install MAP on Shanghai server.
- Apply dual/quad optimization - Lijian
- Benchmarking on Cortex-A72 with dpdk, ethernet-input, ip4 rewrite, tx-output nodes
- Submitted patches on applying on dpdk-input, ethernet-input, ip4-input, ip4-rewrite nodes
- Will do bench-marking profiling on mcbin/Bluefield.
- Lockless patch with IPv4 mtrie - Jason
- Investigating bi-hash lockless implementation - Jason
- Think of memory usage and optimization for smaller device/memory
- Think about running VPP on big-endian CPU, as there's mixed usage of gcc vector extensions and vector intrinsics
- To confirm the firmware, vpp side to enable mcbin cache stashing - Honnappa
- EPIC for next quarter:
- ethernet-input - Planned (Lijian). will implement for aarch64 128bits only
- Message Queue - Planned (Lijian)
- VPP svm_fifo patch performance optimization on A72 cores – Planned (Sirshak)
- TAS patch (Sirshak)
- MAP with VPP - Planned (Sirshak)
- Roadmap for TCP optimization
- Timer implementation - (Sirshak) - Indicative
- perf analysis - Planned (Sirshak)
- TCP state machine from weak memory model perspective
08/20/2019
- Attendees
- Tina Tsou
- Honnappa Nagarahalli
- Lijian Zhang
- Jieqiang Wang
- Jason Zhang
- Juraj Linkes
- Christian Hopps
- Khemendra Kumar
- Greeshma Katarki
- General
- Share VPN application and FD.io server access info to Greeshma and Khem.
- CSIT
- VPP Performance Test
- Performance job is merged. https://jenkins.fd.io/view/csit/job/csit-vpp-perf-mrr-daily-master-3n-tsh
- Trending data recorded with https://docs.fd.io/csit/master/trending/introduction/introduction.html
- Trending page: https://docs.fd.io/csit/master/trending/index.html
- If there's performance drop in CSIT performance testing, what action will be taken and who will take care of the drop?
- Currently, trending data can only be monitored manually.
- https://jenkins.fd.io/job/csit-vpp-perf-verify-1908-3n-tsh/ takes a lot of time: 3 days, 61 hours (28 hours on x86)
- Daily job is running twice a day on x86; on Arm, it takes 16 hours and will run one time each day.
- Trying to fix all the failures in the daily performance test. Almost all the tests passed locally.
- Some common failures due to Python bindings or something inside VPP image.
- VPP Path
- cross compilation in VPP PATH. Jira(Juraj): https://jira.fd.io/browse/CTP-3
- Tried cross-compiling with DPDK only.
- Initial cross-compiling is working fine.
- Patch is under review. https://gerrit.fd.io/r/#/c/vpp/+/21035/
- We can put the cross-compiling knowledge into section 'for developers', vpp-docs project.
- Doesn't work, seems to be caused by improper cross-building tools. https://wiki.fd.io/view/VPP/Build_System_Deep_Dive
- VPP Device
- VPP tap interface is not working on all Arm servers. Works on stable/1810, but not working on stable/1901.
- on Arm, different default memory map regions for normal page and huge page;
- vring with huge-page mapped to normal page region addresses is not working.
- 1. Reserve 16G VA space for future usage, automatic, private, anonymous and without HUGETLB option.
- base = mmap ((void *) 0x410000000, 16ULL << 30, PROT_NONE, MAP_PRIVATE | MAP_ANONYMOUS, -1, 0);
- 2. From the 16G VA space, pick up a 40M unused space, redo mmap() with the HUGETLB option, address fixed
- vaWithinBase = mmap (base, 40 << 20, PROT_READ | PROT_WRITE, MAP_SHARED | MAP_FIXED | MAP_HUGETLB | MAP_LOCKED, fd, 0);
- 3. Use vaWithinBase to initialize vring and vring_desc
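The three steps above can be sketched as two small helpers. This is only an illustration under stated assumptions, not the code used in VPP: `reserve_va` and `map_within` are hypothetical names, the reservation uses a NULL hint rather than the fixed 0x410000000 address from the minutes, and `MAP_NORESERVE` is added so that a large `PROT_NONE` reservation does not count against overcommit.

```c
#ifndef _GNU_SOURCE
#define _GNU_SOURCE
#endif
#include <assert.h>
#include <stddef.h>
#include <sys/mman.h>

/* Step 1: reserve a range of virtual addresses. PROT_NONE means nothing in
 * the range is accessible until it is remapped. Returns NULL on failure. */
static void *reserve_va(size_t size)
{
    void *base = mmap(NULL, size, PROT_NONE,
                      MAP_PRIVATE | MAP_ANONYMOUS | MAP_NORESERVE, -1, 0);
    return base == MAP_FAILED ? NULL : base;
}

/* Step 2: map `len` bytes at `base + offset`, replacing that part of the
 * reservation. MAP_FIXED is always added; the caller supplies the rest of
 * the flags (and an fd, or -1 for anonymous memory). Returns NULL on failure. */
static void *map_within(void *base, size_t offset, size_t len,
                        int extra_flags, int fd)
{
    void *va;

    assert(base != NULL);
    va = mmap((char *)base + offset, len, PROT_READ | PROT_WRITE,
              MAP_FIXED | extra_flags, fd, 0);
    return va == MAP_FAILED ? NULL : va;
}
```

For the huge-page case in the minutes, `extra_flags` would be `MAP_SHARED | MAP_HUGETLB | MAP_LOCKED` with a hugetlbfs-backed `fd`, and `offset` and `len` would need to be multiples of the huge page size. Step 3 then uses the returned address for the vring and vring_desc.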
- VPP tap interface is not working on all Arm servers. Works on stable/1810, but not working on stable/1901.
- In total, 29 VPP device test cases were executed: 26 passed, and only 3 tap-related tests failed.
- Currently VPP device is not executed per patch. Issue is still under investigation.
- VPP device job is running now and will be triggered per VPP patch and CSIT patch
- https://jenkins.fd.io/view/csit/job/csit-vpp-device-master-ubuntu1804-1n-tx2/
- https://jenkins.fd.io/view/vpp/job/vpp-csit-verify-device-master-1n-tx2/
- https://jenkins.fd.io/view/csit/job/csit-vpp-device-master-ubuntu1804-1n-tx2-semiweekly/
- https://jenkins.fd.io/view/csit/job/csit-vpp-device-master-ubuntu1804-1n-tx2-weekly/
- FD.io lab
- Require two ThunderX2 (currently only one ThunderX2 in the lab) in FD.io lab
- Require one more ThunderX2 to form a normal cluster (1xThunderX + 2xThunderX2), to enable voting right for Arm servers in CSIT
- It’s 1RU blade ThunderX2.
- The machine will be handled by Dean’s team. Cambridge folk will set up the machine before sending it to FD.io lab.
- The machine should have a large amount of RAM: more than 120G, with 256G preferred.
- The machine should have three NICs (XL710-QDA2, 2x40G).
- The script assumes the two ThunderX2 have the same NIC type, same fiber SFP type, and NICs are plugged into same PCI slots.
- Server info in CSIT docs:
- https://gerrit.fd.io/r/gitweb?p=csit.git;a=blob;f=docs/lab/testbed_specifications.md;h=afa36ff56c7be09621e85bae6a1498aadf3a1981;hb=HEAD
- https://gerrit.fd.io/r/gitweb?p=csit.git;a=blob;f=docs/lab/testbed_specifications.md;h=afa36ff56c7be09621e85bae6a1498aadf3a1981;hb=HEAD#l495
- ThunderX1
- VPP
- https://tools.ietf.org/html/draft-hopps-ipsecme-iptfs-01 - From Christian
- Align Arm patches with VPP release plan. - Lijian
- Once our work items are added to release plan, the community is forced to review the patches and provide the feedback in a timely manner.
- Will check VPP release schedule and map with Arm Quarterly plan.
- Note down patches in community review and align them to VPP release plan.
- It has been challenging to do that in VPP.
- Vectorization
- The eth_input_adv_and_flags_x4 optimization is upstreamed and under community review.
- The patch is also enabled for x86. Will ask maintainer to review.
- MAP with VPP - error is resolved. Sort of working. Record the details.
- Run VPP with MAP and reproduce the previous crash/failures - Jieqiang
- Apply dual/quad optimization - Lijian
- Benchmarking on Cortex-A72 with dpdk, ethernet-input, ip4 rewrite, tx-output nodes
- Submitted patches applying it to the dpdk-input, ethernet-input, ip4-input, and ip4-rewrite nodes
- Will do benchmarking and profiling on mcbin/Bluefield.
- Lockless patch with IPv4 mtrie - Jason
- Investigating bi-hash lockless implementation - Jason
- Think of memory usage and optimization for smaller device/memory
- Think about running VPP on big-endian CPUs, as there is mixed usage of gcc vector extensions and vector intrinsics
- To confirm the firmware, vpp side to enable mcbin cache stashing - Honnappa
- EPIC for next quarter:
- ethernet-input - Planned (Lijian). will implement for aarch64 128bits only
- Message Queue - Planned (Lijian)
- VPP svm_fifo patch performance optimization on A72 cores – Planned (Sirshak)
- TAS patch (Sirshak)
- MAP with VPP - Planned (Sirshak)
- Roadmap for TCP optimization
- Timer implementation - (Sirshak) - Indicative
- perf analysis - Planned (Sirshak)
- TCP state machine from weak memory model perspective
08/13/2019
- Attendees
- Tina Tsou
- Honnappa Nagarahalli
- Lijian Zhang
- Jieqiang Wang
- Jason Zhang
- Juraj Linkes
- Christian Hopps
- Khemendra Kumar
- Greeshma Katarki
- General
- Share VPN application and FD.io server access info to Greeshma and Khem.
- CSIT
- VPP Performance Test
- Performance job is merged. https://jenkins.fd.io/view/csit/job/csit-vpp-perf-mrr-daily-master-3n-tsh
- Trending data recorded with https://docs.fd.io/csit/master/trending/introduction/introduction.html
- Daily job is running twice a day on x86; on Arm, it takes 16 hours and will run one time each day.
- Trying to fix all the failures in the daily performance test. Almost all the tests passed locally.
- Only 1 out of 199 test cases failed, 8 test cases show random 'show hardware-interfaces' failure.
- Some failures are related to 'show hardware-interfaces'/'show vhost dump' time-outs.
- Juraj to send Lijian the commands/APIs in random dump failure.
- https://jira.fd.io/browse/CSIT-1453
- SFP eeprom dump is enabled with 'show hardware-interfaces detail' only. Patch is merged.
- Juraj will change CSIT script with 'show hardware-interfaces verbose', https://gerrit.fd.io/r/#/c/csit/+/21085/
- CSIT patch is merged.
- Adding Taishan test bed to CSIT Status: https://gerrit.fd.io/r/#/c/16850/
- Target: master trending job - firstly create trending graph from daily data; then create release report(require some manual work)
- Patch to generate daily data and trending graph is committed.
- VPP Path
- cross compilation in VPP PATH. Jira(Juraj): https://jira.fd.io/browse/CTP-3
- Tried cross-compiling with DPDK only.
- Initial cross-compiling is working fine. Patch is under review. https://gerrit.fd.io/r/#/c/vpp/+/21035/
- We can put the cross-compiling knowledge into section 'for developers', vpp-docs project.
- Doesn't work, seems to be caused by improper cross-building tools. https://wiki.fd.io/view/VPP/Build_System_Deep_Dive
- VPP Device
- VPP tap interface is not working on all Arm servers. Works on stable/1810, but not working on stable/1901.
- Went through the whole patch, the pmalloc module and the tap interface code, but could not identify the root cause - Lijian
- pmalloc-based buffer allocate/free seems to be causing the problem.
- mmap() regions with normal page and huge-page have separate VA spaces.
- VPP tap interface is not working on all Arm servers. Works on stable/1810, but not working on stable/1901.
- In total, 29 VPP device test cases were executed: 26 passed, and only 3 tap-related tests failed.
- Currently VPP device is not executed per patch. Issue is still under investigation.
- VPP device job is running now and will be triggered per VPP patch and CSIT patch
- https://jenkins.fd.io/view/csit/job/csit-vpp-device-master-ubuntu1804-1n-tx2/
- https://jenkins.fd.io/view/vpp/job/vpp-csit-verify-device-master-1n-tx2/
- https://jenkins.fd.io/view/csit/job/csit-vpp-device-master-ubuntu1804-1n-tx2-semiweekly/
- https://jenkins.fd.io/view/csit/job/csit-vpp-device-master-ubuntu1804-1n-tx2-weekly/
- FD.io lab
- Require two ThunderX2 (currently only one ThunderX2 in the lab) in FD.io lab
- Require one more ThunderX2 to form a normal cluster (1xThunderX + 2xThunderX2), to enable voting right for Arm servers in CSIT
- It’s 1RU blade ThunderX2.
- The machine will be handled by Dean’s team. Cambridge folk will set up the machine before sending it to FD.io lab.
- The machine should have a large amount of RAM: more than 120G, with 256G preferred.
- The machine should have three NICs (XL710-QDA2, 2x40G).
- The script assumes the two ThunderX2 have the same NIC type, same fiber SFP type, and NICs are plugged into same PCI slots.
- Server info in CSIT docs:
- https://gerrit.fd.io/r/gitweb?p=csit.git;a=blob;f=docs/lab/testbed_specifications.md;h=afa36ff56c7be09621e85bae6a1498aadf3a1981;hb=HEAD
- https://gerrit.fd.io/r/gitweb?p=csit.git;a=blob;f=docs/lab/testbed_specifications.md;h=afa36ff56c7be09621e85bae6a1498aadf3a1981;hb=HEAD#l495
- ThunderX1
- VPP
- https://tools.ietf.org/html/draft-hopps-ipsecme-iptfs-01 - From Christian
- Align Arm patches with VPP release plan. - Lijian
- Once our work items are added to release plan, the community is forced to review the patches and provide the feedback in a timely manner.
- Will check VPP release schedule and map with Arm Quarterly plan.
- Note down patches in community review and align them to VPP release plan.
- It has been challenging to do that in VPP.
- Vectorization
- The eth_input_adv_and_flags_x4 optimization is upstreamed and under community review.
- The patch is also enabled for x86. Will ask maintainer to review.
- Spinlock/read-write lock optimization - Jason
- All 7 patches are merged.
- MAP with VPP - error is resolved. Sort of working. Record the details.
- Jieqiang checked the video by Sirshak
- Apply dual/quad optimization - Lijian
- Benchmarking on Cortex-A72 with dpdk, ethernet-input, ip4 rewrite, tx-output nodes
- Submitted patches applying it to the dpdk-input, ethernet-input, ip4-input, and ip4-rewrite nodes
- Will do benchmarking and profiling on mcbin/Bluefield.
- Lockless patch with IPv4 mtrie - Jason
- Investigating bi-hash lockless implementation - Jason
- Think of memory usage and optimization for smaller device/memory
- Think about running VPP on big-endian CPUs, as there is mixed usage of gcc vector extensions and vector intrinsics
- To confirm the firmware, vpp side to enable mcbin cache stashing - Honnappa
- EPIC for next quarter:
- ethernet-input - Planned (Lijian). will implement for aarch64 128bits only
- Message Queue - Planned (Lijian)
- VPP svm_fifo patch performance optimization on A72 cores – Planned (Sirshak)
- TAS patch (Sirshak)
- MAP with VPP - Planned (Sirshak)
- Roadmap for TCP optimization
- Timer implementation - (Sirshak) - Indicative
- perf analysis - Planned (Sirshak)
- TCP state machine from weak memory model perspective
08/06/2019
- Attendees
- Tina Tsou
- Honnappa Nagarahalli
- Lijian Zhang
- Jieqiang Wang
- Jason Zhang
- Juraj Linkes
- Christian Hopps
- General
- CSIT
- VPP Performance Test
- Performance job is merged. https://jenkins.fd.io/view/csit/job/csit-vpp-perf-mrr-daily-master-3n-tsh
- Daily job is running twice a day on x86; on Arm, it takes 16 hours and will run one time each day.
- Trying to fix all the failures in the daily performance test. Almost all the tests passed locally.
- Only 1 out of 199 test cases failed, 8 test cases show random 'show hardware-interfaces' failure.
- Some failures are related to 'show hardware-interfaces'/'show vhost dump' time-outs.
- Juraj to send Lijian the commands/APIs in random dump failure.
- https://jira.fd.io/browse/CSIT-1453
- SFP eeprom dump is enabled with 'show hardware-interfaces detail' only. Patch is merged.
- Juraj will change CSIT script with 'show hardware-interfaces verbose', https://gerrit.fd.io/r/#/c/csit/+/21085/
- Adding Taishan test bed to CSIT Status: https://gerrit.fd.io/r/#/c/16850/
- Target: master trending job - firstly create trending graph from daily data; then create release report(require some manual work)
- Patch to generate daily data and trending graph is committed.
- VPP Path
- cross compilation in VPP PATH. Jira(Juraj): https://jira.fd.io/browse/CTP-3
- Tried cross-compiling with DPDK only.
- We can put the cross-compiling knowledge into section 'for developers', vpp-docs project.
- Doesn't work, seems to be caused by improper cross-building tools. https://wiki.fd.io/view/VPP/Build_System_Deep_Dive
- VPP Device
- VPP tap interface is not working on all Arm servers. Works on stable/1810, but not working on stable/1901.
- Went through the whole patch, the pmalloc module and the tap interface code, but could not identify the root cause - Lijian
- pmalloc-based buffer allocate/free seems to be causing the problem.
- VPP tap interface is not working on all Arm servers. Works on stable/1810, but not working on stable/1901.
- In total, 29 VPP device test cases were executed: 26 passed, and only 3 tap-related tests failed.
- VPP device job is running now and will be triggered per VPP patch and CSIT patch
- https://jenkins.fd.io/view/csit/job/csit-vpp-device-master-ubuntu1804-1n-tx2/
- https://jenkins.fd.io/view/vpp/job/vpp-csit-verify-device-master-1n-tx2/
- https://jenkins.fd.io/view/csit/job/csit-vpp-device-master-ubuntu1804-1n-tx2-semiweekly/
- https://jenkins.fd.io/view/csit/job/csit-vpp-device-master-ubuntu1804-1n-tx2-weekly/
- FD.io lab
- Require two ThunderX2 (currently only one ThunderX2 in the lab) in FD.io lab
- Require one more ThunderX2 to form a normal cluster (1xThunderX + 2xThunderX2), to enable voting right for Arm servers in CSIT
- It’s 1RU blade ThunderX2.
- The machine will be handled by Dean’s team. Cambridge folk will set up the machine before sending it to FD.io lab.
- The machine should have a large amount of RAM: more than 120G, with 256G preferred.
- The machine should have three NICs (XL710-QDA2, 2x40G).
- The script assumes the two ThunderX2 have the same NIC type, same fiber SFP type, and NICs are plugged into same PCI slots.
- Server info in CSIT docs:
- https://gerrit.fd.io/r/gitweb?p=csit.git;a=blob;f=docs/lab/testbed_specifications.md;h=afa36ff56c7be09621e85bae6a1498aadf3a1981;hb=HEAD
- https://gerrit.fd.io/r/gitweb?p=csit.git;a=blob;f=docs/lab/testbed_specifications.md;h=afa36ff56c7be09621e85bae6a1498aadf3a1981;hb=HEAD#l495
- ThunderX1
- VPP
- https://tools.ietf.org/html/draft-hopps-ipsecme-iptfs-01 - From Christian
- Align Arm patches with VPP release plan. - Lijian
- Once our work items are added to release plan, the community is forced to review the patches and provide the feedback in a timely manner.
- Will check VPP release schedule and map with Arm Quarterly plan.
- Note down patches in community review and align them to VPP release plan.
- It has been challenging to do that in VPP.
- Vectorization
- The eth_input_adv_and_flags_x4 optimization is upstreamed and under community review.
- The patch is also enabled for x86. Will ask maintainer to review.
- Spinlock/read-write lock optimization - Jason
- All 7 patches are merged.
- MAP with VPP - error is resolved. Sort of working. Record the details.
- Jieqiang checked the video by Sirshak
- Apply dual/quad optimization - Lijian
- Benchmarking on Cortex-A72 with dpdk, ethernet-input, ip4 rewrite, tx-output nodes
- Submitted patches applying it to the dpdk-input, ethernet-input, ip4-input, and ip4-rewrite nodes
- Will do benchmarking and profiling on mcbin/Bluefield.
- Lockless patch with IPv4 mtrie - Jason
- Investigating bi-hash lockless implementation - Jason
- Think of memory usage and optimization for smaller device/memory
- Think about running VPP on big-endian CPUs, as there is mixed usage of gcc vector extensions and vector intrinsics
- To confirm the firmware, vpp side to enable mcbin cache stashing - Honnappa
- EPIC for next quarter:
- ethernet-input - Planned (Lijian). will implement for aarch64 128bits only
- Message Queue - Planned (Lijian)
- VPP svm_fifo patch performance optimization on A72 cores – Planned (Sirshak)
- TAS patch (Sirshak)
- MAP with VPP - Planned (Sirshak)
- Roadmap for TCP optimization
- Timer implementation - (Sirshak) - Indicative
- perf analysis - Planned (Sirshak)
- TCP state machine from weak memory model perspective
07/30/2019
- Attendees
- Tina Tsou
- Honnappa Nagarahalli
- Lijian Zhang
- Jieqiang Wang
- Jason Zhang
- Juraj Linkes
- Christian Hopps
- General
- CSIT
- VPP Performance Test
- Performance job is merged. https://jenkins.fd.io/view/csit/job/csit-vpp-perf-mrr-daily-master-3n-tsh
- Trying to fix all the failures in the daily performance test. Almost all the tests passed locally.
- Only 1 out of 199 test cases failed, 8 test cases show random 'show hardware-interfaces' failure.
- Some failures are related to 'show hardware-interfaces'/'show vhost dump' time-outs.
- Juraj to send Lijian the commands/APIs in random dump failure.
- https://jira.fd.io/browse/CSIT-1453
- Will check details with x86 server also. It is slow on x86 too, but takes only 5 sec there versus 40 sec on Taishan - Lijian
- ‘show hardware-interfaces’ is quite time-consuming because it reads the eeprom of the SFP via a software-emulated I2C bus.
- Adding Taishan test bed to CSIT Status: https://gerrit.fd.io/r/#/c/16850/
- Target: master trending job - firstly create trending graph from daily data; then create release report(require some manual work)
- VPP Path
- cross compilation in VPP PATH. Jira(Juraj): https://jira.fd.io/browse/CTP-3
- Tried cross-compiling with DPDK only.
- We can put the cross-compiling knowledge into section 'for developers', vpp-docs project.
- Doesn't work, seems to be caused by improper cross-building tools. https://wiki.fd.io/view/VPP/Build_System_Deep_Dive
- VPP Device
- VPP tap interface is not working on all Arm servers. Works on stable/1810, but not working on stable/1901.
- Went through the whole patch, the pmalloc module and the tap interface code, but could not identify the root cause - Lijian
- pmalloc module test cases failed on Arm server due to sudo privilege.
- VPP tap interface is not working on all Arm servers. Works on stable/1810, but not working on stable/1901.
- In total, 35 VPP device test cases passed, and only 3 tap-related tests failed.
- VPP device job is running now and will be triggered per VPP patch and CSIT patch
- https://jenkins.fd.io/view/csit/job/csit-vpp-device-master-ubuntu1804-1n-tx2/
- https://jenkins.fd.io/view/vpp/job/vpp-csit-verify-device-master-1n-tx2/
- https://jenkins.fd.io/view/csit/job/csit-vpp-device-master-ubuntu1804-1n-tx2-semiweekly/
- https://jenkins.fd.io/view/csit/job/csit-vpp-device-master-ubuntu1804-1n-tx2-weekly/
- FD.io lab
- Require two ThunderX2 (currently only one ThunderX2 in the lab) in FD.io lab
- Require one more ThunderX2 to form a normal cluster (1xThunderX + 2xThunderX2), to enable voting right for Arm servers in CSIT
- It’s 1RU blade ThunderX2.
- The machine will be handled by Dean’s team. Cambridge folk will set up the machine before sending it to FD.io lab.
- The machine should have a large amount of RAM: more than 120G, with 256G preferred.
- The machine should have three NICs (XL710-QDA2, 2x40G).
- The script assumes the two ThunderX2 have the same NIC type, same fiber SFP type, and NICs are plugged into same PCI slots.
- Server info in CSIT docs:
- https://gerrit.fd.io/r/gitweb?p=csit.git;a=blob;f=docs/lab/testbed_specifications.md;h=afa36ff56c7be09621e85bae6a1498aadf3a1981;hb=HEAD
- https://gerrit.fd.io/r/gitweb?p=csit.git;a=blob;f=docs/lab/testbed_specifications.md;h=afa36ff56c7be09621e85bae6a1498aadf3a1981;hb=HEAD#l495
- ThunderX1
- VPP
- https://tools.ietf.org/html/draft-hopps-ipsecme-iptfs-01 - From Christian
- Align Arm patches with VPP release plan.
- Once our work items are added to release plan, the community is forced to review the patches and provide the feedback in a timely manner.
- Will check VPP release schedule and map with Arm Quarterly plan.
- Note down patches in community review and align them to VPP release plan.
- It has been challenging to do that in VPP.
- VPP host-stack Hotspots
- Decrease or remove ATOMIC_ACQUIRE atomics in foreach_device_and_queue
- Message queue: optimize it with relaxed atomic intrinsics - Lijian
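As a rough illustration of what optimizing a message queue with relaxed atomics can mean, the sketch below polls the producer index with a relaxed load and issues an acquire fence only once there is something to read, so the empty-queue path carries no acquire ordering. It assumes a single producer and a single consumer and uses C11 `<stdatomic.h>`. `mq_t`, `mq_init`, `mq_push` and `mq_pop` are hypothetical names and this is not VPP's message queue code.

```c
#include <assert.h>
#include <stdatomic.h>
#include <stdbool.h>
#include <stddef.h>
#include <stdint.h>

#define MQ_SIZE 8u   /* hypothetical depth; must be a power of two */

typedef struct {
    _Atomic uint32_t head;     /* advanced only by the producer */
    _Atomic uint32_t tail;     /* advanced only by the consumer */
    uint32_t slots[MQ_SIZE];
} mq_t;

static void mq_init(mq_t *q)
{
    assert(q != NULL);
    atomic_init(&q->head, 0);
    atomic_init(&q->tail, 0);
}

/* Producer side: returns false when the queue is full. */
static bool mq_push(mq_t *q, uint32_t v)
{
    uint32_t head = atomic_load_explicit(&q->head, memory_order_relaxed);
    uint32_t tail = atomic_load_explicit(&q->tail, memory_order_acquire);

    if (head - tail == MQ_SIZE)
        return false;
    q->slots[head % MQ_SIZE] = v;
    /* Release: the slot write becomes visible before the new head. */
    atomic_store_explicit(&q->head, head + 1, memory_order_release);
    return true;
}

/* Consumer side: returns false when the queue is empty. */
static bool mq_pop(mq_t *q, uint32_t *out)
{
    uint32_t tail = atomic_load_explicit(&q->tail, memory_order_relaxed);
    /* Relaxed poll: no ordering cost while the queue is empty. */
    uint32_t head = atomic_load_explicit(&q->head, memory_order_relaxed);

    if (head == tail)
        return false;
    /* Acquire fence pairs with the producer's release store to head. */
    atomic_thread_fence(memory_order_acquire);
    *out = q->slots[tail % MQ_SIZE];
    atomic_store_explicit(&q->tail, tail + 1, memory_order_release);
    return true;
}
```

On a weakly ordered architecture such as AArch64, moving the acquire off the polling path is where the saving would come from; on x86 the difference is expected to be smaller.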
- Vectorization
- The eth_input_adv_and_flags_x4 optimization is upstreamed and under community review.
- The patch is also enabled for x86. Will ask maintainer to review.
- Spinlock/read-write lock optimization - Jason
- Refactored spinlock and added test file for spinlock. Patches are under internal review.
- Spinlock with inner loop got improvement on both x86 and Arm.
- Read/write lock got a little degradation with the patch.
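One common reading of "spinlock with inner loop" is a test-and-test-and-set lock: the outer loop attempts the atomic exchange, and the inner loop spins on a plain relaxed load until the lock looks free, which avoids repeated exclusive-access traffic while waiting. The following is a minimal C11 sketch under that assumption. `spin_t` and the `spin_*` functions are hypothetical names, not the patch discussed in the minutes.

```c
#include <assert.h>
#include <stdatomic.h>
#include <stdbool.h>
#include <stddef.h>

typedef struct {
    atomic_bool locked;
} spin_t;

static void spin_init(spin_t *l)
{
    assert(l != NULL);
    atomic_init(&l->locked, false);
}

/* Single attempt: returns true if the lock was taken. */
static bool spin_trylock(spin_t *l)
{
    return !atomic_exchange_explicit(&l->locked, true, memory_order_acquire);
}

static void spin_lock(spin_t *l)
{
    /* Outer loop: try to take the lock with an atomic exchange. */
    while (atomic_exchange_explicit(&l->locked, true, memory_order_acquire)) {
        /* Inner loop: read-only spin until the lock appears free. */
        while (atomic_load_explicit(&l->locked, memory_order_relaxed))
            ;
    }
}

static void spin_unlock(spin_t *l)
{
    atomic_store_explicit(&l->locked, false, memory_order_release);
}
```

A production lock would normally also place a CPU pause or yield hint inside the inner loop; it is left out here to keep the sketch portable.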
- MAP with VPP - error is resolved. Sort of working. Record the details.
- Jieqiang checked the video by Sirshak
- Apply dual/quad optimization - Lijian
- Benchmarking on Cortex-A72 with dpdk, ethernet-input, ip4 rewrite, tx-output nodes
- Submitted patches applying it to the dpdk-input, ethernet-input, ip4-input, and ip4-rewrite nodes
- Will do benchmarking and profiling on mcbin/Bluefield.
- Think of memory usage and optimization for smaller device/memory
- Think about running VPP on big-endian CPUs, as there is mixed usage of gcc vector extensions and vector intrinsics
- To confirm the firmware, vpp side to enable mcbin cache stashing - Honnappa
- EPIC for next quarter:
- ethernet-input - Planned (Lijian). will implement for aarch64 128bits only
- Message Queue - Planned (Lijian)
- VPP svm_fifo patch performance optimization on A72 cores – Planned (Sirshak)
- TAS patch (Sirshak)
- MAP with VPP - Planned (Sirshak)
- Roadmap for TCP optimization
- Timer implementation - (Sirshak) - Indicative
- perf analysis - Planned (Sirshak)
- TCP state machine from weak memory model perspective
07/23/2019
- Attendees
- Tina Tsou
- Honnappa Nagarahalli
- Lijian Zhang
- Jieqiang Wang
- Jason Zhang
- Juraj Linkes
- Christian Hopps
- General
- CSIT
- VPP Performance Test
- Performance job is merged. https://jenkins.fd.io/view/csit/job/csit-vpp-perf-mrr-daily-master-3n-tsh
- Trying to fix all the failures in the daily performance test. Almost all the tests passed locally.
- Only 1 out of 199 test cases failed, 8 test cases show random 'show interface' failure.
- Some failures are related to 'show hardware'/'show interface'/'show vhost dump' time-outs.
- https://jira.fd.io/browse/CSIT-1453
- Will check details with x86 server also. It is slow on x86 too, but takes only 5 sec there versus 40 sec on Taishan - Lijian
- Adding Taishan test bed to CSIT Status: https://gerrit.fd.io/r/#/c/16850/
- creating a job. - Everything is ready except the docker image
- 1. All tests are failing. 'show hardware' takes too much time. https://jira.fd.io/browse/VPP-1722
- 2. To figure out which test cases are executed
- Target: master trending job - firstly create trending graph from daily data; then create release report(require some manual work)
- VPP Path
- Working on MAC learning test failures on Cortex-A72 server - Jieqiang
- Enlarging the duration can fix the failure, but will investigate in more detail.
- Issues have been fixed in latest master branch. Investigating the details.
- cross compilation in VPP PATH. Jira(Juraj): https://jira.fd.io/browse/CTP-3
- Doesn't work, seems to be caused by improper cross-building tools. https://wiki.fd.io/view/VPP/Build_System_Deep_Dive
- VPP Device
- VPP tap interface is not working on all Arm servers. Works on stable/1810, but not working on stable/1901.
- Send email and current debug details to community calling for volunteer to fix it. - Lijian
- pmalloc module test cases failed on Arm server.
- Changes are uploaded to community gerrit.
- VPP VMs seem to come up well. Will work on init script and bring up vpp.
- VM tests passed. Patches are to be submitted for community review.
- All the patches are merged and all images are built.
- Will confirm with Ed about where to upload VPP docker for VPP device - Juraj
- Docker images for both Arm and x86 are merged and available.
- https://jenkins.fd.io/sandbox/job/csit-vpp-device-master-ubuntu1804-1n-tx2-weekly/1/console
- Docker image is verified on the Arm server; it still needs to be verified on the x86 server and tried in Jenkins.
- Arm and x86 have separate docker images. The Arm docker image is yet to be built.
- Out of 35 test cases in total, only 3 tap-related tests failed.
- Ed to help set up numad cluster with dual ThunderX and one ThunderX2
- FD.io lab
- Require two ThunderX2 (currently only one ThunderX2 in the lab) in FD.io lab
- Require one more ThunderX2 to form a normal cluster (1xThunderX + 2xThunderX2), to enable voting right for Arm servers in CSIT
- It’s 1RU blade ThunderX2.
- The machine will be handled by Dean’s team. Cambridge folk will set up the machine before sending it to FD.io lab.
- The machine should have a large amount of RAM: more than 120G, with 256G preferred.
- The machine should have three NICs (XL710-QDA2, 2x40G).
- The script assumes the two ThunderX2 have the same NIC type, same fiber SFP type, and NICs are plugged into same PCI slots.
- Server info in CSIT docs:
- https://gerrit.fd.io/r/gitweb?p=csit.git;a=blob;f=docs/lab/testbed_specifications.md;h=afa36ff56c7be09621e85bae6a1498aadf3a1981;hb=HEAD
- https://gerrit.fd.io/r/gitweb?p=csit.git;a=blob;f=docs/lab/testbed_specifications.md;h=afa36ff56c7be09621e85bae6a1498aadf3a1981;hb=HEAD#l495
- ThunderX1
- VPP
- VPP host-stack Hotspots
- Decrease or remove ATOMIC_ACQUIRE atomics in foreach_device_and_queue
- Message queue: optimize it with relaxed atomic intrinsics - Lijian
- Vectorization
- The eth_input_adv_and_flags_x4 optimization is upstreamed and under community review.
- The patch is also enabled for x86. Will ask maintainer to review.
- Spinlock/read-write lock optimization - Jason
- Refactored spinlock and added test file for spinlock. Patches are under internal review.
- Spinlock with inner loop got improvement on both x86 and Arm.
- Read/write lock got a little degradation with the patch.
- MAP with VPP - error is resolved. Sort of working. Record the details.
- Inform MAP owner that Jieqiang will take care of MAP on VPP. - Lijian
- Apply dual/quad optimization - Lijian
- Benchmarking on Cortex-A72 with dpdk, ethernet-input, ip4 rewrite, tx-output nodes
- Submitted patches applying it to the dpdk-input, ethernet-input, ip4-input, and ip4-rewrite nodes
- Will do benchmarking and profiling on mcbin/Bluefield.
- Think of memory usage and optimization for smaller device/memory
- Think about running VPP on big-endian CPUs, as there is mixed usage of gcc vector extensions and vector intrinsics
- To confirm the firmware, vpp side to enable mcbin cache stashing - Honnappa
- EPIC for next quarter:
- ethernet-input - Planned (Lijian). will implement for aarch64 128bits only
- Message Queue - Planned (Lijian)
- VPP svm_fifo patch performance optimization on A72 cores – Planned (Sirshak)
- TAS patch (Sirshak)
- MAP with VPP - Planned (Sirshak)
- Roadmap for TCP optimization
- Timer implementation - (Sirshak) - Indicative
- perf analysis - Planned (Sirshak)
- TCP state machine from weak memory model perspective
- VPP host-stack Hotspots
07/16/2019
- Attendees
- Tina Tsou
- Honnappa Nagarahalli
- Lijian Zhang
- Jieqiang Wang
- Jason Zhang
- Juraj Linkes
- Christian Hopps
- General
- CSIT
- VPP Performance Test
- Performance job is merged. https://jenkins.fd.io/view/csit/job/csit-vpp-perf-mrr-daily-master-3n-tsh
- Adding Taishan test bed to CSIT Status: https://gerrit.fd.io/r/#/c/16850/
- creating a job. - Everything is ready except the docker image
- 1. All tests are failing. 'show hardware' takes too much time. https://jira.fd.io/browse/VPP-1722
- 2. To figure out which test cases are executed
- Target: master trending job - firstly create trending graph from daily data; then create release report(require some manual work)
- VPP Path
- Working on MAC learning test failures on Cortex-A72 server - Jieqiang
- Enlarging the duration can fix the failure, but will investigate in more detail.
- Issues have been fixed in latest master branch. Investigating the details.
- cross compilation in VPP PATH. Jira(Juraj): https://jira.fd.io/browse/CTP-3
- Doesn't work, seems to be caused by improper cross-building tools. https://wiki.fd.io/view/VPP/Build_System_Deep_Dive
- VPP Device
- VPP tap interface is not working on all Arm servers. Works on stable/1810, but not working on stable/1901.
- Send email and current debug details to community calling for volunteer to fix it. - Lijian
- Changes are uploaded to community gerrit.
- VPP VMs seem to come up well. Will work on init script and bring up vpp.
- VM tests passed. Patches are to be submitted for community review.
- Patch is split into three small pieces. Two patches (kernel image for VM test, and generic CSIT changes to support ThunderX2 testbed) are merged. The third patch, with the Arm-specific code changes for the VM test that use the kernel image, is still to be merged.
- Will confirm with Ed about where to upload VPP docker for VPP device - Juraj
- Docker images for both Arm and x86 are merged and available.
- Docker image is verified on the Arm server; it still needs to be verified on the x86 server and tried in Jenkins.
- Ed to help set up numad cluster with dual ThunderX and one ThunderX2
- FD.io lab
- Require two ThunderX2 (currently only one ThunderX2 in the lab) in FD.io lab
- Require one more ThunderX2 to form a normal cluster (1xThunderX + 2xThunderX2), to enable voting right for Arm servers in CSIT
- It’s 1RU blade ThunderX2.
- The machine will be handled by Dean’s team. Cambridge folk will set up the machine before sending it to FD.io lab.
- The machine should have a large amount of RAM: more than 120G, with 256G preferred.
- The machine should have three NICs (XL710-QDA2, 2x40G).
- The script assumes the two ThunderX2 have the same NIC type, same fiber SFP type, and NICs are plugged into same PCI slots.
- ThunderX1
- VPP
- VPP host-stack Hotspots
- Decrease or remove ATOMIC_ACQUIRE atomics in foreach_device_and_queue
- Message queue: optimize it with relaxed atomic intrinsics - Lijian
- Vectorization
- The eth_input_adv_and_flags_x4 optimization is upstreamed and under community review.
- The patch is also enabled for x86. Will ask maintainer to review.
- Spinlock/read-write lock optimization - Jason
- Refactored spinlock and added test file for spinlock. Patches are under internal review.
- Spinlock with inner loop got improvement on both x86 and Arm.
- Read/write lock got a little degradation with the patch.
- MAP with VPP - error is resolved. Sort of working. Record the details.
- Apply dual/quad optimization - Lijian
- Benchmarking on Cortex-A72 with dpdk, ethernet-input, ip4 rewrite, tx-output nodes
- Submitted patches applying it to the dpdk-input, ethernet-input, ip4-input, and ip4-rewrite nodes
- Will do benchmarking and profiling on mcbin.
- Think of memory usage and optimization for smaller device/memory
- Think about running VPP on big-endian CPUs, as there is mixed usage of gcc vector extensions and vector intrinsics
- To confirm the firmware, vpp side to enable mcbin cache stashing - Honnappa
- EPIC for next quarter:
- ethernet-input - Planned (Lijian). will implement for aarch64 128bits only
- Message Queue - Planned (Lijian)
- VPP svm_fifo patch performance optimization on A72 cores – Planned (Sirshak)
- TAS patch (Sirshak)
- MAP with VPP - Planned (Sirshak)
- Roadmap for TCP optimization
- Timer implementation - (Sirshak) - Indicative
- perf analysis - Planned (Sirshak)
- TCP state machine from weak memory model perspective
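Note on the spinlock item above: "spinlock with inner loop" is read here as a test-and-test-and-set lock, where waiters spin on a plain load and only retry the atomic exchange once the lock looks free. The following is a minimal C11 sketch under that assumption. It is not the VPP patch, and `ttas_lock_t` and the `ttas_*` functions are hypothetical names used only for illustration.

```c
#include <stdatomic.h>
#include <stdbool.h>

/* Hypothetical lock type for illustration; not VPP's spinlock type. */
typedef struct {
    atomic_bool locked;
} ttas_lock_t;

static inline void ttas_init(ttas_lock_t *l)
{
    atomic_init(&l->locked, false);
}

/* Try once: the exchange returns the previous value, so "false" means
 * the lock was free and is now owned by the caller. */
static inline bool ttas_trylock(ttas_lock_t *l)
{
    return !atomic_exchange_explicit(&l->locked, true, memory_order_acquire);
}

static inline void ttas_lock(ttas_lock_t *l)
{
    while (!ttas_trylock(l)) {
        /* Inner loop: wait with relaxed loads only, so waiters do not
         * keep issuing atomic read-modify-write operations on the
         * contended cache line while the lock is held. */
        while (atomic_load_explicit(&l->locked, memory_order_relaxed))
            ;
    }
}

static inline void ttas_unlock(ttas_lock_t *l)
{
    /* Release ordering publishes the critical section's writes. */
    atomic_store_explicit(&l->locked, false, memory_order_release);
}
```

Whether the inner loop helps depends on the core and the contention level, which is why the minutes report measured results per architecture rather than assuming a gain.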
07/09/2019
- Attendees
- Tina Tsou
- Honnappa Nagarahalli
- Lijian Zhang
- Jieqiang Wang
- Jason Zhang
- Juraj Linkes
- Christian Hopps
- General
- CSIT
- VPP Performance Test
- Package Installation error Status(Juraj): https://jira.fd.io/browse/VPP-1566
- Installed VPP crashed on Taishan server: https://jira.fd.io/browse/VPP-1569
- Patch to resolve the issue is in community review. https://gerrit.fd.io/r/#/c/18278/ - Lijian
- Adding Taishan test bed to CSIT Status: https://gerrit.fd.io/r/#/c/16850/
- creating a job. - Everything is ready except the docker image
- Target: master trending job - firstly create trending graph from daily data; then create release report(require some manual work)
- VPP Path
- IPSEC test cases are failing and skipped on Arm server in CI/CD
- https://jira.fd.io/browse/VPP-1714
- Create a Jira ticket to track all the info related to this issue - Juraj
- Working on MAC learning test failures on Cortex-A72 server - Jieqiang
- Enlarging the duration fixes the failure, but will investigate in more detail.
- cross compilation in VPP PATH. Jira(Juraj): https://jira.fd.io/browse/CTP-3
- Doesn't work; seems to be caused by improper cross-building tools. https://wiki.fd.io/view/VPP/Build_System_Deep_Dive
- VPP Device
- VPP tap interface is not working on all Arm servers. Works on stable/1810, but not working on stable/1901.
- Send email and current debug details to community calling for volunteer to fix it. - Lijian
- Changes are uploaded to community gerrit.
- VPP VMs seem to come up well. Will work on the init script and bring up VPP.
- VM tests passed. Patches are to be submitted for community review.
- Will confirm with Ed about where to upload VPP docker for VPP device - Juraj
- Docker images for both Arm and x86 are merged and available.
- Ed to help set up Nomad cluster with dual ThunderX and one ThunderX2
- FD.io lab
- Require two ThunderX2 (currently only one ThunderX2 in the lab) in FD.io lab
- Discuss requiring another ThunderX2 1U blade with Pravin and Tina - Lijian
- Update the current status to Pravin. - Lijian
- The machine will be handled by Jingjing's team. Cambridge folk will set up the machine before sending it to FD.io lab.
- Require more than 120G of RAM; 256G preferred
- Three NICs, each with two ports.
- ThunderX1
- VPP
- VPP host-stack Hotspots
- Decrease or remove ATOMIC_ACQUIRE atomics in foreach_device_and_queue
- Message queue: optimize it with relaxed atomic intrinsics - Lijian
- Vectorization
- The patch to optimize eth_input_adv_and_flags_x4 is upstreamed and under community review.
- Spinlock/read-write lock optimization - Jason
- Refactored spinlock and added test file for spinlock. Patches are under internal review.
- Spinlock with an inner loop shows improvement on both x86 and Arm.
- Read/write lock shows a slight degradation with the patch.
- MAP with VPP - error is resolved. Sort of working. Record the details.
- Apply dual/quad optimization - Lijian
- Benchmarking on Cortex-A72 with dpdk, ethernet-input, ip4 rewrite, tx-output nodes
- Submitted patches applying it to the dpdk-input, ethernet-input, ip4-input, ip4-rewrite nodes
- Will do benchmarking and profiling on mcbin.
- Think of memory usage and optimization for smaller device/memory
- Think about running VPP on a big-endian CPU, as there is mixed usage of gcc vector extensions and vector intrinsics
- To confirm the firmware, vpp side to enable mcbin cache stashing - Honnappa
- EPIC for next quarter:
- ethernet-input - Planned (Lijian). will implement for aarch64 128bits only
- Message Queue - Planned (Lijian)
- VPP svm_fifo patch performance optimization on A72 cores – Planned (Sirshak)
- TAS patch (Sirshak)
- MAP with VPP - Planned (Sirshak)
- Roadmap for TCP optimization
- Timer implementation - (Sirshak) - Indicative
- perf analysis - Planned (Sirshak)
- TCP state machine from weak memory model perspective
07/02/2019
- Attendees
- Tina Tsou
- Honnappa Nagarahalli
- Lijian Zhang
- Jieqiang Wang
- Jason Zhang
- Juraj Linkes
- General
- CSIT
- VPP Performance Test
- Package Installation error Status(Juraj): https://jira.fd.io/browse/VPP-1566
- Installed VPP crashed on Taishan server: https://jira.fd.io/browse/VPP-1569
- Patch to resolve the issue is in community review. https://gerrit.fd.io/r/#/c/18278/ - Lijian
- Adding Taishan test bed to CSIT Status: https://gerrit.fd.io/r/#/c/16850/
- creating a job. - Everything is ready except the docker image
- Target: master trending job - firstly create trending graph from daily data; then create release report(require some manual work)
- VPP Path
- IPSEC test cases are failing and skipped on Arm server in CI/CD
- https://jira.fd.io/browse/VPP-1714
- Create a Jira ticket to track all the info related to this issue - Juraj
- Working on MAC learning test failures on Cortex-A72 server - Jieqiang
- Enlarging the duration fixes the failure, but will investigate in more detail.
- gcc-8 compilation: Jira(Sirshak): https://jira.fd.io/browse/VPP-1559 - Lijian
- The current default C compiler identification is GNU 8.3.0
- cross compilation in VPP PATH. Jira(Juraj): https://jira.fd.io/browse/CTP-3
- Doesn't work; seems to be caused by improper cross-building tools. https://wiki.fd.io/view/VPP/Build_System_Deep_Dive
- VPP Device
- VPP tap interface is not working on all Arm servers. Works on stable/1810, but not working on stable/1901.
- Send email and current debug details to community calling for volunteer to fix it. - Lijian
- VPP VMs seem to come up well. Will work on the init script and bring up VPP.
- Will confirm with Ed about where to upload VPP docker for VPP device - Juraj
- Set up Nomad cluster with dual ThunderX and one ThunderX2
- FD.io lab
- Require two ThunderX2 (currently only one ThunderX2 in the lab) in FD.io lab
- Discuss requiring another ThunderX2 1U blade with Pravin and Tina - Lijian
- Update the current status to Pravin. - Lijian
- The machine will be handled by Jingjing's team. Cambridge folk will set up the machine before sending it to FD.io lab.
- Require more than 120G of RAM; 256G preferred
- Three NICs, each with two ports.
- ThunderX1
- VPP
- VPP host-stack Hotspots
- Decrease or remove ATOMIC_ACQUIRE atomics in foreach_device_and_queue
- Message queue, remove atomic intrinsics and use lock version only - Lijian
- Have verified removing atomic intrinsics in message_queue alloc/free APIs, and require confirmation from Florin.
- Vectorization
- The patch to optimize eth_input_adv_and_flags_x4 is upstreamed and under community review.
- Spinlock/read-write lock optimization - Jason
- Refactored spinlock and added test file for spinlock. Patches are under internal review.
- Fix ip4_forward compiling - Jason
- Will check gerrit CI/CD related with that patch. Check why it's not warning in gerrit Jenkins.
- MAP with VPP - error is resolved. Sort of working. Record the details.
- Spread dual/quad optimization - Lijian
- Benchmarking on Cortex-A72 with dpdk, ethernet-input, ip4 rewrite, tx-output nodes
- Will do benchmarking and profiling on mcbin.
- Think of memory usage and optimization for smaller device/memory
- EPIC for next quarter:
- ethernet-input - Planned (Lijian). will implement for aarch64 128bits only
- Message Queue - Planned (Lijian)
- VPP svm_fifo patch performance optimization on A72 cores – Planned (Sirshak)
- TAS patch (Sirshak)
- MAP with VPP - Planned (Sirshak)
- Roadmap for TCP optimization
- Timer implementation - (Sirshak) - Indicative
- perf analysis - Planned (Sirshak)
- TCP state machine from weak memory model perspective
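Note on the dual/quad optimization item above: in packet-processing loops this generally means handling four (or two) packets per iteration in the main loop, with a single-packet loop for the remainder. The following is a minimal C sketch of that loop shape, assuming a made-up per-packet operation. `process_one` and `process_quad` are hypothetical names and not VPP APIs.

```c
#include <stddef.h>
#include <stdint.h>

/* Hypothetical per-packet work: decrement a TTL-like byte, saturating at 0. */
static inline void process_one(uint8_t *ttl)
{
    if (*ttl > 0)
        (*ttl)--;
}

/* Process n entries, four at a time where possible.
 * Returns the number of entries processed. */
static size_t process_quad(uint8_t *ttl, size_t n)
{
    size_t i = 0;

    /* Quad loop: four independent operations per iteration give the
     * compiler and the core more room to overlap work. */
    while (n - i >= 4) {
        process_one(&ttl[i + 0]);
        process_one(&ttl[i + 1]);
        process_one(&ttl[i + 2]);
        process_one(&ttl[i + 3]);
        i += 4;
    }

    /* Single loop: handle the remaining 0 to 3 entries. */
    while (i < n) {
        process_one(&ttl[i]);
        i++;
    }

    return i;
}
```

Real graph nodes typically also prefetch upcoming buffers inside the quad loop; that is left out here to keep the loop structure visible.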
06/25/2019
- Attendees
- Tina Tsou
- Honnappa Nagarahalli
- Lijian Zhang
- Jieqiang Wang
- Jason Zhang
- Juraj Linkes
- General
- CSIT
- VPP Performance Test
- Package Installation error Status(Juraj): https://jira.fd.io/browse/VPP-1566
- Installed VPP crashed on Taishan server: https://jira.fd.io/browse/VPP-1569
- Patch to resolve the issue is in community review. https://gerrit.fd.io/r/#/c/18278/ - Lijian
- Adding Taishan test bed to CSIT Status: https://gerrit.fd.io/r/#/c/16850/
- creating a job. - Everything is ready except the docker image
- Target: master trending job - firstly create trending graph from daily data; then create release report(require some manual work)
- VPP Path
- IPSEC test cases are failing and skipped on Arm server in CI/CD
- https://jira.fd.io/browse/VPP-1714
- Create a Jira ticket to track all the info related to this issue - Juraj
- Working on MAC learning test failures on Cortex-A72 server - Jieqiang
- Enlarging the duration fixes the failure, but will investigate in more detail.
- gcc-8 compilation: Jira(Sirshak): https://jira.fd.io/browse/VPP-1559 - Juraj
- The current default C compiler identification is GNU 8.3.0
- cross compilation in VPP PATH. Jira(Juraj): https://jira.fd.io/browse/CTP-3
- Doesn't work; seems to be caused by improper cross-building tools. https://wiki.fd.io/view/VPP/Build_System_Deep_Dive
- VPP Device
- VPP tap interface is not working on all Arm servers. Works on stable/1810, but not working on stable/1901.
- Crypto test cases: will use the dpdk driver if configured, then the native-vpp implementation, and fall back to OpenSSL
- Will try Crypto test cases next week - Juraj
- Juraj to send Lijian the details of vpp VMs, Lijian will confirm internally
- FD.io lab
- Require two ThunderX2 (currently only one ThunderX2 in the lab) in FD.io lab
- Discuss requiring another ThunderX2 1U blade with Pravin and Tina - Lijian
- Firstly will sponsor the machine
- The machine will be handled by Jingjing's team. Cambridge folk will set up the machine before sending it to FD.io lab.
- Require more than 120G of RAM; 256G preferred
- Three NICs, each with two ports.
- ThunderX1
- VPP
- VPP host-stack Hotspots
- Decrease or remove ATOMIC_ACQUIRE atomics in foreach_device_and_queue
- Message queue, remove atomic intrinsics and use lock version only - Lijian
- Have verified removing atomic intrinsics in message_queue alloc/free APIs, and require confirmation from Florin.
- Vectorization
- The patch to optimize eth_input_adv_and_flags_x4 is upstreamed and under community review.
- Spinlock optimization - Jason
- Refactored spinlock and added test file for spinlock. Patches are under internal review.
- MAP with VPP - error is resolved. Sort of working. Record the details.
- Spread dual/quad optimization - Lijian
- Benchmarking on Cortex-A72 with dpdk, ethernet-input, ip4 rewrite, tx-output nodes
- Will do benchmarking and profiling on mcbin.
- EPIC for next quarter:
- ethernet-input - Planned (Lijian). will implement for aarch64 128bits only
- Message Queue - Planned (Lijian)
- VPP svm_fifo patch performance optimization on A72 cores – Planned (Sirshak)
- TAS patch (Sirshak)
- MAP with VPP - Planned (Sirshak)
- Roadmap for TCP optimization
- Timer implementation - (Sirshak) - Indicative
- perf analysis - Planned (Sirshak)
- TCP state machine from weak memory model perspective
06/18/2019
- Attendees
- Sirshak Das
- Honnappa Nagarahalli
- Tina Tsou
- Lijian Zhang
- Jieqiang Wang
- Juraj Linkes
- General
- CSIT
- VPP Performance Test
- Package Installation error Status(Juraj): https://jira.fd.io/browse/VPP-1566
- Installed VPP crashed on Taishan server: https://jira.fd.io/browse/VPP-1569
- Patch to resolve the issue is in community review. https://gerrit.fd.io/r/#/c/18278/ - Lijian
- Adding Taishan test bed to CSIT Status: https://gerrit.fd.io/r/#/c/16850/
- creating a job. - Everything is ready except the docker image
- Target: master trending job - firstly create trending graph from daily data; then create release report(require some manual work)
- VPP Path
- gcc-8 compilation: Jira(Sirshak): https://jira.fd.io/browse/VPP-1559 - Juraj
- The current default C compiler identification is GNU 8.3.0
- cross compilation in VPP PATH. Jira(Juraj): https://jira.fd.io/browse/CTP-3
- Doesn't work; seems to be caused by improper cross-building tools. https://wiki.fd.io/view/VPP/Build_System_Deep_Dive
- VPP Device
- thunderx Status:
- mcbin: Kernel Migration on mcbin. Status:
- Confirm with Gorka if their mcbin can support docker. If yes, then ask them to provide image with their latest kernel/file system/dtd
- thunderx2: Status:
- FD.io lab
- Require two ThunderX2 (currently only one ThunderX2 in the lab) in FD.io lab
- ThunderX1
- VPP
- VPP host-stack Hotspots
- Decrease or remove ATOMIC_ACQUIRE atomics in foreach_device_and_queue
- Investigating message queue: understand the use case with svm queue and discuss the ideas with Florin - ongoing - Lijian
- Will confirm with Florin to remove atomic intrinsics in message_queue alloc/free APIs
- iperf3 performance with Hoststack.
- ip4_local_inline quad loop under investigation Status(Sirshak): no updates.
- svm_fifo: Status(Sirshak): Investigating performance degradation on Bluefield - Sirshak
- Will resume Taishan host-stack setup - Lijian
- VPP on MACCHIATObin is showing some unstable performance.
- Vectorization
- ethernet-input causes performance drop on AArch64.
- There is a performance drop after the ethernet-input optimization. The major reason is that, after the refactor, if promiscuous mode is enabled on the NIC, all traffic from the NIC falls into the so-called slow path.
- A vectorized patch to optimize eth_input_adv_and_flags_x4 is under internal review.
- TAS patch - Upstreamed.
- MAP with VPP - error is resolved. Sort of working. Record the details.
- Investigate hyperscan plugin in VPP - Sirshak
- Spread dual/quad optimization - ethernet-input
- Redo perf/MAP profiling/bench-marking
- DPI plugin?
- EPIC for next quarter:
- Apply dual/quad optimization on more data path nodes
- Investigate and optimize VPP hash and bihash library
- VPP translation overhead analysis btw Mbuf and VLIB buffer ENTNET-1293
- VPP Memif performance analysis and optimization ENTNET-1292
- VPP l3fwd performance analysis and optimization ENTNET-751
- Using MAP with VPP ENTNET-1288
06/11/2019
- Attendees
- Sirshak Das
- Honnappa Nagarahalli
- Tina Tsou
- Lijian Zhang
- Jieqiang Wang
- Juraj
- General
- CSIT
- VPP Performance Test
- Package Installation error Status(Juraj): https://jira.fd.io/browse/VPP-1566
- Installed VPP crashed on Taishan server: https://jira.fd.io/browse/VPP-1569
- Patch to resolve the issue is in community review. https://gerrit.fd.io/r/#/c/18278/ - Lijian
- Adding Taishan test bed to CSIT Status: https://gerrit.fd.io/r/#/c/16850/
- creating a job. - Everything is ready except the docker image
- Target: master trending job - firstly create trending graph from daily data; then create release report(require some manual work)
- VPP Path
- gcc-8 compilation: Jira(Sirshak): https://jira.fd.io/browse/VPP-1559 - Juraj
- The current default C compiler identification is GNU 8.3.0
- cross compilation in VPP PATH. Jira(Juraj): https://jira.fd.io/browse/CTP-3
- Doesn't work; seems to be caused by improper cross-building tools. https://wiki.fd.io/view/VPP/Build_System_Deep_Dive
- VPP Device
- thunderx Status:
- mcbin: Kernel Migration on mcbin. Status:
- Confirm with Gorka if their mcbin can support docker. If yes, then ask them to provide image with their latest kernel/file system/dtd
- thunderx2: Status:
- FD.io lab
- Require two ThunderX2 (currently only one ThunderX2 in the lab) in FD.io lab
- ThunderX1
- VPP
- VPP host-stack Hotspots
- Decrease or remove ATOMIC_ACQUIRE atomics in foreach_device_and_queue
- Investigating message queue: understand the use case with svm queue and discuss the ideas with Florin - ongoing - Lijian
- Will confirm with Florin to remove atomic intrinsics in message_queue alloc/free APIs
- iperf3 performance with Hoststack.
- ip4_local_inline quad loop under investigation Status(Sirshak): no updates.
- svm_fifo: Status(Sirshak): Investigating performance degradation on Bluefield - Sirshak
- Will resume Taishan host-stack setup - Lijian
- VPP on MACCHIATObin is showing some unstable performance.
- Vectorization
- ethernet-input causes performance drop on AArch64.
- There is a performance drop after the ethernet-input optimization. The major reason is that, after the refactor, if promiscuous mode is enabled on the NIC, all traffic from the NIC falls into the so-called slow path.
- A vectorized patch to optimize eth_input_adv_and_flags_x4 is under internal review.
- TAS patch - Upstreamed.
- MAP with VPP - error is resolved. Sort of working. Record the details.
- Investigate hyperscan plugin in VPP - Sirshak
- Spread dual/quad optimization - ethernet-input
- Redo perf/MAP profiling/bench-marking
- DPI plugin?
- EPIC for next quarter:
- ethernet-input - Planned (Lijian). will implement for aarch64 128bits only
- Message Queue - Planned (Lijian)
- VPP svm_fifo patch performance optimization on A72 cores – Planned (Sirshak)
- TAS patch (Sirshak)
- MAP with VPP - Planned (Sirshak)
- Roadmap for TCP optimization
- Timer implementation - (Sirshak) - Indicative
- perf analysis - Planned (Sirshak)
- TCP state machine from weak memory model perspective
06/04/2019
- Attendees
- Sirshak Das
- Honnappa Nagarahalli
- Tina Tsou
- Lijian Zhang
- Jieqiang Wang
- Stan
- General
- CSIT
- VPP Performance Test
- Package Installation error Status(Juraj): https://jira.fd.io/browse/VPP-1566
- Installed VPP crashed on Taishan server: https://jira.fd.io/browse/VPP-1569
- Patch to resolve the issue is in community review. https://gerrit.fd.io/r/#/c/18278/ - Lijian
- Adding Taishan test bed to CSIT Status: https://gerrit.fd.io/r/#/c/16850/
- creating a job. - Everything is ready except the docker image
- Target: master trending job - firstly create trending graph from daily data; then create release report(require some manual work)
- VPP Path
- gcc-8 compilation: Jira(Sirshak): https://jira.fd.io/browse/VPP-1559 - No update
- cross compilation in VPP PATH. Jira(Juraj): https://jira.fd.io/browse/CTP-3
- Doesn't work; seems to be caused by improper cross-building tools. https://wiki.fd.io/view/VPP/Build_System_Deep_Dive
- VPP Device
- thunderx Status:
- mcbin: Kernel Migration on mcbin. Status:
- Confirm with Gorka if their mcbin can support docker. If yes, then ask them to provide image with their latest kernel/file system/dtd
- thunderx2: Status:
- FD.io lab
- Require one ThunderX2 (currently only one ThunderX2 in the lab) in FD.io lab
- ThunderX1
- VPP
- VPP host-stack Hotspots
- Decrease or remove ATOMIC_ACQUIRE atomics in foreach_device_and_queue
- Investigating message queue: understand the use case with svm queue and discuss the ideas with Florin - ongoing - Lijian
- iperf3 performance with Hoststack.
- ip4_local_inline quad loop under investigation Status(Sirshak): no updates.
- svm_fifo: Status(Sirshak): Investigating performance degradation on Bluefield - Sirshak
- Will resume Taishan host-stack setup - Lijian
- VPP on MACCHIATObin is showing some unstable performance.
- Vectorization
- ethernet-input causes performance drop on AArch64.
- There is a performance drop after the ethernet-input optimization. The major reason is that, after the refactor, if promiscuous mode is enabled on the NIC, all traffic from the NIC falls into the so-called slow path.
- A vectorized patch to optimize eth_input_adv_and_flags_x4 is under internal review.
- TAS patch - Upstreamed.
- MAP with VPP - error is resolved. Sort of working. Record the details.
- Investigate hyperscan plugin in VPP - Sirshak
- DPI plugin?
- EPIC for next quarter:
- ethernet-input - Planned (Lijian). will implement for aarch64 128bits only
- Message Queue - Planned (Lijian)
- VPP svm_fifo patch performance optimization on A72 cores – Planned (Sirshak)
- TAS patch (Sirshak)
- MAP with VPP - Planned (Sirshak)
- Roadmap for TCP optimization
- Timer implementation - (Sirshak) - Indicative
- perf analysis - Planned (Sirshak)
- TCP state machine from weak memory model perspective
05/28/2019
- Attendees
- Sirshak Das
- Honnappa Nagarahalli
- Tina
- Lijian Zhang
- Vijay (vijayakumar.rajamanickam@nokia.com)
- General
- CSIT
- VPP Performance Test
- Package Installation error Status(Juraj): https://jira.fd.io/browse/VPP-1566
- Installed VPP crashed on Taishan server: https://jira.fd.io/browse/VPP-1569
- Patch to resolve the issue is in community review. https://gerrit.fd.io/r/#/c/18278/ - Lijian
- Adding Taishan test bed to CSIT Status: https://gerrit.fd.io/r/#/c/16850/
- creating a job. - Everything is ready except the docker image
- Target: master trending job - firstly create trending graph from daily data; then create release report(require some manual work)
- VPP Path
- gcc-8 compilation: Jira(Sirshak): https://jira.fd.io/browse/VPP-1559 - No update
- cross compilation in VPP PATH. Jira(Juraj): https://jira.fd.io/browse/CTP-3
- Doesn't work; seems to be caused by improper cross-building tools. https://wiki.fd.io/view/VPP/Build_System_Deep_Dive
- VPP Device
- thunderx Status:
- mcbin: Kernel Migration on mcbin. Status:
- Confirm with Gorka if their mcbin can support docker. If yes, then ask them to provide image with their latest kernel/file system/dtd
- thunderx2: Status:
- FD.io lab
- Require one ThunderX2 (currently only one ThunderX2 in the lab) in FD.io lab
- ThunderX1
- VPP
- VPP host-stack Hotspots
- Decrease or remove ATOMIC_ACQUIRE atomics in foreach_device_and_queue
- Investigating message queue: understand the use case with svm queue and discuss the ideas with Florin - ongoing - Lijian
- iperf3 performance with Hoststack.
- ip4_local_inline quad loop under investigation Status(Sirshak): no updates.
- svm_fifo: Status(Sirshak): Investigating performance degradation on Bluefield - Sirshak
- Will resume Taishan host-stack setup - Lijian
- VPP on MACCHIATObin is showing some unstable performance.
- Vectorization
- ethernet-input causes performance drop on AArch64.
- There is a performance drop after the ethernet-input optimization. The major reason is that, after the refactor, if promiscuous mode is enabled on the NIC, all traffic from the NIC falls into the so-called slow path.
- A vectorized patch to optimize eth_input_adv_and_flags_x4 is under internal review.
- TAS patch - internal Review.
- MAP with VPP - Tried internal patch; still failing. Continuing to work on it.
- Investigate hyperscan plugin in VPP - Sirshak
- DPI plugin?
- EPIC for next quarter:
- ethernet-input - Planned (Lijian). will implement for aarch64 128bits only
- Message Queue - Planned (Lijian)
- VPP svm_fifo patch performance optimization on A72 cores – Planned (Sirshak)
- TAS patch (Sirshak)
- MAP with VPP - Planned (Sirshak)
- Roadmap for TCP optimization
- Timer implementation - (Sirshak) - Indicative
- perf analysis - Planned (Sirshak)
- TCP state machine from weak memory model perspective
05/21/2019
- Attendees
- Sirshak Das
- Honnappa Nagarahalli
- Tina
- Lijian Zhang
- Vijay (vijayakumar.rajamanickam@nokia.com)
- General
- CSIT
- VPP Performance Test
- Package Installation error Status(Juraj): https://jira.fd.io/browse/VPP-1566
- Installed VPP crashed on Taishan server: https://jira.fd.io/browse/VPP-1569
- Patch to resolve the issue is in community review. https://gerrit.fd.io/r/#/c/18278/ - Lijian
- Adding Taishan test bed to CSIT Status: https://gerrit.fd.io/r/#/c/16850/
- creating a job. - Everything is ready except the docker image
- Target: master trending job - firstly create trending graph from daily data; then create release report(require some manual work)
- VPP Path
- gcc-8 compilation: Jira(Sirshak): https://jira.fd.io/browse/VPP-1559 - No update
- cross compilation in VPP PATH. Jira(Juraj): https://jira.fd.io/browse/CTP-3
- Doesn't work; seems to be caused by improper cross-building tools. https://wiki.fd.io/view/VPP/Build_System_Deep_Dive
- VPP Device
- thunderx Status:
- mcbin: Kernel Migration on mcbin. Status:
- Confirm with Gorka if their mcbin can support docker. If yes, then ask them to provide image with their latest kernel/file system/dtd
- thunderx2: Status:
- FD.io lab
- Require one ThunderX2 (currently only one ThunderX2 in the lab) in FD.io lab
- ThunderX1
- VPP
- VPP host-stack Hotspots
- Decrease or remove ATOMIC_ACQUIRE atomics in foreach_device_and_queue
- Investigating message queue: understand the use case with svm queue and discuss the ideas with Florin - ongoing - Lijian
- iperf3 performance with Hoststack.
- ip4_local_inline quad loop under investigation Status(Sirshak): no updates.
- svm_fifo: Status(Sirshak): Investigating performance degradation on Bluefield - Sirshak
- Will resume Taishan host-stack setup - Lijian
- VPP on MACCHIATObin is showing some unstable performance.
- Vectorization
- ethernet-input causes performance drop on AArch64.
- There is a performance drop after the ethernet-input optimization. The major reason is that, after the refactor, if promiscuous mode is enabled on the NIC, all traffic from the NIC falls into the so-called slow path.
- A vectorized patch to optimize eth_input_adv_and_flags_x4 is under internal review.
- TAS patch - internal Review.
- MAP with VPP - Tried internal patch; still failing. Continuing to work on it.
- Investigate hyperscan plugin in VPP - Sirshak
- DPI plugin?
- EPIC for next quarter:
- ethernet-input - Planned (Lijian). will implement for aarch64 128bits only
- Message Queue - Planned (Lijian)
- VPP svm_fifo patch performance optimization on A72 cores – Planned (Sirshak)
- TAS patch (Sirshak)
- MAP with VPP - Planned (Sirshak)
- Roadmap for TCP optimization
- Timer implementation - (Sirshak) - Indicative
- perf analysis - Planned (Sirshak)
- TCP state machine from weak memory model perspective
05/14/2019
- Attendees
- Sirshak Das
- Honnappa Nagarahalli
- Tina
- Lijian Zhang
- Vijay (vijayakumar.rajamanickam@nokia.com)
- General
- CSIT
- VPP Performance Test
- Package Installation error Status(Juraj): https://jira.fd.io/browse/VPP-1566
- Installed VPP crashed on Taishan server: https://jira.fd.io/browse/VPP-1569
- Patch to resolve the issue is in community review. https://gerrit.fd.io/r/#/c/18278/ - Lijian
- Adding Taishan test bed to CSIT Status: https://gerrit.fd.io/r/#/c/16850/
- creating a job. - Everything is ready except the docker image
- Target: master trending job - firstly create trending graph from daily data; then create release report(require some manual work)
- VPP Path
- gcc-8 compilation: Jira(Sirshak): https://jira.fd.io/browse/VPP-1559 - No update
- cross compilation in VPP PATH. Jira(Juraj): https://jira.fd.io/browse/CTP-3
- Doesn't work; seems to be caused by improper cross-building tools. https://wiki.fd.io/view/VPP/Build_System_Deep_Dive
- VPP Device
- thunderx Status:
- mcbin: Kernel Migration on mcbin. Status:
- Confirm with Gorka if their mcbin can support docker. If yes, then ask them to provide image with their latest kernel/file system/dtd
- thunderx2: Status:
- FD.io lab
- Require one ThunderX2 (currently only one ThunderX2 in the lab) in FD.io lab
- ThunderX1
- VPP
- VPP host-stack Hotspots
- Decrease or remove ATOMIC_ACQUIRE atomics in foreach_device_and_queue
- VPP generic distro package building patch - Patch updated. Requires Damjan's follow-up review.
- Investigating message queue: understand the use case with svm queue and discuss the ideas with Florin - ongoing - Lijian
- iperf3 performance with Hoststack.
- ip4_local_inline quad loop under investigation Status(Sirshak): no updates.
- svm_fifo: Status(Sirshak): Investigating performance degradation on Bluefield - Sirshak
- Will resume Taishan host-stack setup - Lijian
- VPP on MACCHIATObin is showing some unstable performance.
- Vectorization
- ethernet-input causes performance drop on AArch64.
- There is a performance drop after the ethernet-input optimization. The major reason is that, after the refactor, if promiscuous mode is enabled on the NIC, all traffic from the NIC falls into the so-called slow path.
- A vectorized patch to optimize eth_input_adv_and_flags_x4 is under internal review.
- TAS patch - internal Review.
- MAP with VPP - Tried internal patch; still failing. Continuing to work on it.
- Investigate hyperscan plugin in VPP - Sirshak
- DPI plugin?
- EPIC for next quarter:
- ethernet-input - Planned (Lijian). will implement for aarch64 128bits only
- Message Queue - Planned (Lijian)
- VPP svm_fifo patch performance optimization on A72 cores – Planned (Sirshak)
- TAS patch (Sirshak)
- MAP with VPP - Planned (Sirshak)
- Roadmap for TCP optimization
- Timer implementation - (Sirshak) - Indicative
- perf analysis - Planned (Sirshak)
- TCP state machine from weak memory model perspective
05/07/2019
- Attendees
- Sirshak Das
- Honnappa Nagarahalli
- Tina
- Lijian Zhang
- Vijay (vijayakumar.rajamanickam@nokia.com)
- General
- CSIT
- VPP Performance Test
- Package Installation error Status(Juraj): https://jira.fd.io/browse/VPP-1566
- Installed VPP crashed on Taishan server: https://jira.fd.io/browse/VPP-1569
- Patch to resolve the issue is in community review. https://gerrit.fd.io/r/#/c/18278/ - Lijian
- Adding Taishan test bed to CSIT Status: https://gerrit.fd.io/r/#/c/16850/
- creating a job. - Everything is ready except the docker image
- Target: master trending job - firstly create trending graph from daily data; then create release report(require some manual work)
- VPP Path
- gcc-8 compilation: Jira(Sirshak): https://jira.fd.io/browse/VPP-1559 - No update
- cross compilation in VPP PATH. Jira(Juraj): https://jira.fd.io/browse/CTP-3
- Doesn't work; seems to be caused by improper cross-building tools. https://wiki.fd.io/view/VPP/Build_System_Deep_Dive
- VPP Device
- thunderx Status:
- mcbin: Kernel Migration on mcbin. Status:
- Confirm with Gorka if their mcbin can support docker. If yes, then ask them to provide image with their latest kernel/file system/dtd
- thunderx2: Status:
- FD.io lab
- Require one ThunderX2 (currently only one ThunderX2 in the lab) in FD.io lab
- ThunderX1
- VPP
- VPP host-stack Hotspots
- Decrease or remove ATOMIC_ACQUIRE atomics in foreach_device_and_queue
- Investigate ((xd->flags & DPDK_DEVICE_FLAG_ADMIN_UP) == 0) in dpdk_device_input.
- VPP generic distro package building patch - Patch updated. Damjan's follow-up review required.
- Investigating message queue: understand the use case with svm queue and discuss the ideas with Florin - ongoing - Lijian
- iperf3 performance with Hoststack.
- ip4_local_inline quad loop under investigation Status(Sirshak): no updates.
- svm_fifo: Status(Sirshak): Investigating performance degradation on Bluefield - Sirshak
- Will resume Taishan host-stack setup - Lijian
- VPP on MACCHIATObin is showing some unstable performance.
- Vectorization
- Vectorization in esp-encrypt, optimize memcpy_le. Upstreamed(https://gerrit.fd.io/r/#/c/18398/). - Lijian
- ethernet-input causes performance drop on AArch64.
- There is a performance drop after the ethernet-input optimization. The major reason is that, after the refactor, if promiscuous mode is enabled on the NIC, all traffic from the NIC falls into the so-called slow path.
- A vectorized patch to optimize eth_input_adv_and_flags_x4 is under internal review.
- TAS patch - internal Review.
- MAP (Arm proprietary performance analysis tool) with VPP - Tried internal patch; still failing. Continuing to work on it.
- EPIC for next quarter:
- ethernet-input - Planned (Lijian). will implement for aarch64 128bits only
- Message Queue - Planned (Lijian)
- VPP svm_fifo patch performance optimization on A72 cores – Planned (Sirshak)
- TAS patch (Sirshak)
- MAP with VPP - Planned (Sirshak)
- Roadmap for TCP optimization
- Timer implementation - (Sirshak) - Indicative
- perf analysis - Planned (Sirshak)
- TCP state machine from weak memory model perspective
- VPP host-stack Hotspots
04/30/2019
- Attendees
- Sirshak Das
- Honnappa Nagarahalli
- Tina
- General
- CSIT
- VPP Performance Test
- Package Installation error Status(Juraj): https://jira.fd.io/browse/VPP-1566
- Installed VPP crashed on Taishan server: https://jira.fd.io/browse/VPP-1569
- Patch to resolve the issue is in community review. https://gerrit.fd.io/r/#/c/18278/ - Lijian
- Adding Taishan test bed to CSIT Status: https://gerrit.fd.io/r/#/c/16850/
- creating a job. - Everything is ready except the docker image
- Target: master trending job - first create a trending graph from daily data, then create the release report (requires some manual work)
- VPP Path
- gcc-8 compilation: Jira(Sirshak): https://jira.fd.io/browse/VPP-1559 - No update
- cross compilation in VPP PATH. Jira(Juraj): https://jira.fd.io/browse/CTP-3
- Doesn't work; seems to be caused by improper cross-building tools. https://wiki.fd.io/view/VPP/Build_System_Deep_Dive
- VPP Device
- thunderx Status:
- mcbin: Kernel Migration on mcbin. Status:
- Confirm with Gorka if their mcbin can support docker. If yes, then ask them to provide image with their latest kernel/file system/dtd
- thunderx2: Status:
- FD.io lab
- Require one ThunderX2(currently only one thunderX2 in the lab) in FD.io lab
- ThunderX1
- VPP
- VPP host-stack Hotspots
- Decrease or remove ATOMIC_ACQUIRE atomics in foreach_device_and_queue
- Investigate ((xd->flags & DPDK_DEVICE_FLAG_ADMIN_UP) == 0) in dpdk_device_input.
- VPP generic distro package building patch - Patch updated; Damjan's follow-up review required.
- Investigating message queue; understand the use case with svm queue and discuss the ideas with Florin - ongoing - Lijian
- iperf3 performance with Hoststack.
- ip4_local_inline quad loop under investigation Status(Sirshak): no updates.
- svm_fifo: Status(Sirshak): Investigating performance degradation on Bluefield - Sirshak
- Will resume Taishan host-stack setup - Lijian
- VPP on MACCHIATObin shows some unstable performance.
- Vectorization
- Vectorization in esp-encrypt, optimize memcpy_le. Upstreamed(https://gerrit.fd.io/r/#/c/18398/). - Lijian
- ethernet-input causes performance drop on AArch64.
- There is a performance drop after the ethernet-input optimization. The main reason is that, after the refactor, if promiscuous mode is enabled on the NIC, all traffic from the NIC falls into the so-called slow path.
- A vectorized patch to optimize eth_input_adv_and_flags_x4 is under internal review.
- TAS patch - internal Review.
- MAP (Arm proprietary performance analysis tool) with VPP - Tried internal patch; still failing. Continuing to work on it.
- EPIC for next quarter:
- ethernet-input - Planned (Lijian). will implement for aarch64 128bits only
- Message Queue - Planned (Lijian)
- VPP svm_fifo patch performance optimization on A72 cores – Planned (Sirshak)
- TAS patch (Sirshak)
- MAP with VPP - Planned (Sirshak)
- Roadmap for TCP optimization
- Timer implementation - (Sirshak) - Indicative
- perf analysis - Planned (Sirshak)
- TCP state machine from weak memory model perspective
- VPP host-stack Hotspots
04/23/2019
- Attendees
- Sirshak Das
- Lijian Zhang
- Juraj Linkeš
- Vijay
- Nitin
- Khemendra Kumar
- Tina Tsou
- Andy Wang
- Honnappa Nagarahalli
- General
- CSIT
- VPP Performance Test
- List all the blockers on aarch64 in CSIT wiki page - Done - Stan or Juraj
- Package Installation error Status(Juraj): interfaces configured on NUMA nodes 2 and 3 are not visible. https://jira.fd.io/browse/VPP-1566
- Installed VPP crashed on Taishan server, https://jira.fd.io/browse/VPP-1569
- Patch to resolve the issue is in community review. https://gerrit.fd.io/r/#/c/18278/ - Lijian
- Estimates from Khem and Stan/Juraj. Status: https://gerrit.fd.io/r/#/c/16850/
- a. Host Config, almost done with patch https://gerrit.fd.io/r/#/c/16837/ - Done
- b. merging CSIT patch. - Closing done
- c. creating a job. - Everything is ready except the docker image
- Target: master trending job - first create a trending graph from daily data, then create the release report (requires some manual work)
- Add license header/copyright to scripts - Sirshak/Honnappa to confirm with Andy Waffa
- VPP Path
- gcc-8 compilation: Jira(Sirshak): https://jira.fd.io/browse/VPP-1559 - No update
- cross compilation in VPP PATH. Jira(Juraj): https://jira.fd.io/browse/CTP-3
- Status: Juraj to bring this up in CSIT call. (start with just cross-compilation)
- Doesn't work; seems to be caused by improper cross-building tools. https://wiki.fd.io/view/VPP/Build_System_Deep_Dive
- VPP Device
- thunderx Status: 1-node topology was rewired because of QSFP+ switch.
- QSFP+ is available and working now.
- Juraj to resend email to Mahamad about the details, including Sirshak and Tina
- mcbin: Kernel Migration on mcbin. Status: was able to update U-Boot but not boot the new kernel. Jialin suggested different boot parameters; Juraj yet to try.
- Confirm with Gorka whether their mcbin can support Docker. If yes, ask them to provide an image with their latest kernel/file system/dtd
- thunderx2: Status: server is set up. Management connection works. Intel NICs are connected. Will prepare the server for VPP device testing. Currently working on containers for VPP device. Will probably be able to run VPP device tests manually this week.
- FD.io lab
- Require one ThunderX2(currently only one thunderX2 in the lab) in FD.io lab
- ThunderX1
- QSFP+ switch for ThunderX1 Status: ONL OS to be installed on QSFP+ switch.
- Management connection through QSFP+ switch is working now. Moving VPP device work to ThunderX1 blade servers.
- Will use these four new ThunderX1 servers for CI (Jenkins), replacing the previous three old ThunderX1 servers.
- These four ThunderX1 blades are not identical. The first one has two NUMA nodes, and the other three blades have one NUMA node.
- Investigate why these three blades have only one numa node - Juraj
- VPP
- Investigate session_queue_node_fn/vlib_worker_loop.
- Decrease or remove ATOMIC_ACQUIRE atomics in foreach_device_and_queue
- Investigate ((xd->flags & DPDK_DEVICE_FLAG_ADMIN_UP) == 0) in dpdk_device_input
- Rebase VPP distro package building patch; contact Damjan in slack; Talk with Damjan in vpp meeting - Lijian & Sirshak
- Investigating message queue; understand the use case with svm queue and discuss the ideas with Florin - Lijian
- iperf3 performance with Hoststack.
- ip4_local_inline quad loop under investigation Status(Sirshak): no updates.
- svm_fifo: Status(Sirshak): Perf degradation is fixed. Investigating performance degradation on Bluefield - Sirshak
- Will resume Taishan host-stack setup - Lijian
- Octeon-Tx Status(Sirshak): Done by Malvika. Running DPDK sample apps on it now.
- With VPP running on the Arm side, the x86 iperf3 client observes an unstable performance rate (reason unknown).
- Vectorization
- Vectorization in esp-encrypt, https://gerrit.fd.io/r/#/c/18398/ - Improvement on ThunderX/OcteonTX/Taishan, but degradation on ThunderX2 - Lijian
- TAS patch will be ready soon (Sirshak)
- MAP with VPP is ongoing - Sirshak
- EPIC for next quarter:
- ethernet-input - Planned (Lijian). will implement for aarch64 128bits only
- Message Queue - Planned (Lijian)
- VPP svm_fifo patch performance optimization on A72 cores – Planned (Sirshak)
- TAS patch (Sirshak)
- MAP with VPP - Planned (Sirshak)
- Roadmap for TCP optimization
- Timer implementation - (Sirshak) - Indicative
- perf analysis - Planned (Sirshak)
- TCP state machine from weak memory model perspective
- Investigate session_queue_node_fn/vlib_worker_loop.
- Action Items - Last Week
- Action Items - Next Week
04/16/2019
- Attendees
- Sirshak Das
- Lijian Zhang
- Juraj Linkeš
- Vijay
- Nitin
- Khemendra Kumar
- Tina Tsou
- Andy Wang
- Honnappa Nagarahalli
- General
- CSIT
- VPP Performance Test
- List all the blockers on aarch64 in CSIT wiki page - Done - Stan or Juraj
- Package Installation error Status(Juraj): interfaces configured on NUMA nodes 2 and 3 are not visible. https://jira.fd.io/browse/VPP-1566
- Installed VPP crashed on Taishan server, https://jira.fd.io/browse/VPP-1569
- Patch to resolve the issue is in community review. https://gerrit.fd.io/r/#/c/18278/ - Lijian
- Estimates from Khem and Stan/Juraj. Status: https://gerrit.fd.io/r/#/c/16850/
- a. Host Config, almost done with patch https://gerrit.fd.io/r/#/c/16837/ - Done
- b. merging CSIT patch. - Closing done
- c. creating a job. - Everything is ready except the docker image
- Target: master trending job - first create a trending graph from daily data, then create the release report (requires some manual work)
- Add license header/copyright to scripts - Sirshak/Honnappa to confirm with Andy Waffa
- VPP Path
- gcc-8 compilation: Jira(Sirshak): https://jira.fd.io/browse/VPP-1559 - No update
- cross compilation in VPP PATH. Jira(Juraj): https://jira.fd.io/browse/CTP-3
- Status: Juraj to bring this up in CSIT call. (start with just cross-compilation)
- Doesn't work; seems to be caused by improper cross-building tools. https://wiki.fd.io/view/VPP/Build_System_Deep_Dive
- VPP Device
- thunderx Status: 1-node topology was rewired because of QSFP+ switch.
- QSFP+ is available and working now.
- Juraj to resend email to Mahamad about the details, including Sirshak and Tina
- mcbin: Kernel Migration on mcbin. Status: was able to update U-Boot but not boot the new kernel. Jialin suggested different boot parameters; Juraj yet to try.
- Confirm with Gorka whether their mcbin can support Docker. If yes, ask them to provide an image with their latest kernel/file system/dtd
- thunderx2: Status: server is set up. Management connection works. Intel NICs are connected. Will prepare the server for VPP device testing. Currently working on containers for VPP device. Will probably be able to run VPP device tests manually this week.
- FD.io lab
- Require one ThunderX2(currently only one thunderX2 in the lab) in FD.io lab
- ThunderX1
- QSFP+ switch for ThunderX1 Status: ONL OS to be installed on QSFP+ switch.
- Management connection through QSFP+ switch is working now. Moving VPP device work to ThunderX1 blade servers.
- Will use these four new ThunderX1 servers for CI (Jenkins), replacing the previous three old ThunderX1 servers.
- These four ThunderX1 blades are not identical. The first one has two NUMA nodes, and the other three blades have one NUMA node.
- Investigate why these three blades have only one numa node - Juraj
- VPP
- Investigate session_queue_node_fn/vlib_worker_loop. - https://jira.arm.com/browse/ENTNET-1179 - Done
- Will create two Jira tickets to track the findings. - Lijian
- Rebase VPP distro package building patch; contact Damjan in slack; Talk with Damjan in vpp meeting - Lijian & Sirshak
- Investigating message queue - Lijian
- iperf3 performance with Hoststack.
- ip4_local_inline quad loop under investigation Status(Sirshak): no updates.
- svm_fifo: Status(Sirshak): Perf degradation is fixed. Investigating performance degradation on Bluefield - Sirshak
- Will resume Taishan host-stack setup - Lijian
- Octeon-Tx Status(Sirshak): Done by Malvika. Running DPDK sample apps on it now.
- With VPP running on the Arm side, the x86 iperf3 client observes an unstable performance rate (reason unknown).
- Vectorization
- Vectorization in esp-encrypt, https://gerrit.fd.io/r/#/c/18398/ - Improvement on ThunderX/OcteonTX/Taishan, but degradation on ThunderX2 - Lijian
- EPIC for next quarter:
- ethernet-input - Planned (Lijian). will implement for aarch64 128bits only
- Message Queue - Planned (Lijian)
- VPP svm_fifo patch performance optimization on A72 cores – Planned (Sirshak)
- TAS patch (Sirshak)
- MAP with VPP - Planned (Sirshak)
- Roadmap for TCP optimization
- Timer implementation - (Sirshak) - Indicative
- perf analysis - Planned (Sirshak)
- TCP state machine from weak memory model perspective
- Investigate session_queue_node_fn/vlib_worker_loop. - https://jira.arm.com/browse/ENTNET-1179 - Done
- Action Items - Last Week
- Action Items - Next Week
04/09/2019
- Attendees
- Sirshak Das
- Lijian Zhang
- Juraj Linkeš
- Nitin
- Khemendra Kumar
- Tina Tsou
- Andy Wang
- Honnappa Nagarahalli
- General
- CSIT
- VPP Performance Test
- List all the blockers on aarch64 in CSIT wiki page - Done - Stan or Juraj
- Package Installation error Status(Juraj): interfaces configured on NUMA nodes 2 and 3 are not visible. https://jira.fd.io/browse/VPP-1566
- Installed VPP crashed on Taishan server, https://jira.fd.io/browse/VPP-1569
- Patch to resolve the issue is in community review. https://gerrit.fd.io/r/#/c/18278/ - Lijian
- Estimates from Khem and Stan/Juraj. Status: https://gerrit.fd.io/r/#/c/16850/
- a. Host Config, almost done with patch https://gerrit.fd.io/r/#/c/16837/ - Done
- b. merging CSIT patch. - Closing done
- c. creating a job. - Everything is ready except the docker image
- Target: master trending job - first create a trending graph from daily data, then create the release report (requires some manual work)
- Add license header/copyright to scripts - Sirshak/Honnappa to confirm with Andy Waffa
- VPP Path
- gcc-8 compilation: Jira(Sirshak): https://jira.fd.io/browse/VPP-1559 - No update
- cross compilation in VPP PATH. Jira(Juraj): https://jira.fd.io/browse/CTP-3
- Status: Juraj to bring this up in CSIT call. (start with just cross-compilation)
- Doesn't work; seems to be caused by improper cross-building tools. https://wiki.fd.io/view/VPP/Build_System_Deep_Dive
- VPP Device
- thunderx Status: 1-node topology was rewired because of QSFP+ switch.
- QSFP+ is available and working now.
- Juraj to resend email to Mahamad about the details, including Sirshak and Tina
- mcbin: Kernel Migration on mcbin. Status: was able to update U-Boot but not boot the new kernel. Jialin suggested different boot parameters; Juraj yet to try.
- Confirm with Gorka whether their mcbin can support Docker. If yes, ask them to provide an image with their latest kernel/file system/dtd
- thunderx2: Status: server is set up. Management connection works. Intel NICs are connected. Will prepare the server for VPP device testing. Currently working on containers for VPP device. Will probably be able to run VPP device tests manually this week.
- FD.io lab
- Require one ThunderX2(currently only one thunderX2 in the lab) in FD.io lab
- ThunderX1
- QSFP+ switch for ThunderX1 Status: ONL OS to be installed on QSFP+ switch.
- Management connection through QSFP+ switch is working now. Moving VPP device work to ThunderX1 blade servers.
- Will use these four new ThunderX1 servers for CI (Jenkins), replacing the previous three old ThunderX1 servers.
- These four ThunderX1 blades are not identical. The first one has two NUMA nodes, and the other three blades have one NUMA node.
- Investigate why these three blades have only one numa node - Juraj
- VPP
- VPP Hoststack
- Investigate session_queue_node_fn/vlib_worker_loop. - https://jira.arm.com/browse/ENTNET-1179 - Done
- Rebase VPP distro package building patch; contact Damjan in slack; Talk with Damjan in vpp meeting - Lijian & Sirshak
- Investigating message queue - Lijian
- iperf3 performance with Hoststack.
- ip4_local_inline quad loop under investigation Status(Sirshak): no updates.
- svm_fifo: Status(Sirshak): Perf degradation is fixed. Investigating performance degradation on Bluefield - Sirshak
- Octeon-Tx Status(Sirshak): Done by Malvika. Running DPDK sample apps on it now.
- With VPP running on the Arm side, the x86 iperf3 client observes an unstable performance rate (reason unknown).
- Vectorization
- Vectorization in esp-encrypt, https://gerrit.fd.io/r/#/c/18398/ - Improvement on ThunderX/OcteonTX/Taishan, but degradation on ThunderX2 - Lijian
- ethernet-input - will implement for aarch64 128bits only
- Create vectorization specific EPIC - Lijian
- Action Items - Last Week
- Action Items - Next Week
04/02/2019
- Attendees
- Sirshak Das
- Juraj Linkeš
- Nitin
- Khemendra Kumar
- Tina Tsou
- Andy Wang
- Gorka
- Fede
- Honnappa Nagarahalli
- General
- VPP Hoststack
- After assigning dedicated CPUs to VPP main/VPP worker/iperf3 server, VPP hoststack gives better performance than the Linux stack on both ThunderX2 and Taishan servers.
- Investigate session_queue_node_fn/message queue data structure. - Investigating the source code
- Review https://gerrit.fd.io/r/#/c/18398/ - Lijian
- iperf3 performance with Hoststack.
- ip4_local_inline quad loop under investigation Status(Sirshak): no updates.
- svm_fifo: Status(Sirshak): Working on fixing VPP Path errors from svm_fifo patch. Upstreamed. Perf degradation is fixed.
- Octeon-Tx Status(Sirshak): Done by Malvika. Running DPDK on it now.
- With VPP running on the Arm side, the x86 iperf3 client observes an unstable performance rate (reason unknown).
- CSIT
- VPP Performance Test
- List all the blockers on aarch64 in CSIT wiki page - Stan or Juraj
- Package Installation error Status(Juraj): interfaces configured on NUMA nodes 2 and 3 are not visible. https://jira.fd.io/browse/VPP-1566
- Installed VPP crashed on Taishan server, https://jira.fd.io/browse/VPP-1569
- Build both binaries and packages with the generic option by default, and provide the Makefile variable NATIVE_OPTIMIZE=Y for end users to build natively optimized images.
- Prepare an email and a draft patch asking for comments from the community - Lijian
- Estimates from Khem and Stan/Juraj. Status: https://gerrit.fd.io/r/#/c/16850/
- a. Host Config, almost done with patch https://gerrit.fd.io/r/#/c/16837/ - Done
- b. merging CSIT patch. - Closing done
- c. creating a job. - Everything is ready except the docker image
- Target: master trending job - first create a trending graph from daily data, then create the release report (requires some manual work)
- Add license header/copyright to scripts - Sirshak/Honnappa to confirm with Andy Waffa
- VPP Path
- gcc-8 compilation: Jira(Sirshak): https://jira.fd.io/browse/VPP-1559 - No update
- cross compilation in VPP PATH. Jira(Juraj): https://jira.fd.io/browse/CTP-3
- Status: Juraj to bring this up in CSIT call. (start with just cross-compilation)
- Doesn't work; seems to be caused by improper cross-building tools. https://wiki.fd.io/view/VPP/Build_System_Deep_Dive
- VPP Device
- thunderx Status: 1-node topology was rewired because of QSFP+ switch.
- QSFP+ is available and working now.
- Juraj to resend email to Mahamad about the details, including Sirshak and Tina
- mcbin: Kernel Migration on mcbin. Status: was able to update U-Boot but not boot the new kernel. Jialin suggested different boot parameters; Juraj yet to try.
- Confirm with Gorka whether their mcbin can support Docker. If yes, ask them to provide an image with their latest kernel/file system/dtd
- Confirm whether Jianlin's board has exactly the same pluggable switches as Juraj's boards - Lijian
- thunderx2: Status: server is set up. Management connection works. Intel NICs are connected. Will prepare the server for VPP device testing. Currently working on containers for VPP device. Will probably be able to run VPP device tests manually this week.
- FD.io lab
- Require one ThunderX2(currently only one thunderX2 in the lab) in FD.io lab
- ThunderX1
- QSFP+ switch for ThunderX1 Status: ONL OS to be installed on QSFP+ switch.
- Management connection through QSFP+ switch is working now. Moving VPP device work to ThunderX1 blade servers.
- Will use these four new ThunderX1 servers for CI (Jenkins), replacing the previous three old ThunderX1 servers.
- These four ThunderX1 blades are not identical. The first one has two NUMA nodes, and the other three blades have one NUMA node.
- Investigate why these three blades have only one numa node - Juraj
- VPP
- Write description/expectation about the two NEON-related patches - Lijian
- Investigating performance degradation on CortexA72 - Sirshak
- Message queue - Sirshak
- Verify effects and make NEON changes Jira: https://jira.fd.io/browse/VPP-1560 - To close it.
- Vectorization
- ethernet-input - no progress yet
- 128B cache line size
- Taishan/A72 Status: Khem to try 128B cache line on taishan (performance difference).
- Action Items - Last Week
- [Sachin] to introduce RFC for IPsec offload support in DPDK plugin.
- Action Items - Next Week
03/26/2019
- Attendees
- Sirshak Das
- Juraj Linkeš
- Nitin
- Khemendra Kumar
- Tina Tsou
- Andy Wang
- Gorka
- Fede
- Honnappa Nagarahalli
- General
- VPP Hoststack
- After assigning dedicated CPUs to VPP main/VPP worker/iperf3 server, VPP hoststack gives better performance than the Linux stack on both ThunderX2 and Taishan servers.
- Investigate session_queue_node_fn/message queue data structure. - Investigating the source code
- Review https://gerrit.fd.io/r/#/c/18398/ - Lijian
- iperf3 performance with Hoststack.
- ip4_local_inline quad loop under investigation Status(Sirshak): no updates.
- svm_fifo: Status(Sirshak): Working on fixing VPP Path errors from svm_fifo patch. Upstreamed. Perf degradation is fixed.
- Octeon-Tx Status(Sirshak): Done by Malvika. Running DPDK on it now.
- With VPP running on the Arm side, the x86 iperf3 client observes an unstable performance rate (reason unknown).
- CSIT
- VPP Performance Test
- Package Installation error Status(Juraj): interfaces configured on NUMA nodes 2 and 3 are not visible. https://jira.fd.io/browse/VPP-1566
- Installed VPP crashed on Taishan server, https://jira.fd.io/browse/VPP-1569
- Build both binaries and packages with the generic option by default, and provide the Makefile variable NATIVE_OPTIMIZE=Y for end users to build natively optimized images.
- Prepare an email and a draft patch asking for comments from the community - Lijian
- Estimates from Khem and Stan/Juraj. Status: https://gerrit.fd.io/r/#/c/16850/
- a. Host Config, almost done with patch https://gerrit.fd.io/r/#/c/16837/ - Done
- b. merging CSIT patch. - Closing done
- c. creating a job. - Everything is ready except the docker image
- Target: master trending job - first create a trending graph from daily data, then create the release report (requires some manual work)
- Add license header/copyright to scripts - Sirshak/Honnappa to confirm with Andy Waffa
- VPP Path
- gcc-8 compilation: Jira(Sirshak): https://jira.fd.io/browse/VPP-1559 - No update
- cross compilation in VPP PATH. Jira(Juraj): https://jira.fd.io/browse/CTP-3
- Status: Juraj to bring this up in CSIT call. (start with just cross-compilation)
- Doesn't work; seems to be caused by improper cross-building tools. https://wiki.fd.io/view/VPP/Build_System_Deep_Dive
- VPP Device
- thunderx Status: 1-node topology was rewired because of QSFP+ switch.
- QSFP+ is available and working now.
- Juraj to resend email to Mahamad about the details, including Sirshak and Tina
- mcbin: Kernel Migration on mcbin. Status: was able to update U-Boot but not boot the new kernel. Jialin suggested different boot parameters; Juraj yet to try.
- Confirm with Gorka whether their mcbin can support Docker. If yes, ask them to provide an image with their latest kernel/file system/dtd
- Confirm whether Jianlin's board has exactly the same pluggable switches as Juraj's boards - Lijian
- thunderx2: Status: server is set up. Management connection works. Intel NICs are connected. Will prepare the server for VPP device testing. Currently working on containers for VPP device. Will probably be able to run VPP device tests manually this week.
- FD.io lab
- ThunderX1
- QSFP+ switch for ThunderX1 Status: ONL OS to be installed on QSFP+ switch.
- Management connection through QSFP+ switch is working now. Moving VPP device work to ThunderX1 blade servers.
- Will use these four new ThunderX1 servers for CI (Jenkins), replacing the previous three old ThunderX1 servers.
- These four ThunderX1 blades are not identical. The first one has two NUMA nodes, and the other three blades have one NUMA node.
- Investigate why these three blades have only one numa node - Juraj
- ThunderX1
- VPP
- Verify effects and make NEON changes Jira: https://jira.fd.io/browse/VPP-1560 - To close it.
- Vectorization
- ethernet-input - no progress yet
- 128B cache line size
- Taishan/A72 Status: Khem to try 128B cache line on taishan (performance difference).
- Action Items - Last Week
- [Sachin] to introduce RFC for IPsec offload support in DPDK plugin.
- Action Items - Next Week
03/19/2019
- Attendees
- Sirshak Das
- Juraj Linkeš
- Stanislav Chlebec
- Khemendra Kumar
- Tina Tsou
- Andy Wang
- Gorka
- Fede
- Honnappa Nagarahalli
- General
- VPP Hoststack
- After assigning dedicated CPUs to VPP main/VPP worker/iperf3 server, VPP hoststack gives better performance than the Linux stack on both ThunderX2 and Taishan servers.
- vlib_worker_loop and session_queue_node_fn are two major hot-spots. - Just started
- Enable NEON instruction in Buffer pool free function. Patch is committed.
- iperf3 performance with Hoststack.
- ip4_local_inline quad loop under investigation Status(Sirshak): no updates.
- svm_fifo: Status(Sirshak): Working on fixing VPP Path errors from svm_fifo patch. Upstreamed, but still working on issues, e.g., performance degradation
- Octeon-Tx Status(Sirshak): yet to try steps from gorka for usb ubuntu rootfs installation. - Done by Malvika.
- VPP on both sides (iperf3 server and client) gives a boost (reason unknown).
- With VPP running on the Arm side, the x86 iperf3 client observes an unstable performance rate (reason unknown).
- CSIT
- VPP Performance Test
- Package Installation error Status(Juraj): interfaces configured on NUMA nodes 2 and 3 are not visible. https://jira.fd.io/browse/VPP-1566
- Installed VPP crashed on Taishan server, https://jira.fd.io/browse/VPP-1569
- Prepare an email and a draft patch asking for comments from the community - Lijian
- Estimates from Khem and Stan/Juraj. Status: https://gerrit.fd.io/r/#/c/16850/
- a. Host Config, almost done with patch https://gerrit.fd.io/r/#/c/16837/ - Done
- b. merging CSIT patch. - Closing done
- c. creating a job. - Everything is ready except the docker image
- Target: master trending job - first create a trending graph from daily data, then create the release report (requires some manual work)
- Add license header/copyright to scripts - Sirshak/Honnappa to confirm with Andy Waffa
- VPP Path
- gcc-8 compilation: Jira(Sirshak): https://jira.fd.io/browse/VPP-1559 - No update
- cross compilation in VPP PATH. Jira(Juraj): https://jira.fd.io/browse/CTP-3
- Status: Juraj to bring this up in CSIT call. (start with just cross-compilation)
- Doesn't work; seems to be caused by improper cross-building tools. https://wiki.fd.io/view/VPP/Build_System_Deep_Dive
- VPP Device
- thunderx Status: 1-node topology was rewired because of QSFP+ switch.
- Juraj to resend email to Mahamad about the details, including Sirshak and Tina
- mcbin: Kernel Migration on mcbin. Status: was able to update U-Boot but not boot the new kernel. Jialin suggested different boot parameters; Juraj yet to try.
- Confirm with Gorka whether their mcbin can support Docker. If yes, ask them to provide an image with their latest kernel/file system/dtd
- Confirm whether Jianlin's board has exactly the same pluggable switches as Juraj's boards - Lijian
- thunderx2: Status: Talk to edk about deployment strategy with 1-node. Also blocked by QSFP+ issue.
- FD.io lab
- ThunderX1
- QSFP+ switch for ThunderX1 Status: ONL OS to be installed on QSFP+ switch.
- Juraj setup call with LF people. Status: Done.
- ThunderX2
- Cables: Sent. Juraj to open another ticket for wiring the ThunderX2.
- ThunderX1
- VPP
- Verify effects and make NEON changes Jira: https://jira.fd.io/browse/VPP-1560 - In internal review.
- Vectorization
- ethernet-input - no progress yet
- buffer pools - https://jira.fd.io/browse/VPP-1560. In internal review
- 128B cache line size
- VPP image with 128B cache line size crashed on ThunderX2 - Cannot reproduce crash with my setup
- Taishan/A72 Status: Khem to try 128B cache line on taishan (performance difference).
- Action Items - Last Week
- [Sachin] to introduce RFC for IPsec offload support in DPDK plugin.
- Action Items - Next Week
- Commit VPP distro making patch - Lijian
- Plug a 25G NIC into the Taishan server, and connect the 25G ports to the x86 25G NIC - Lijian
- Follow Jianlin's suggestion, update Uboot and Kernel, and then sync up with Juraj - Lijian
03/12/2019
- Attendees
- Sirshak Das
- Juraj Linkeš
- Stanislav Chlebec
- Khemendra Kumar
- Tina Tsou
- Andy Wang
- Gorka
- Fede
- Honnappa Nagarahalli
- General
- Tina to update the meeting notice.
- VPP Hoststack
- After assigning dedicated CPUs to VPP main/VPP worker/iperf3 server, VPP hoststack gives better performance than the Linux stack on both ThunderX2 and Taishan servers.
- vlib_worker_loop and session_queue_node_fn are two major hot-spots. - No progress
- Enable NEON instruction in Buffer pool free function. Patch is committed.
- iperf3 performance with Hoststack.
- ip4_local_inline quad loop under investigation Status(Sirshak): no updates.
- svm_fifo: Status(Sirshak): Working on fixing VPP Path errors from svm_fifo patch. All test failures are resolved.
- Octeon-Tx Status(Sirshak): yet to try steps from gorka for usb ubuntu rootfs installation. - Switched to Malvika.
- VPP on both sides (iperf3 server and client) gives a boost (reason unknown).
- With VPP running on the Arm side, the x86 iperf3 client observes an unstable performance rate (reason unknown).
- CSIT
- VPP Performance Test
- Package Installation error Status(Juraj): interfaces configured on NUMA nodes 2 and 3 are not visible. https://jira.fd.io/browse/VPP-1566
- Installed VPP crashed on Taishan server, https://jira.fd.io/browse/VPP-1569
- Prepare an email and a draft patch asking for comments from the community - Lijian
- Estimates from Khem and Stan/Juraj. Status: https://gerrit.fd.io/r/#/c/16850/
- a. Host Config, almost done with patch https://gerrit.fd.io/r/#/c/16837/ - Done
- b. merging CSIT patch. - Closing done
- c. creating a job. - Everything is ready except the docker image
- Target: master trending job - first create a trending graph from daily data, then create the release report (requires some manual work)
- Add license header/copyright to scripts - Sirshak/Honnappa to confirm with Andy Waffa
- VPP Path
- gcc-8 compilation: Jira(Sirshak): https://jira.fd.io/browse/VPP-1559 - No update
- cross compilation in VPP PATH. Jira(Juraj): https://jira.fd.io/browse/CTP-3
- Status: Juraj to bring this up in CSIT call. (start with just cross-compilation)
- Doesn't work; seems to be caused by improper cross-building tools. https://wiki.fd.io/view/VPP/Build_System_Deep_Dive
- VPP Device
- thunderx Status: 1-node topology was rewired because of QSFP+ switch.
- mcbin: Kernel Migration on mcbin. Status: was able to update U-Boot but not boot the new kernel. Jialin suggested different boot parameters; Juraj yet to try.
- Confirm whether Jianlin's board has exactly the same pluggable switches as Juraj's boards - Lijian
- thunderx2: Status: Talk to edk about deployment strategy with 1-node.
- FD.io lab
- ThunderX1
- QSFP+ switch for ThunderX1 Status: ONL OS to be installed on QSFP+ switch.
- Juraj setup call with LF people. Status: Done.
- ThunderX2
- Cables: Sent. Juraj to open another ticket for wiring the ThunderX2.
- ThunderX1
- VPP
- Verify effects and make NEON changes Jira: https://jira.fd.io/browse/VPP-1560 - In internal review.
- Vectorization
- ethernet-input - no progress yet
- buffer pools - https://jira.fd.io/browse/VPP-1560. In internal review
- 128B cache line size
- VPP image with 128B cache line size crashed on ThunderX2
- thunderx2 crashing - No update
- Taishan/A72 Status: Khem to try 128B cache line on taishan (performance difference).
- Action Items - Last Week
- [Sachin] to introduce RFC for IPsec offload support in DPDK plugin.
- Action Items - Next Week
- Commit VPP distro making patch - Lijian
- Plug in 25G NIC on Taishan server, and connect the 25G ports to x86 25G NIC - Lijian
- Follow Jianlin's suggestion, update U-Boot and kernel, and then sync up with Juraj - Lijian
03/05/2019
- Attendees
- Sirshak Das
- Juraj Linkeš
- Stanislav Chlebec
- Khemendra Kumar
- Tina Tsou
- Andy Wang
- Gorka
- Fede
- Honnappa Nagarahalli
- General
- VPP Hoststack
- After assigning dedicated CPUs to VPP main/VPP worker/iperf3 server, VPP hoststack on both ThunderX2 and Taishan server gives better performance than the Linux stack.
- vlib_worker_loop and session_queue_node_fn are two major hot-spots. - No progress
- iperf3 performance with Hoststack.
- ip4_local_inline quad loop under investigation Status(Sirshak): no updates.
- svm_fifo: Status(Sirshak): Working on fixing VPP Path errors from svm_fifo patch. All test failures are resolved.
- Octeon-Tx Status(Sirshak): yet to try steps from Gorka for USB Ubuntu rootfs installation. - Switched to Malvika.
- VPP on both sides (iperf3 server and client) gives a boost (reason unknown).
- With VPP running on the Arm side, the x86 iperf3 client observes an unstable performance rate (reason unknown).
- CSIT
- VPP Performance Test
- Package Installation error Status(Juraj): interfaces configured on NUMA nodes 2 and 3 are not visible. https://jira.fd.io/browse/VPP-1566
- Installed VPP crashed on Taishan server, https://jira.fd.io/browse/VPP-1569
- Estimates from Khem and Stan/Juraj. Status: https://gerrit.fd.io/r/#/c/16850/
- a. Host Config, almost done with patch https://gerrit.fd.io/r/#/c/16837/ - Done
- b. merging CSIT patch. - Closing done
- c. creating a job. - Everything is ready except the docker image
- Target: master trending job - first create a trending graph from daily data, then create the release report (requires some manual work)
- Add license header/copyright to scripts - Sirshak/Honnappa to confirm with Andy Waffa
- VPP Path
- gcc-8 compilation: Jira(Sirshak): https://jira.fd.io/browse/VPP-1559 - No update
- cross compilation in VPP PATH. Jira(Juraj): https://jira.fd.io/browse/CTP-3
- Status: Juraj to bring this up in CSIT call. (start with just cross-compilation)
- Doesn't work, seems to be caused by improper cross-building tools. https://wiki.fd.io/view/VPP/Build_System_Deep_Dive
- VPP Device
- thunderx Status: 1-node topology was rewired because of QSFP+ switch.
- mcbin: Kernel Migration on mcbin. Status: was able to update U-Boot but not boot the new kernel. Jianlin suggested different boot parameters; Juraj has yet to try them.
- Confirm whether Jianlin's board has exactly the same pluggable switches as Juraj's boards - Lijian
- thunderx2: Status: Talk to edk about deployment strategy with 1-node.
- FD.io lab
- ThunderX1
- QSFP+ switch for ThunderX1 Status: ONL OS to be installed on QSFP+ switch.
- Juraj setup call with LF people. Status: Done.
- ThunderX2
- Cables: Sent. Juraj to open another tkt for wiring the ThunderX2.
- VPP
- Verify effects and make NEON changes Jira: https://jira.fd.io/browse/VPP-1560 - No progress
- Investigate with latest VPP code on x86 server - Lijian - Send email to the vpp-dev mailing list if there's a problem. Will not put much effort into this.
- Vectorization
- ethernet-input
- buffer pools
- 128B cache line size
- Will try this on Taishan server - Slight performance degradation with 128-byte cache line
- thunderx2 crashing - No update
- Taishan/A72 Status: Khem to try 128B cache line on taishan (performance difference).
- Action Items - Last Week
- [Sachin] to introduce RFC for IPsec offload support in DPDK plugin.
- Action Items - Next Week
02/26/2019
- Attendees
- Sirshak Das
- Juraj Linkeš
- Stanislav Chlebec
- Khemendra Kumar
- Tina Tsou
- Andy Wang
- Gorka
- Fede
- Honnappa Nagarahalli
- General
- VPP Hoststack
- After assigning dedicated CPUs to VPP main/VPP worker/iperf3 server, VPP hoststack on both ThunderX2 and Taishan server gives better performance than the Linux stack.
- el0_sys hot-spot on Taishan D05 only, no plan to fix it.
- vlib_worker_loop and session_queue_node_fn are two major hot-spots. - No progress
- memcpy optimization
- memcpy patch verification on Taishan by Khem with L3 forwarding use case - Lijian. Status(Khem): No updates.
- memcpy patch consumes more clocks in OcteonTX2 - updated by Nitin.
- Stopped working on this patch.
- iperf3 performance with Hoststack.
- ip4_local_inline quad loop under investigation Status(Sirshak): no updates.
- svm_fifo: Status(Sirshak): Working on fixing VPP Path errors from svm_fifo patch. Test failure on SCTP, not root-caused yet.
- Octeon-Tx Status(Sirshak): yet to try steps from Gorka for USB Ubuntu rootfs installation. - Switched to Malvika.
- VPP on both sides (iperf3 server and client) gives a boost (reason unknown).
- With VPP running on the Arm side, the x86 iperf3 client observes an unstable performance rate (reason unknown).
- CSIT
- VPP Performance Test
- Package Installation error Status(Juraj): interfaces configured on NUMA nodes 2 and 3 are not visible. https://jira.fd.io/browse/VPP-1566
- Installed VPP crashed on Taishan server, https://jira.fd.io/browse/VPP-1569
- Estimates from Khem and Stan/Juraj. Status: https://gerrit.fd.io/r/#/c/16850/
- a. Host Config, almost done with patch https://gerrit.fd.io/r/#/c/16837/ - Done
- b. merging CSIT patch. - Closing done
- c. creating a job. - Everything is ready except the docker image
- Target: master trending job - first create a trending graph from daily data, then create the release report (requires some manual work)
- Add license header/copyright to scripts - Sirshak/Honnappa to confirm with Andy Waffa
- VPP Path
- gcc-8 compilation: Jira(Sirshak): https://jira.fd.io/browse/VPP-1559 - No update
- cross compilation in VPP PATH. Jira(Juraj): https://jira.fd.io/browse/CTP-3
- Status: Juraj to bring this up in CSIT call. (start with just cross-compilation)
- Doesn't work, seems to be caused by improper cross-building tools. https://wiki.fd.io/view/VPP/Build_System_Deep_Dive
- VPP Device
- thunderx Status: 1-node topology was rewired because of QSFP+ switch.
- mcbin: Kernel Migration on mcbin. Status: was able to update U-Boot but not boot the new kernel. Jianlin suggested different boot parameters; Juraj has yet to try them.
- Confirm whether Jianlin's board has exactly the same pluggable switches as Juraj's boards - Lijian
- thunderx2: Status: Talk to edk about deployment strategy with 1-node.
- FD.io lab
- ThunderX1
- QSFP+ switch for ThunderX1 Status: ONL OS to be installed on QSFP+ switch.
- Juraj setup call with LF people. Status: Done.
- ThunderX2
- Cables: Sent. Juraj to open another tkt for wiring the ThunderX2.
- VPP
- Buffer Pools per NUMA
- Verify effects and make NEON changes Jira: https://jira.fd.io/browse/VPP-1560
- Investigate with latest VPP code on x86 server - Lijian - Send email to the vpp-dev mailing list if there's a problem. Will not put much effort into this.
- Vectorization
- ethernet-input
- buffer pools
- 128B cache line size
- Will try this on Taishan server - Slight performance degradation with 128-byte cache line
- Qualcomm no change iperf3
- thunderx2 crashing - No update
- Taishan/A72 Status: Khem to try 128B cache line on taishan (performance difference).
- Action Items - Last Week
- [Sachin] to introduce RFC for IPsec offload support in DPDK plugin.
- Action Items - Next Week
02/19/2019
- Attendees
- Sirshak Das
- Juraj Linkeš
- Stanislav Chlebec
- Khemendra Kumar
- Tina Tsou
- Andy Wang
- Gorka
- Fede
- Honnappa Nagarahalli
- General
- VPP Hoststack
- After assigning dedicated CPUs to VPP main/VPP worker/iperf3 server, VPP hoststack on both ThunderX2 and Taishan server gives better performance than the Linux stack.
- memcpy optimization
- memcpy patch verification on Taishan by Khem with L3 forwarding use case - Lijian. Status(Khem): No updates.
- memcpy patch consumes more clocks in OcteonTX2 - updated by Nitin.
- iperf3 performance with Hoststack.
- ip4_local_inline quad loop under investigation Status(Sirshak): no updates.
- svm_fifo: Status(Sirshak): Working on fixing VPP Path errors from svm_fifo patch.
- Octeon-Tx Status(Sirshak): yet to try steps from Gorka for USB Ubuntu rootfs installation.
- VPP on both sides (iperf3 server and client) gives a boost (reason unknown).
- With VPP running on the Arm side, the x86 iperf3 client observes an unstable performance rate (reason unknown).
- CSIT
- VPP Performance Test
- Package Installation error Status(Juraj): interfaces configured on NUMA nodes 2 and 3 are not visible. https://jira.fd.io/browse/VPP-1566
- Installed VPP crashed on Taishan server, https://jira.fd.io/browse/VPP-1569
- Estimates from Khem and Stan/Juraj. Status: https://gerrit.fd.io/r/#/c/16850/
- a. Host Config, almost done with patch https://gerrit.fd.io/r/#/c/16837/
- b. merging CSIT patch.
- c. creating a job.
- Target: master trending job
- VPP Path
- gcc-8 compilation: Jira(Sirshak): https://jira.fd.io/browse/VPP-1559
- cross compilation in VPP PATH. Jira(Juraj): https://jira.fd.io/browse/CTP-3
- Status: Juraj to bring this up in CSIT call. (start with just cross-compilation)
- VPP Device
- thunderx Status: 1-node topology was rewired because of QSFP+ switch.
- mcbin: Kernel Migration on mcbin. Status: was able to update U-Boot but not boot the new kernel. Jianlin suggested different boot parameters; Juraj has yet to try them.
- Confirm whether Jianlin's board has exactly the same pluggable switches as Juraj's boards - Lijian
- thunderx2: Status: Talk to edk about deployment strategy with 1-node.
- FD.io lab
- ThunderX1
- QSFP+ switch for ThunderX1 Status: ONL OS to be installed on QSFP+ switch.
- Juraj setup call with LF people. Status: Done.
- ThunderX2
- Cables: Sent. Juraj to open another tkt for wiring the ThunderX2.
- VPP
- Buffer Pools per NUMA
- Verify effects and make NEON changes Jira: https://jira.fd.io/browse/VPP-1560
- 1GB page taking long time Status: fixed.
- Investigate with latest VPP code on x86 server
- Vectorization
- ethernet-input
- buffer pools
- memcpy
- 128B cache line size
- Will try this on Taishan server - Lijian
- Qualcomm no change iperf3
- thunderx2 crashing - No update
- Taishan/A72 Status: Khem to try 128B cache line on taishan (performance difference).
- Action Items - Last Week
- [Sachin] to introduce RFC for IPsec offload support in DPDK plugin.
- Action Items - Next Week
02/11/2019
- Attendees
- Sirshak Das
- Juraj Linkeš
- Stanislav Chlebec
- Khemendra Kumar
- Tina Tsou
- Andy Wang
- Gorka
- Fede
- Honnappa Nagarahalli
- General
- VPP Hoststack
- memcpy optimization
- memcpy patch verification on Taishan by Khem with L3 forwarding use case - Lijian. Status(Khem): No updates.
- iperf3 performance with Hoststack.
- ip4_local_inline quad loop under investigation Status(Sirshak): no updates.
- svm_fifo: Status(Sirshak): Working on fixing VPP Path errors from svm_fifo patch.
- Octeon-Tx Status(Sirshak): yet to try steps from Gorka for USB Ubuntu rootfs installation.
- VPP on both sides (iperf3 server and client) gives a boost (reason unknown).
- CSIT
- VPP Performance Test
- Package Installation error Status(Juraj): interfaces configured on NUMA nodes 2 and 3 are not visible.
- Estimates from Khem and Stan/Juraj. Status: https://gerrit.fd.io/r/#/c/16850/
- a. Host Config
- b. merging CSIT patch.
- c. creating a job.
- Target: master trending job
- VPP Path
- gcc-8 compilation: Jira(Sirshak): https://jira.fd.io/browse/VPP-1559
- cross compilation in VPP PATH. Jira(Juraj): https://jira.fd.io/browse/CTP-3
- Status: Juraj to bring this up in CSIT call. (start with just cross-compilation)
- VPP Device
- thunderx Status: 1-node topology was rewired because of QSFP+ switch.
- mcbin: Kernel Migration on mcbin. Status: was able to update U-Boot but not boot the new kernel. Jianlin suggested different boot parameters; Juraj has yet to try them.
- thunderx2: Status: Talk to edk about deployment strategy with 1-node.
- FD.io lab
- ThunderX1
- QSFP+ switch for ThunderX1 Status: ONL OS to be installed on QSFP+ switch.
- Juraj setup call with LF people. Status: Done.
- ThunderX2
- Cables: Sent. Juraj to open another tkt for wiring the ThunderX2.
- VPP
- Buffer Pools per NUMA
- Verify effects and make NEON changes Jira: https://jira.fd.io/browse/VPP-1560
- 1GB page taking long time Status: fixed.
- Vectorization
- ethernet-input
- buffer pools
- memcpy
- 128B cache line size
- Qualcomm no change iperf3
- thunderx2 crashing
- Taishan/A72 Status: Khem to try 128B cache line on taishan (performance difference).
- Action Items - Last Week
- [Sachin] to introduce RFC for IPsec offload support in DPDK plugin.
- Action Items - Next Week
02/05/2019
- Attendees
- Sirshak Das
- Juraj Linkeš
- Stanislav Chlebec
- Khemendra Kumar
- Tina Tsou
- Andy Wang
- Gorka
- Fede
- Honnappa Nagarahalli
- General
- VPP Hoststack
- memcpy optimization
- Check that the optimized memory copy versions are deployed on Taishan and ThunderX2 at runtime - Lijian
- Send memcpy patch to Khem and Fede for further verification - Lijian. Status: Fede: small improvement on mcbin with iperf3; Khem to try it with L3 forwarding.
- iperf3 performance with Hoststack.
- ip4_local_inline quad loop under investigation
- Working on svm_fifo alternate version with front and back pointers synchronized instead of cursize.
- Verifying per NUMA node buffer pool https://gerrit.fd.io/r/#/c/16638/
- Sirshak to create Jira ID in FD.io Jira. https://jira.fd.io/browse/VPP-1560
- The VPP hang is actually VPP taking a long time to allocate 400K chunks for 1GB - Damjan has this on his to-do list
- gcc-8 compilation still fails on ARM.
- Sirshak to create a Jira ID in FD.io Jira. Status: https://jira.fd.io/browse/VPP-1559
- Octeon-Tx failure. Status: unknown
- Gorka is trying some optimal configs for VCL. Status: no updates.
- VPP on both sides (iperf3 server and client) gives a boost (reason unknown).
- OcteonTx boots to buildroot with no dhclient, hence an impasse. Still not clear how to use the USB stick.
- CSIT
- VPP Path
- Sirshak to keep track of gcc-8 compilation, once clean we can switch to gcc-8. https://jira.fd.io/browse/VPP-1559
- ARM CI results are overwritten by x86 machines. Should be a Jenkins issue. Monitor whether this corner case happens again. - Juraj
- Add cross compilation CI Juraj: https://jira.fd.io/browse/CTP-3
- VPP Device
- thunderx: 1-node topology on cavium thunderx. Status: no updates.
- Kernel Migration on mcbin. Status:
- ThunderX2:
- VPP Performance Test
- Stan and Khem to come up with a summary of current status and an estimate of at least upstreaming basic L2/L3 performance suites.
- Juraj to come up with a solution for the NUMA node anomaly on Taishan.
- https://gerrit.fd.io/r/#/c/16850/ Status: Juraj has a version all ready to work. Package installation blocker.
- Package installation error Status: Juraj to investigate logs.
- FD.io lab
- ThunderX1 -
- New QSFP+ switch for ThunderX1 is available now: QSFP+ to be connected to SFP+ switch.
- Juraj to setup a call with LF folks on.
- ThunderX2 -
- Andy is still waiting for cables.
- Juraj to remind Andy of when the cable will be available.
- Juraj to follow up on ssh connectivity to thunderx2.
- VPP
- Vectorization
- [Lijian] Macro benchmarking on ThunderX2/Centriq(4%)/Taishan D05(10%) is done, data is updated into Jira. Code is in internal review.
- [Lijian] Check if setting default cache line size to 128 will degrade throughput on Taishan/Qualcomm/ThunderX2
- no perf diff in Qualcomm
- vpp crashes on thunderx2
- waiting for results on A72 (Taishan)
- [Sirshak] on ethernet-input node, investigate vectorized buffer index, Damjan's per numa node buffer pool patch. Status: No updates
- open fd.io jira tkt. https://jira.fd.io/browse/VPP-1560
- Action Items - Last Week
- [Sachin] to introduce RFC for IPsec offload support in DPDK plugin.
- Action Items - Next Week
01/29/2019
- Attendees
- Sirshak Das
- Juraj Linkeš
- Lijian Zhang
- Stanislav Chlebec
- Khemendra Kumar
- Tina Tsou
- Andy Wang
- Garcia
- Manuel
- Gorka
- Fede
- Honnappa Nagarahalli
- John Ddigilio
- General
- DPDK multi-core scheduler
- https://gerrit.fd.io/r/#/c/15084/
- Please join slack.
- Merge TCP optimization meeting into VPP/Aarch64 community public meeting.
- VPP Hoststack
- Taishan server with Debian distro crashed on the 'ip probe-neighbor' command when running VPP hoststack with iperf3
- With 64-byte packets, on ThunderX2, 10G NIC, VPP hoststack bandwidth is about 1/2 of Linux kernel stack.
- With 64-byte packets, on Taishan, 10G NIC, VPP hoststack bandwidth is about 2x of Linux kernel stack.
- Memory copy patch gives 4% improvement on VPP hoststack on Taishan server.
- Check that the optimized memory copy versions are deployed on Taishan and ThunderX2 at runtime - Lijian
- Send memcpy patch to Khem and Fede for further verification - Lijian
- iperf3 performance with Hoststack.
- Investigating lock-less fifo, ip4_local_forward node and memory reordering for VPP hoststack - Sirshak
- Sirshak is trying to set up one fd.io setup for everyone debugging VPP hoststack.
- Verifying https://gerrit.fd.io/r/#/c/16638/ - Supposed to give better performance, but VPP hangs with this patch on some Arm machines.
- Gorka is trying some optimal configs for VCL. - No Updates.
- VPP on both sides (iperf3 server and client) gives a boost (reason unknown).
- OcteonTX has been received in the Arm lab. Will boot it up first and then start profiling with it.
- FD.io lab
- ThunderX1 -
- New Arista switch for ThunderX1 is available now. Gathering details required by the LF lab before sending the switch to the CSIT lab. - Juraj - Andy will try to send the switch to CSIT this Thursday.
- ThunderX2 -
- Cable type is confirmed. Procurement is in progress.
- Juraj to remind Andy of when the cable will be available.
- Require access to these servers in FD.io lab. Anton gives the IP to access them.(ADMIN/ADMIN)
- CSIT
- VPP Path
- So far so good.
- ARM CI results are overwritten by x86 machines. Should be a Jenkins issue. Monitor whether this corner case happens again. - Juraj
- VPP Device
- thunderx: 1-node topology on Cavium ThunderX. Basic skeleton of docker topology done. Moving on to deploying the scripts. Juraj is able to modify and execute the scripts in the container. Remaining fixes: scripts for 1-link 1-node topology and binding interfaces to VPP. A traffic test now runs successfully.
- Kernel Migration on mcbin. Juraj is able to build all the images, but got a kernel panic. Try with a more recent U-Boot version. Tried the latest U-Boot image, but the issue remains.
- Juraj to investigate further work once ThunderX2 is available.
- VPP Performance Test
- perftest - https://jenkins.fd.io/job/vpp-csit-verify-perf-master-2n-skx - Triggered manually now if patch is perf sensitive.
- Work is ongoing on writing scripts for Performance Jobs.
- Development of the L2 test script is underway.
- Khem will get L2 working in CI first, and then IP4 and other test cases.
- Stan has started working on performance scripts with Khem and is able to connect to the Taishan machines in the CSIT lab.
- The performance topology in the wiki link is to be updated per the file below.
- https://github.com/FDio/csit/blob/master/docs/lab/Testbeds_Xeon_Skx_Arm_Atom.md
- Stan and Khem to come up with a summary of current status and an estimate of at least upstreaming basic L2/L3 performance suites.
- Install Ubuntu-18.04 on Huawei Taishan servers first, and then investigate upstreaming the performance test framework to enable AArch64
- Taishan server works with Ubuntu 18.04, CSIT lab updated Ubuntu 18.04 in Taishan
- Install the packages on Taishan server from the cloud repository, to check if VPP can get Intel NICs on Taishan - Lijian
- https://packagecloud.io/app/fdio/master/search?q=19.01-rc0%7E642-g31fe7aa3&filter=debs&filter=debs&dist=ubuntu%2Fbionic
- Stan installed latest CSIT scripts on packet generator server(x86 NEON) and Taishan servers in FD.io lab.
- https://gerrit.fd.io/r/#/c/16850/
- Some of L2 and L3 test cases passed.
- VPP
- Vectorization
- [Lijian] Macro benchmarking on ThunderX2/Centriq(4%)/Taishan D05(10%) is done, data is updated into Jira. Code is in internal review.
- [Lijian] Check if setting default cache line size to 128 will degrade throughput on Taishan/Qualcomm/ThunderX2
- [Sirshak] on ethernet-input node, investigate vectorized buffer index.
- Memory Ordering
- [Sirshak] svm_fifo lockless alternate algorithm for SPSC. - In internal review.
- Action Items - Last Week
- [Sachin] to introduce RFC for IPsec offload support in DPDK plugin.
- [Khem] Deployment of only L2 CSIT performance suite. - Bootstrap script common.sh working and checking the test cases.
- [Juraj] IP4 reassembly and GBP failures are fixed. Patches to enable the two are merged. No test cases remain in the blacklist for AArch64 machines.
- [Juraj] Kernel Migration on mcbin. Juraj is able to build all the images.
- Action Items - Next Week
- [Sirshak] -
01/22/2019
- Attendees
- Sirshak Das
- Juraj Linkeš
- Lijian Zhang
- Stanislav Chlebec
- Khemendra Kumar
- Tina Tsou
- Andy Wang
- Garcia
- Manuel
- Gorka
- Fede
- Honnappa Nagarahalli
- John Ddigilio
- General
- DPDK multi-core scheduler
- https://gerrit.fd.io/r/#/c/15084/
- Please join slack.
- Merge TCP optimization meeting into VPP/Aarch64 community public meeting.
- VPP Hoststack
- Taishan server with Debian distro crashed on the 'ip probe-neighbor' command when running VPP hoststack with iperf3
- With 64-byte packets, on ThunderX2, 10G NIC, VPP hoststack bandwidth is about 1/4 of Linux kernel stack.
- iperf3 performance with Hoststack.
- Investigating lock-less fifo, ip4_local_forward node and memory reordering for VPP hoststack - Sirshak
- Sirshak is trying to set up one fd.io setup for everyone debugging VPP hoststack.
- Gorka is trying some optimal configs for VCL. - No Updates.
- VPP on both sides (iperf3 server and client) gives a boost (reason unknown).
- OcteonTX has been received in the Arm lab. Will boot it up first and then start profiling with it.
- FD.io lab
- ThunderX1 -
- New Arista switch for ThunderX1 is available now. Gathering details required by the LF lab before sending the switch to the CSIT lab. - Juraj - Andy will try to send the switch to CSIT this Thursday.
- ThunderX2 -
- Cable type is confirmed. Procurement is in progress.
- Require access to these servers in FD.io lab.
- CSIT
- VPP Path
- So far so good.
- ARM CI results are overwritten by x86 machines. Should be a Jenkins issue. Monitor whether this corner case happens again. - Juraj
- VPP Device
- thunderx: 1-node topology on Cavium ThunderX. Basic skeleton of docker topology done. Moving on to deploying the scripts. Juraj is able to modify and execute the scripts in the container. Remaining fixes: scripts for 1-link 1-node topology and binding interfaces to VPP.
- Kernel Migration on mcbin. Juraj is able to build all the images, but got a kernel panic. Try with a more recent U-Boot version.
- Juraj to investigate further work once ThunderX2 is available.
- VPP Performance Test
- Work is ongoing on writing scripts for Performance Jobs.
- Development of the L2 test script is underway.
- Khem will get L2 working in CI first, and then IP4 and other test cases.
- Stan has started working on performance scripts with Khem and is able to connect to the Taishan machines in the CSIT lab.
- The performance topology in the wiki link is to be updated per the file below.
- https://github.com/FDio/csit/blob/master/docs/lab/Testbeds_Xeon_Skx_Arm_Atom.md
- Stan and Khem to come up with a summary of current status and an estimate of at least upstreaming basic L2/L3 performance suites.
- Install Ubuntu-18.04 on Huawei Taishan servers first, and then investigate upstreaming the performance test framework to enable AArch64
- Lijian to verify Ubuntu-18.04 on Taishan server.
- Stan installed latest CSIT scripts on packet generator server(x86 NEON) and Taishan servers in FD.io lab.
- https://gerrit.fd.io/r/#/c/16850/
- Some of L2 and L3 test cases passed.
- VPP
- Vectorization
- [Lijian] Macro benchmarking on ThunderX2/Centriq(4%)/Taishan D05(10%) is done, data is updated into Jira. Code is in internal review.
- [Sirshak] on ethernet-input node, investigate vectorized buffer index.
- Memory Ordering
- [Sirshak] svm_fifo lockless alternate algorithm for SPSC. - In internal review.
- Action Items - Last Week
- [Sachin] to introduce RFC for IPsec offload support in DPDK plugin.
- [Khem] Deployment of only L2 CSIT performance suite. - Bootstrap script common.sh working and checking the test cases.
- [Juraj] IP4 reassembly and GBP failures are fixed. Patches to enable the two are merged. No test cases remain in the blacklist for AArch64 machines.
- [Juraj] Kernel Migration on mcbin. Juraj is able to build all the images.
- Action Items - Next Week
- [Sirshak] - To update patch list in VPP/Aarch64 wiki
01/15/2019
- Attendees
- Sirshak Das
- Juraj Linkeš
- Lijian Zhang
- Stanislav Chlebec
- Khemendra Kumar
- Tina Tsou
- Andy Wang
- Garcia
- Manuel
- Gorka
- Fede
- Honnappa Nagarahalli
- John Ddigilio
- General
- DPDK multi-core scheduler
- https://gerrit.fd.io/r/#/c/15084/
- Please join slack.
- Merge TCP optimization meeting into VPP/Aarch64 community public meeting.
- VPP Hoststack
- iperf3 performance with Hoststack.
- Investigating lock-less fifo and memory reordering for VPP hoststack - Sirshak
- Sirshak is trying to set up one fd.io setup for everyone debugging VPP hoststack.
- Gorka is trying some optimal configs for VCL. - No Updates.
- VPP on both sides (iperf3 server and client) gives a boost (reason unknown).
- OcteonTX has been received in the Arm lab. Will boot it up first and then start profiling with it.
- FD.io lab
- ThunderX2 -
- New Arista switch is available now. Gathering details required by the LF lab before sending the switch to the CSIT lab. - Juraj
- Cable type is confirmed. Procurement is in progress.
- CSIT
- VPP Path
- IP4 reassembly and GBP failures are fixed. Patches to enable the two are merged. No test cases remain in the blacklist for AArch64 machines.
- We have voting verify on bionic. Upload nexus disabled but merge job working. - Juraj created LF ticket for nexus upload. Both master merge job and verifying job are working fine.
- ARM CI results are overwritten by x86 machines. Should be a Jenkins issue. Monitor whether this corner case happens again. - Juraj
- VPP Device
- thunderx: 1-node topology on cavium thunderx. Basic skeleton of docker topology done. Moving on to deploying the scripts.
- Kernel Migration on mcbin. Juraj is able to build all the images.
- VPP Performance Test
- Work is ongoing on writing scripts for Performance Jobs.
- Development of the L2 test script is underway.
- Khem will get L2 working in CI first, and then IP4 and other test cases.
- Stan has started working on performance scripts with Khem and is able to connect to the Taishan machines in the CSIT lab.
- The performance topology in the wiki link is to be updated per the file below.
- https://github.com/FDio/csit/blob/master/docs/lab/Testbeds_Xeon_Skx_Arm_Atom.md
- Stan and Khem to come up with a summary of current status and an estimate of at least upstreaming basic L2/L3 performance suites.
- VPP
- Vectorization
- [Lijian] Macro benchmarking on ThunderX2/Centriq(4%)/Taishan D05(10%) is done, data is updated into Jira. Code is in internal review.
- Memory Ordering
- [Sirshak] svm_fifo lockless alternate algorithm for SPSC. - In internal review.
- Action Items - Last Week
- [Sachin] to introduce RFC for IPsec offload support in DPDK plugin.
- [Khem] Deployment of only L2 CSIT performance suite. - Bootstrap script common.sh working and checking the test cases.
- [Juraj] IP4 reassembly and GBP failures are fixed. Patches to enable the two are merged. No test cases remain in the blacklist for AArch64 machines.
- [Juraj] Kernel Migration on mcbin. Juraj is able to build all the images.
- Action Items - Next Week
- [Sirshak] - To update patch list in VPP/Aarch64 wiki
01/08/2019
- Attendees
- Sirshak Das
- Juraj Linkeš
- Lijian Zhang
- Stanislav Chlebec
- Khemendra Kumar
- General
- DPDK multi-core scheduler
- https://gerrit.fd.io/r/#/c/15084/
- Please join slack.
- VPP Hoststack
- iperf3 performance with Hoststack.
- Investigating lock-less fifo and memory reordering for VPP hoststack - Sirshak
- Sirshak is trying to set up one fd.io setup for everyone debugging VPP hoststack.
- Gorka is trying some optimal configs for VCL. - No Updates.
- VPP on both sides (iperf3 server and client) gives a boost (reason unknown).
- Action Items - Last Week
- [Sachin] to introduce RFC for IPsec offload support in DPDK plugin.
- [Khem] Deployment of only L2 CSIT performance suite. - Bootstrap script common.sh working and checking the test cases.
- [Lijian] Working on IP4 reassembly and GBP failures. - Fixed. Juraj has upstreamed patches to enable these two tests.
- [Sirshak] Kernel Migration mcbin. Juraj is working on it based on Jianlin's suggestion.
- [Andy] Getting a new Arista switch next year.
- VPP
- Vectorization
- [Lijian] working on vectorized memory copy - Macro benchmarking is done and data is updated to Jira.
- Memory Ordering
- [Sirshak] svm_fifo lockless alternate algorithm for SPSC. - In internal review.
- CSIT
- VPP Path
- VPP Path Failures
- We have voting verify on bionic. Upload nexus disabled but merge job working. - Juraj created LF ticket for nexus upload. Both merge job and verifying job are working fine.
- VPP Device
- thunderx: 1-node topology on cavium thunderx. Basic skeleton of docker topology done. Moving on to deploying the scripts.
- thunderx2: Juraj working with LF to get this resolved.
- mcbin: Juraj can contact Jianlin if needed.
- VPP Performance Test
- Work is ongoing on writing scripts for Performance Jobs.
- Development of the L2 test script is underway.
- Khem will get L2 working in CI first, and then IP4 and other test cases.
- Stan is starting work on VPP performance testing. Khem to send email to Stan about VPP performance testing.
- FD.io lab
- New Arista switch to be procured next year.
- ThunderX2 - Racked. Andy is trying to buy cables compatible with Intel XL710. Juraj to confirm info required by lab people before sending out the cables.
- Action Items - Next Week
12/18/2018
- Attendees
- Sirshak Das
- Juraj Linkeš
- Tina Tsou
- Stanislav Chlebec
- Avinash
- Khemendra
- General
- DPDK multi-core scheduler
- https://gerrit.fd.io/r/#/c/15084/
- Cancelling calls on 25th of Dec and 1st of Jan. Next meeting 8th Jan.
- Please join slack.
- VPP Hoststack
- iperf3 performance with Hoststack.
- Investigating lock-less fifo and memory reordering for VPP hoststack - Sirshak
- Sirshak is trying to set up one fd.io setup for everyone debugging VPP hoststack.
- Gorka is trying some optimal configs for VCL. - No Updates.
- VPP on both sides (iperf3 server and client) gives a boost (reason unknown).
- Action Items - Last Week
- [Sachin] to introduce RFC for IPsec offload support in DPDK plugin.
- [Khem] Deployment of only L2 CSIT performance suite. - Bootstrap script common.sh working.
- [Lijian] Working on IP4 reassembly and GBP failures. - Some preliminary on gbp waiting Neale. Juraj to give access to Lijian to investigate on ThunderX.
- [Sirshak] Kernel Migration mcbin. Status: Jianlin to work with Juraj to get fd.io mcbins up and running. Sirshak to setup a meeting.
- [Andy] Getting a new Arista switch next year.
- VPP
- Vectorization
- [Lijian] working on vectorized memory copy - Still benchmarking and setting it up for internal review.
- [Lijian] Patch for compiling issue with GCC-8.x is under community review. Status: No updates.
- [Lijian] Patch for fixing StringTest failure is under community review. Status: Abandoned.
- [Lijian] Patch for CDP failure is under community review. Status: No updates.
- Memory Ordering
- [Sirshak] svm_fifo lockless alternate algorithm for SPSC.
- Vectorization
- CSIT
- VPP Path
- VPP Path Failures
- https://jira.fd.io/browse/VPP-1475 - IP4 random reassembly failure in master, also seen on x86
- https://jira.fd.io/browse/VPP-1491 - GBP L3/L2 Endpoint Learning failure
- We have voting verify on bionic. Upload nexus disabled but merge job working. Juraj to create LF ticket for nexus upload.
- VPP Device
- thunderx: 1-node topology on cavium thunderx. Basic skeleton of docker topology done. Moving on to deploying the scripts.
- thunderx2: Sirshak working with LF to get this resolved.
- mcbin: Sirshak to set up a meeting between Juraj and Jianlin.
- VPP Performance Test
- Work is ongoing on writing scripts for Performance Jobs.
- Development of the L2 test script is under way.
- Khem will get L2 working in CI first, then IP4 and other test cases.
- FD.io lab
- New Arista switch to be procured next year.
- ThunderX2 - Racked. IPMI Static IP configuration missing. Sirshak with LF.
- Action Items - Next Week
12/11/2018
- Attendees
- Sirshak Das
- Juraj Linkeš
- Tina Tsou
- Stanislav Chlebec
- VPP Hoststack
- iperf3 performance with Hoststack. - Sirshak has done some preliminary benchmarking, comparing kernel and VPP hoststack performance. Three cases: kernel to kernel; kernel to VPP hoststack; VPP hoststack to VPP hoststack.
- Ongoing perf analysis. One patch (https://gerrit.fd.io/r/#/c/16184/) is merged, and the other is under internal review.
- Investigating lock-less fifo and memory reordering for VPP hoststack - Sirshak
- Sirshak is trying to set up one CSIT setup for everyone debugging VPP hoststack. Will share setup info.
- Gorka is trying some optimal configs for VCL.
- VPP on both sides (iperf3 server and client) gives a boost (reason unknown).
- Action Items - Last Week
- [Sachin] to introduce RFC for IPsec offload support in DPDK plugin.
- [Khem] Deployment of only L2 CSIT performance suite. Two L2 performance suite scripts for the CI management repository are done; scripts for the CSIT repository are under investigation, and three more scripts are to be developed.
- [Lijian] Working on IP4 reassembly and GBP failures
- [Sirshak] Kernel Migration mcbin. Status: Sirshak to try inputs from Garcia and Damjan. - No progress so far. - To confirm with Jianling and Joyce.
- VPP
- Vectorization
- [Lijian] working on vectorized memory copy - Second priority, no update so far.
- [Lijian] Patch for compiling issue with GCC-8.x is under community review.
- [Lijian] Patch for fixing StringTest failure is under community review.
- [Lijian] Patch for CDP failure is under community review.
- Memory Ordering
- [Sirshak] To start work on Arithmetic and Logic relaxed functions.
- Vectorization
- VPP Path failures
- https://jira.fd.io/browse/VPP-1475 - IP4 random reassembly failure in master, also seen on x86
- https://jira.fd.io/browse/VPP-1491 - GBP L3/L2 Endpoint Learning failure
- CSIT
- VPP Path
- Everything is ready. The only remaining step is to get the CI patch merged.
- VPP Device
- thunderx: 1-node topology on cavium thunderx is in place, but there are errors. Will continue investigation.
- thunderx2: Racked. Lack of static IP. Sirshak gave Anton a workaround for the missing static IP.
- mcbin: Kernel issue, suggestion from Garcia and Damjan yet to be tried. To confirm with Jianling and Joyce - Lijian
- VPP Performance Test
- Work is ongoing on writing scripts for Performance Jobs.
- Development of the L2 test script is under way. Khem will get L2 working in CI first, then IP4 and other test cases.
- VPP Path
- FD.io lab
- Arista switch is still not working. Andy and Anton are working out the exact requirements for the switch. Two possible options: Andy to replace the Arista or buy a new one.
- ThunderX2 - Racked. Lack of IP.
- Action Items - Next Week
- [Lijian] to continue to investigate make test failures.
- [Andy] to work with Anton to resolve Arista problem.
12/04/2018
- Attendees
- Sirshak Das
- Andy Wang
- Juraj Linkeš
- Khemendra
- Garcia
- Manuel
- Gorka
- Fede
- Tina Tsou
- VPP Hoststack
- iperf3 performance with Hoststack. - Sirshak has done some preliminary benchmarking, comparing kernel and VPP hoststack performance. Three cases: kernel to kernel; kernel to VPP hoststack; VPP hoststack to VPP hoststack.
- Ongoing perf analysis. Two patches ongoing. One is upstreamed and the other is under internal review. Hotspots in memory copy or possibly elsewhere.
- Sirshak is trying to set up one CSIT setup for everyone debugging VPP hoststack. Will share setup info.
- Gorka is trying some optimal configs for VCL.
- VPP on both sides (iperf3 server and client) gives a boost (reason unknown).
- Action Items - Last Week
- [Sachin] to introduce RFC for IPsec offload support in DPDK plugin.
- [Khem] Deployment of only L2 CSIT performance suite. Two L2 performance suite scripts for the CI management repository are done; scripts for the CSIT repository are under investigation, and three more scripts are to be developed.
- [Lijian] VPP dlmalloc crash issue root-caused and fixed by maintainer. Florin Coras fixed time-out issues.
- [Sirshak] Kernel Migration mcbin. Status: Sirshak to try inputs from Garcia and Damjan. - No progress so far. - To confirm with Jianling and Joyce.
- VPP
- Vectorization
- [Lijian] working on vectorized memory copy - Second priority, no update so far.
- [Lijian] Patch for compiling issue with GCC-8.x is under internal review.
- [Lijian] Patch for fixing StringTest failure is under internal review.
- Memory Ordering
- [Sirshak] To start work on Arithmetic and Logic relaxed functions.
- Vectorization
- CSIT
- VPP Path
- https://jira.fd.io/browse/VPP-1475 - IP4 random reassembly failure in master, also seen on x86
- https://jira.fd.io/browse/VPP-1476 - L2FIB failures in master, also seen on x86 - fixed
- https://jira.fd.io/browse/VPP-1491 - GBP L3/L2 Endpoint Learning failure
- https://jira.fd.io/browse/VPP-1490 - Traffic doesn't work in make test, 1604 issue (pmalloc issue) - current status to be confirmed
- https://jira.fd.io/browse/VPP-1497 - Cannot run in parallel problem - fixed
- VPP-1476, VPP-1475, VPP-1478. These failures are seen on Debian x86 VM also.
- Get CSIT/AArch64 passing with partial test cases - Juraj - https://gerrit.fd.io/r/#/c/16282/
- VPP dlmalloc crash issue root-caused and fixed by maintainer.
- Florin Coras fixed time-out issue.
- VPP Device
- thunderx: 1-node topology on cavium thunderx is in place, but there are errors. Will continue investigation.
- thunderx2: Racked. Lack of IP. To confirm with Anton.
- mcbin: Kernel issue, suggestion from Garcia and Damjan yet to be tried. To confirm with Jianling and Joyce - Lijian
- VPP Performance Test
- Work is ongoing on writing scripts for Performance Jobs.
- Development of the L2 test script is under way. Khem will get L2 working in CI first, then IP4 and other test cases.
- VPP Path
- FD.io lab
- Arista switch is still not working. Andy and Anton are working out the exact requirements for the switch. Two possible options: Andy to replace the Arista or buy a new one.
- ThunderX2 - Racked. Lack of IP.
- Action Items - Next Week
- [Lijian] to continue to investigate make test failures.
- [Andy] to work with Anton to resolve Arista problem.
11/27/2018
- Attendees
- Juraj Linkeš
- Khemendra
- Garcia
- Manuel
- Gorka
- Fede
- Tina Tsou
- VPP Hoststack
- iperf3 performance with Hoststack. - Sirshak has done some preliminary benchmarking, comparing kernel and VPP hoststack performance.
- Ongoing perf analysis, two patches ongoing. Hotspots in memory copy or possibly elsewhere. Will share patches with community. - Sirshak
- Sirshak is trying to set up one CSIT setup for everyone debugging VPP hoststack. Will share setup info.
- Gorka is trying some optimal configs for VCL.
- VPP on both sides (iperf3 server and client) gives a boost (reason unknown).
- Alternate test cases.
- Action Items - Last Week
- [Sachin] to introduce RFC for IPsec offload support in DPDK plugin.
- [Khem] Deployment of only L2 CSIT performance suite. Status: Working with Juraj to get scripts ready for jobs. - Need to prepare some scripts. First to understand how the script works and then add more options.
- [Lijian] Status on VPP path failures. Status: Still debugging. Still at early study stage.
- [Sirshak] Kernel Migration mcbin. Status: Sirshak to try inputs from Garcia and Damjan. - No progress so far.
- VPP
- Vectorization
- [Lijian] working on vectorized memory copy
- Memory Ordering
- [Sirshak] To start work on Arithmetic and Logic relaxed functions.
- Vectorization
- CSIT
- VPP Path
- 3 failures currently stalling deployment.
- VPP-1476, VPP-1475, VPP-1478
- These failures are seen on Debian x86 VM also.
- Parallelization (n=32) is resulting in failures. It also seems to be caused by the two patches below.
- VPP-1490, caused by https://gerrit.fd.io/r/#/c/15106/ and https://gerrit.fd.io/r/#/c/15534/.
- VPP-1491 (GBP failure) and VPP-1497 (parallelization) are filed.
- Get CSIT/AArch64 passing with partial test cases - Juraj
- VPP Device
- thunderx: Juraj created an LF ticket for wiring the 1-node topology on cavium thunderx.
- thunderx2: to be racked by this Friday.
- mcbin: Kernel issue, suggestion from Garcia and Damjan yet to be tried.
- VPP Performance Test
- Work is ongoing on writing scripts for Performance Jobs.
- L2 test now works manually. Khem is trying to get it working in CI, then IP4 and other test cases.
- VPP Path
- FD.io lab
- Arista switch is missing a cable. Andy will send the tracking number for the cables.
- ThunderX2 - to be racked by this Friday.
- Action Items - Next Week
- [Lijian] to investigate VPP-1490 issue.
- [Andy] to send the tracking number for the cables.
11/20/2018
- Attendees
- Sirshak Das
- Andy Wang
- Juraj Linkeš
- Khemendra
- Garcia
- Manuel
- Gorka
- Fede
- Tina Tsou
- VPP Hoststack
- iperf3 performance with Hoststack. - Sirshak has done some preliminary benchmarking, comparing kernel and VPP hoststack performance.
- Ongoing perf analysis, two patches ongoing. Hotspots in memory copy or possibly elsewhere. Will share patches with community. - Sirshak
- Sirshak is trying to set up one CSIT setup for everyone debugging VPP hoststack. Will share setup info.
- Gorka is trying some optimal configs for VCL.
- VPP on both sides (iperf3 server and client) gives a boost (reason unknown).
- Alternate test cases.
- Action Items - Last Week
- [Sachin] to introduce RFC for IPsec offload support in DPDK plugin.
- [Khem] Deployment of only L2 CSIT performance suite. Status: Working with Juraj to get scripts ready for jobs. - Need to prepare some scripts. First to understand how the script works and then add more options.
- [Lijian] Status on VPP path failures. Status: Still debugging. Still at early study stage.
- [Sirshak] Kernel Migration mcbin. Status: Sirshak to try inputs from Garcia and Damjan. - No progress so far.
- VPP
- Vectorization
- [Lijian] working on vectorized memory copy
- Memory Ordering
- [Sirshak] To start work on Arithmetic and Logic relaxed functions.
- Vectorization
- CSIT
- VPP Path
- 3 failures currently stalling deployment.
- VPP-1476, VPP-1475, VPP-1478
- These failures are seen on Debian x86 VM also.
- Parallelization (n=32) is resulting in failures. It also seems to be caused by the two patches below.
- VPP-1490, caused by https://gerrit.fd.io/r/#/c/15106/ and https://gerrit.fd.io/r/#/c/15534/.
- VPP-1491 (GBP failure) and VPP-1497 (parallelization) are filed.
- Get CSIT/AArch64 passing with partial test cases - Juraj
- VPP Device
- thunderx: Juraj created an LF ticket for wiring the 1-node topology on cavium thunderx.
- thunderx2: to be racked by this Friday.
- mcbin: Kernel issue, suggestion from Garcia and Damjan yet to be tried.
- VPP Performance Test
- Work is ongoing on writing scripts for Performance Jobs.
- L2 test now works manually. Khem is trying to get it working in CI, then IP4 and other test cases.
- VPP Path
- FD.io lab
- Arista switch is missing a cable. Andy will send the tracking number for the cables.
- ThunderX2 - to be racked by this Friday.
- Action Items - Next Week
- [Lijian] to investigate VPP-1490 issue.
- [Andy] to send the tracking number for the cables.
11/12/2018
- Attendees
- Sirshak Das
- Andy Wang
- Juraj Linkeš
- Khemendra
- Garcia
- Gorka
- VPP Hoststack
- iperf3 performance with Hoststack. - Sirshak has done some preliminary benchmarking, comparing kernel and VPP hoststack performance.
- Ongoing perf analysis, two patches ongoing. Hotspots in memory copy or possibly elsewhere. - Sirshak
- Sirshak is trying to set up one CSIT setup for everyone debugging VPP hoststack. Will share setup info.
- Gorka is trying some optimal configs for VCL.
- VPP on both sides (iperf3 server and client) gives a boost (reason unknown).
- Alternate test cases.
- Khem to get more information on benchmarking DMM. Khem to send the information to