DASH Workgroup Community Update 8/20/2025

6 views
Skip to first unread message

Kristina Moore

unread,
Sep 3, 2025, 8:52:30 PMSep 3
to sonic...@googlegroups.com, sonic-o...@lists.sonicfoundation.dev, eddie.ruan, guizhao.lh, Yanfeng, Yuezhou, Murthy Vakkalagadda, Arun, Doddapaneni, Krishna, Moopath velayudhan, Mukesh, Narayanan, Swaminathan, Selvarajan, Arunachalam, Srinivasan, Vijay, Sundara Murthy Gurunathan, Thyamagundalu, Sanjay, Veerappan, Senthilnathan, Venkatesh Srinivasan, Marc Meunier, Chid, Harrish SJ, Madhu, Israel Meilik, Jai Kumar, Lisa Nguyen, Mohammad Hanif, Sandeep Balani, Suresh Satapati, Kannan Selvaraj, grboudre, Hon Lon Lum (honllum), janapal, nissampa, Sid Singhal, vijamoha, Abdel Baig (abdbaig), Anand Srinivasan, Anant Kishor Sharma, Andrew Lyle, Andy Fingerhut, Andy Fingerhut, Ansel Li, Bhagyashree Hanumaiah (bhanumai), Bhavani, Carol Gal (cgal), Don Ewald (doewald), Dylan Peterson (dypeters), Franko Zamora Chacon (fzamora), Guy Duryee (guduryee), Jack Sexton (jacsexto), Joanna Li (joannali), Julia Tamayo (juledesm), Keerthy Erode Mohanasundaram (keerodem), Ken Parker (kentp), Krithika Srinivas (kritsrin), murali Venkateshaiah (muraliv), Perumal Venkatesh (pevenkat), Praveen Bhagwatula (pbhagwat), Ramesh Raghupathy (ram), Rob Murphy (robermur), Satish Ananthanarayana (sanantha), Selvam Ramanathan (selraman), Sudhir Kayamkulangara, TJ Barker (tjbarker), Venkat Sukavanam (vsukavan), Wenchung Wang (vincwang), Yue Gao (yuega2), Joseph White, Mark Sanders, Phaniraj Vattem, Senthil Kumar Ganesa, Shawn Dube, Venkatesan Mahalinga, Faisal Khan, Farhan Tariq J, Mohammad Qasim Farooqi, Saad Mazhar GMail, Zafir, Zarif Hafeez GMail, Ahmed Guetari, Chris McDonald, Heath Parrott, Joel Moses, John Gruber, Tony Torzillo, Ziv Saar, Ravindran Suresh, jame...@geico.com, Amith, Erum Frahim, Ghani, Ixim, Kwangsuk, Lin Songnan, Mahendar Byra, Meyappan K Gmail, Nitesh, Piotr P, Ravi, RS4681, Venkat External, Yoyo, Dean Lee, Alberto Villarreal, Alex Bortok, Chris Sommers, Manodipto Ghose, Mircea Dan Gheorghe, Nitesh Jha, Swaminathan Balasubramanian, Vinod Kumar, Alexander Cheskis, Mike Woster, Kishore Atreya, Sonny Mei, Brad House, Christian Kuhtz, John Evans, Rawal, Amol (Nokia - US/Westford), Abdul Rouff, Alan Lo, E Blatt, Eilon Greenstein, Gagan Punathil Ellath, Idan Hac, Liat Grozovik, Marian Pritsak, Matty Kadosh, Nikhil Sandugula, Oleksandr Ivantsiv, Paul Cummins, Shay Schlafman, Venice Hawa, Wei Bai, Yohad Tor, Yuval Degani, Madhu, Jamal Hadi Salim, Andriy Kokhan, Leonid Khedyk, Mykola Zhuravel, Tetyana Zubova, Michael Offel, Philipp Keydel, VolodymyrX Mytnyk, Aditya Sahni, Mahaboob Gani, Pranay Sahay, Sairam Rangaswamy, Satya Valli Rama, Sohan Prabhu (TATA CONSULTANCY SERVICES LTD), Syed Mehemood, Richard Wu, Kanza Latif, Muhammad Ali, rimsh...@xflowresearch.com, Wajahat Razi, Bud Grise, Ezra Y, John C Carney, Ted Weatherford, Vincent L, Chatterjee, Deb, Cristian Dumitrescu, Dan Peng, Limaye, Namrata, Naren Mididaddi, Paul Kappler, Rao, Radhika, Shan Greer, Shweta Shrivastava, Singhai, Anjali, Stephen Doyle, Subramanian, Maheswari

Hello DASH Community –thank you for your time prior to Labor Day weekend.  Many stakeholders have returned and we will next meet on September 10th, 2025.    

We continue to have a potential contribution back up for grabs.  It would be great to have a volunteer to suggest a PR in the dash-sonic-hld (in the SONiC repo here) for commands to show ENI counters and DPU global metrics – please submit a PR if you are interested!

 

In Summary, we covered ongoing HA feature development, such as the integration of HAMgrD into the dash-ha container, new HA scope fields, and related testing, with updates on unit and integration testing progress.  We are addressing issues with ZMQ data loss and DPU restart message drops, implementing fixes such as setting the ZMQ_IMMEDIATE flag and lazy binding to improve reliability during connection interruptions.  In collaboration with AMD we are working on VNET mapping improvements, including tracking PA validation entries to ensure correct association of port-map IDs.  Lastly we discussed OCP 2025 participation, and returning time on the 1st Wednesday of each month (starting in September). 

Also, I am looking to leverage the Linux Foundation lists more.  If you could please take the time to enter your info into the list here, I can initiate deletion of the sonic-dash@googlegroups list we used when we began the project.    

 

Follow-up tasks:

·        Graceful Shutdown PR Review: Address and respond to review comments from Dawei on the DPU module graceful shutdown support PRs across the three repositories. (Ramesh)

·        HAMgrD Testing: Complete unit testing and provide final update on adding HA manager D to the dash-ha container. (Fred, Ramesh)

·        HA Scope Fields Update: Follow up on the new HA scope fields PR to understand changes and coordinate for a possible future presentation. (Ramesh)

·        VNET Mapping Entry Fix: Monitor and resolve build pipeline issues for the fix to push VNET mapping entries associated with port map in the data path. (Mukesh)

 

In Summary (full list below), since the last Community call we have:

12 PRs Completed (-7)

9 in To Do (+/- 0)

6 in Draft (+ 2)

39 in Progress (+3)

9 Awaiting Review (+/- 0)

 

Just a reminder that we would encourage/invite Community members to present to the Community (test runs or progress, new scenarios, etc…), just ‘r’ to let me know, or generate a PR in the repo.

The DASH YouTube channel link is here to subscribe / access WG content (and click the bell to receive notifications). 

 

Thank you for your time/contributions – see you on 9/10/2025

 

Meeting Title:  SONiC-DASH-Workgroup Community Meeting #158

Attendees (13):

DASH Group to join: https://groups.google.com/g/sonic-dash

Linux Foundation list: https://lists.sonicfoundation.dev/g/SONiC-Dash

 

Abdul Rouff - Nvidia

Gagan Punathil Ellath - Nvidia

Ramesh Raghupathy - Cisco

Bud Grise - XSightLabs

Kristina Moore - MSFT

Selvarajan, Arunachalam - AMD

Don Ewald - Cisco

Mircea Dan Gheorghe - Keysight

Swami Balasubramanian - Keysight

Farhat Ullah - DreamBig Semi

Philipp Keydel - Synogate

Veerappan, Senthil - AMD

Vivek Reddy Karri - Nvidia

 

  

 

Full DASH Community Notes 😊  

  • Pull Request Progress and Review Updates:
    • PR Completion and Tracking: reported that 19 PRs were completed in the last couple of weeks, with ongoing efforts to visualize PR trends over time using Power BI.
    • HA and Platform PRs: Several PRs related to high availability (HA) and platform support were discussed, including the addition of HAMgrD to the dash-ha container, module graceful shutdown support, and fixes for ZMQ lost data. Ramesh and Fred were identified as key contributors, with Ramesh confirming that most comments on the graceful shutdown PRs had been addressed and further review was planned.
    • Dashboarding and Visualization: working to develop trends and dashboards to visualize PR activity, aiming to provide the team with insights into development velocity and bottlenecks.
  • HA Feature Development and Testing: The team discussed ongoing HA feature development, such as the integration of HAMgrD into the dash-ha container, new HA scope fields, and related testing, with updates on unit and integration testing progress.
    • HAMgrD Integration: Cisco is working on adding HAMgrD to the dash-ha container, updating supervisor.conf, and ensuring it is started as a critical process. The integration is in the final stages, with unit testing nearly complete and final validation underway.
    • HA Scope Fields Update: changes to align HA scope fields with the HLD, including rejecting object creation if the set is not created, mapping set IDs, and retrieving VIPs. Ramesh was not directly involved but agreed to follow up on the changes.
    • Testing Status: AMD reported that unit and integration testing for the HA manager and related features is nearly complete, with only final validation steps remaining before merging.
  • ZMQ and Data Path Reliability Fixes: addressed issues with ZMQ data loss and DPU restart message drops, implementing fixes such as setting the ZMQ_IMMEDIATE flag and lazy binding to improve reliability during connection interruptions.
    • ZMQ Lost Data Fix: A draft PR was created to address ZMQ data loss when the connection breaks, particularly affecting the first DASH object configuration after DPU power cycles. The fix involves setting the ZMQ_IMMEDIATE flag to ensure data is only sent when the server is ready.
    • DPU Restart Message Drop: Another PR targets message drops during DPU restarts by implementing lazy binding in the OrchAgent, ensuring that handlers are registered before messages are received to prevent data loss during startup.
  • VNET Mapping and Platform-Specific Updates: MSFT and AMD are working on VNET mapping improvements, including tracking PA validation entries and ensuring correct association of port-map IDs, with Mukesh addressing build pipeline issues and Lawrence handling private link redirect fixes.
    • VNET Mapping Entry Fix: Mukesh implemented a fix to push the VNET mapping entry's port-map attribute to SAI, ensuring proper association in the data path. The fix is under review, with some build pipeline issues delaying completion.
    • Private Link Redirect: Lawrence is addressing a fix for Private Link Redirect, ensuring that the DASH VNET mapping table processes port map fields correctly (in the SWSS repository).
    • Tracking PA Validation Entries: A new feature is being developed to track how many VNET mappings are using a given PA validation entry, improving visibility and management of VNET mappings.
  • Platform-Specific Bug Updates: NVidia reported on open bugs related to SmartSwitch and platform PCIe checks, noting that some PRs are awaiting review by Liat from NVIDIA and others have been shared with Cisco for feedback.
  • Meeting Logistics and Upcoming Events: announced plans to adjust the meeting schedule removing the 1st Wednesday of each month; discussed potential attendance at OCP 2025, and encouraged participants to add their information on the Linux Foundation list.
    • Meeting Schedule Changes: proposed canceling the first Wednesday meeting each month to provide the team with a short break, starting in September.
    • OCP 2025 Attendance: Kristina confirmed approval for travel to OCP 2025 in October and invited others to coordinate if attending
    • Linux Foundation List Update: Kristina requested that team members update their names on the Linux Foundation list to improve communication and tracking.

 

 

 

Sticky for Links/Reference:

 

 

DASH Groups to join to receive Invites, Meeting Notes, and Comms

DASH: https://groups.google.com/g/sonic-dash    

DASH-Test-Workgroup Group: https://groups.google.com/g/sonic-dash-test-workgroup  

Linux Foundation list: https://lists.sonicfoundation.dev/g/SONiC-Dash

If anyone knows potentially interested people who would like info re: our community, please have them joins these groups for receive Comms, etc…

Links to Recording 

Teams/Sharepoint:

SONiC-DASH Workgroup Community Meeting-20250820_090353-Meeting Recording.mp4

 

DASH Community YouTube:
https://youtu.be/aB-TvtvsYoU
 

HA moved to SmartSwitch LF group on Thursdays

YouTube Behavioral Model:
No agenda this week

8/20/2025 DASH Community Call; please request access via the link if you are not able to view/listen

Azure DASH GitHub Repo:                     

https://github.com/sonic-net/DASH

 


Test/Docs folder:

https://github.com/sonic-net/DASH/blob/main/test/docs/dash-test-workflow-saithrift.md

Ideal test workflow is here, converted to .md

SAI Thrift     

SAI Thrift PR

Client server needed for testing

P4

https://opennetworking.org/p4/ and https://p4.org/working-groups/

Open source, domain-specific programming language for network devices, specifying packet processing for data plane devices (switches, routers, NICs, filters, etc.)

PINS

https://opennetworking.org/pins/

 

PNA consortium spec

https://p4.org/p4-spec/docs/PNA-v0.5.0.html

An architecture describing the structure and common capabilities of network interface controller (NIC) devices which process packets transiting one or more interfaces and a host system.

Describes the structure and capabilities of the pipeline, and a user program, which specifies the functionality of the programmable blocks within that pipeline. For more information, see the P4 Language Consortium specifications

IPDK

Infrastructure Programmer Development Kit (ipdk.io) and

https://github.com/ipdk-io/ipdk-io.github.io

IPDK is an open source, vendor agnostic framework of drivers and APIs for infrastructure offload and management which runs on a CPU, IPU, DPU or switch. IPDK runs in Linux and uses a set of well-established tools such as DPDK and P4 to enable network virtualization.

bmv2

https://github.com/p4lang/behavioral-model

The second version of the reference P4 software switch, nicknamed bmv2 (for behavioral model version 2). The software switch is written in C++11. It takes as input a JSON file generated from your P4 program by a P4 compiler and interprets it to implement the packet-processing behavior specified by that P4 program

DPDK

https://www.dpdk.org/

DPDK is the Data Plane Development Kit which consists of libraries to accelerate packet processing workloads running on a wide variety of CPU architectures.

Linux Foundation SmartSwitch

https://lists.sonicfoundation.dev/g/sonic-smartswitch/calendar

 

 

 

Thank you again for your participation…

Kristina Moore MBA, M.S., CISSP - Azure Core Principal PM / DASH & SmartSwitch
Office: 425-722-7720     Mobile: 425-876-2040     Email:
kri...@microsoft.com
DASH Group to join: https://groups.google.com/g/sonic-dash    
Linux Foundation:  
https://lists.sonicfoundation.dev/g/SONiC-Dash
ImageTitle: LinkedIn - Description: image of LinkedIn icon

 

 

Kristina Moore

unread,
Sep 9, 2025, 7:14:49 PMSep 9
to sonic...@googlegroups.com, sonic-o...@lists.sonicfoundation.dev, eddie.ruan, Yanfeng, Yuezhou, Murthy Vakkalagadda, Arun, Moopath velayudhan, Mukesh, Selvarajan, Arunachalam, Sundara Murthy Gurunathan, Veerappan, Senthilnathan, Venkatesh Srinivasan, Marc Meunier, Chid, Harrish SJ, Madhu, Israel Meilik, Jai Kumar, Lisa Nguyen, Sandeep Balani, Suresh Satapati, Kannan Selvaraj, grboudre, Hon Lon Lum (honllum), janapal, nissampa, Sid Singhal, vijamoha, Anand Srinivasan, Anant Kishor Sharma, Andrew Lyle, Andy Fingerhut, Andy Fingerhut, Ansel Li, Bhagyashree Hanumaiah (bhanumai), Bhavani, Carol Gal (cgal), Don Ewald (doewald), Dylan Peterson (dypeters), Franko Zamora Chacon (fzamora), Guy Duryee (guduryee), Jack Sexton (jacsexto), Julia Tamayo (juledesm), Krithika Srinivas (kritsrin), murali Venkateshaiah (muraliv), Ramesh Raghupathy (ram), Selvam Ramanathan (selraman), Sudhir Kayamkulangara, TJ Barker (tjbarker), Yue Gao (yuega2), Joseph White, Mark Sanders, Phaniraj Vattem, Senthil Kumar Ganesa, Shawn Dube, Venkatesan Mahalinga, Faisal Khan, Farhan Tariq J, Mohammad Qasim Farooqi, Saad Mazhar GMail, Zafir, Zarif Hafeez GMail, Ahmed Guetari, Chris McDonald, Heath Parrott, Joel Moses, John Gruber, Tony Torzillo, Ziv Saar, Ravindran Suresh, jame...@geico.com, Amith, Erum Frahim, Ghani, Ixim, Kwangsuk, Lin Songnan, Mahendar Byra, Meyappan K Gmail, Nitesh, Piotr P, Ravi, RS4681, Venkat External, Yoyo, Dean Lee, Alberto Villarreal, Chris Sommers, Manodipto Ghose, Mircea Dan Gheorghe, Nitesh Jha, Swaminathan Balasubramanian, Vinod Kumar, Alexander Cheskis, Mike Woster, Kishore Atreya, Sonny Mei, Brad House, Christian Kuhtz, John Evans, Rawal, Amol (Nokia - US/Westford), Abdul Rouff, E Blatt, Eilon Greenstein, Gagan Punathil Ellath, Idan Hac, Liat Grozovik, Marian Pritsak, Nikhil Sandugula, Oleksandr Ivantsiv, Paul Cummins, Shay Schlafman, Venice Hawa, Wei Bai, Yohad Tor, Yuval Degani, Madhu, Jamal Hadi Salim, Andriy Kokhan, Leonid Khedyk, Mykola Zhuravel, Tetyana Zubova, Michael Offel, Philipp Keydel, VolodymyrX Mytnyk, Aditya Sahni, Mahaboob Gani, Sairam Rangaswamy, Satya Valli Rama, Syed Mehemood, Richard Wu, Kanza Latif, Muhammad Ali, rimsh...@xflowresearch.com, Wajahat Razi, Ezra Y, John C Carney, Ted Weatherford, Vincent L, Cristian Dumitrescu, Dan Peng, Limaye, Namrata, Naren Mididaddi, Paul Kappler, Rao, Radhika, Shan Greer, Shweta Shrivastava, Stephen Doyle

Hello DASH Community, just a quick note to say that we plan to meet at the normal time tomorrow morning at 9am Pacific Time – I hope to see you there.
Please bring any HLD, Q&A, or topic you would like to discuss.

Thank you,

 

 

Kristina Moore MBA, M.S., CISSP

Azure Core Principal PM - DASH, SmartSwitch, & Network Function Disaggregation


Office: 425-722-7720     Mobile: 425-876-2040     Email: kri...@microsoft.com
DASH Group to join:
https://groups.google.com/g/sonic-dash    

 

 


***********************

Reply all
Reply to author
Forward
0 new messages