MoM of today's OCP SONiC call 3/15/2022Topics discussed.
- How much performance gain/ saving time using batched requests API - 8%
- How does the batch write API handles in case of a process crashed ? In case a consumer crashes, Redis queues up the requests and the process should be able to consume it.
- How does the batch process API handles the scenarios priority inversion Ex: Orchagent is single threaded, can be blocked due to a high priority task, (Link up/down/ PFC storms need to react quickly, popping routes) - Wu will look into it.
- Is there a limitation to the size of the batch?
- How does it handle the writes failures in a batch? Explain redis transaction vs how producers and consumers behave in case of failure scenarios?
- What use case does this batch API can be used for ? Please list out the use cases in HLD.
- How did you test this API? Do you have any performance numbers?
- PR is out for review - https://github.com/Azure/SONiC/pull/959/files
- General: Community must focus on 202205 SONiC May release for feature delivery.
Thanks,-MadhuOn Tue, Mar 8, 2022 at 8:29 AM MS Reddy <msreddy...@gmail.com> wrote:MoM of today's OCP SONiC call 3/08/2022Topics discussed.
- Discussed on 202205 release feature list and did house cleaning activities.
- We have Identified the owners to migrate docker images to BullsEye.
- Watch out for updated xls from Ying/Zhang and contribute for missing docker packages and update the ownership and reviewers list.
Thanks,-MadhuOn Tue, Mar 1, 2022 at 9:38 PM MS Reddy <msreddy...@gmail.com> wrote:Thanks,MoM of today's OCP SONiC call 3/01/2022Topics discussed.
- How much build time reduction is seen with new improvements? Currently it is measured for one file change, it just takes 5 min to build, however more tests need to be done, more details refer to the HLD below.
- Does the sonic-build-system improvements support ARM builds both 32 & 64 ? yes it is applicable, however tested only for AMD
- Can single user support to issue more than one builds simultaneously? yes it is.
- PR is out for review - https://github.com/Azure/SONiC/blob/9bc83902da0ae1a5db01713a5d0a0611fe876897/doc/sonic-build-system/build-enhancements.md
-MadhuMoM of today's OCP SONiC call 2/22/2022Topics discussed.
- Is the CLI to get current memory usage of the container ? yes docker stats provides that
- If memory condition persists in monit , do u generate tech dump again again? - No generate once by special instruction from Monit
- Are these memory threshold users configurable ? yes and the 200 MB is default available memory
- what kind of report is this tech support? Is there a way to provide a summary report?
- could we support multiple thresholds ? start at 60%, jump at 80% collect one more tech support?
- Is this memory leak or memory thresholds for a container being reported by. syslog today? yes by the Monit process.
- PR is out for review - https://github.com/Azure/SONiC/blob/669409c18d32db90adb92486a1d877c176fb356a/doc/auto_techsupport_and_coredump_mgmt.md
Thanks,-MadhuThanks,MoM of today's OCP SONiC call 1/11/2022Topics discussed.
- Are these passwd rules/policies mandated for REST/HTTP users? sure, will be added to the design.
- Is the passw hardening supported for remote users? No, only local users.
- Is pam_cracklib FIPS compliant? Is it using the open ssl for encrypt/decrypt?
- Is this code part of sonic_mgmt repo ? yes
- Is the feature enabled default? No will be included by compile time, and choose by run time
- Is the passwords can be rotated ? not part of this design, can be thought through the use cases
- Is it possible to provide an informational log for the users/applications about the passwd expiry? will be included..
- How does it handle switch image upgrades w.r.t passwd hardening?
- PR out for review- https://github.com/Azure/SONiC/blob/8edc92e2139d1fd2b7a088396877281116717830/doc/passw_hardening/hld_password_hardening.md
-MadhuOn Thu, Dec 2, 2021 at 5:39 PM MS Reddy <msreddy...@gmail.com> wrote:MoM of today's OCP SONiC call 11/29/2021.Topics discussed.
- Discussed on 202111 community release fork date (11/30/2021 PST)
- Release planning what is in and what will be moved to the next release
- There is a Xls from Zhang for more details marked Yellow & Red.
- Features marked with Yellow need be handled by today with priority
- Features marked with Red will be moved to next release
Thanks,-MadhuMoM of today's OCP SONiC call 11/02/2021.Topics discussed.
- SONiC SAI Challenger - SAI Testing by PLVision.
- Slides will be published soon by PLVision.
Thanks,-MadhuMoM of today's OCP SONiC call 10/19/2021.Topics discussed.Had 202111 release features review, stay tuned for the update.Thanks,-MadhuOn Tue, Oct 12, 2021 at 8:30 AM MS Reddy <msreddy...@gmail.com> wrote:Dynamic policy based Hashing- by NvidiaMoM of today's OCP SONiC call 10/12/2021.Topics discussed.NVGRE - by Vadym/Nvidia
- Does the design provide capability checks for tunnel resources from vendors (no.of NVGRE supported)? Ans>> No.
- How about vNet routing support on NVGRE tunnels? Not supported, this feature does encap/decap tunnelled packets.
- PR is out for review - https://github.com/Azure/SONiC/pull/869
Note: Today was the last HLD discussion for the 2106 release.
- Not ready, will be postponed to the next release.
Thanks,-MadhuSystem Ready Enhancements - by Senthil Kumar Guruswamy
- MoM of today's OCP SONiC call 10/05/2021.
Topics discussed.CMIS Diagnostics - by Dante Su
- There is SFP Refactoring, how does this design different from that effort? >> Debate: This solution will coexist with sfp refactoring efforts, however later will merge into SFP refactoring.
- Are there any impacts with current SFPUtil show commands with new additions ? No impact, there will be new application advertisement, pl refer the CLI section for review
- PR is out for review - https://github.com/Azure/SONiC/pull/876
- How is this different from the current Monit feature ? Ans>> The Monit summary provides platform status such as LED etc.. it shows the running status of the container not the application readiness
- Does this design consider Application readiness vs liveness ? Readiness means all the dependent modules up and application ready to serve the traffic, how about the application hogs on memory cycle or runs out of threads not able to service requests? will it be possible to include liveness capability into the design?
- https://github.com/Azure/SONiC/pull/875/files
Miscellaneous - by XIn
- OCP date - Nov 9th- 10th
- OCP Schedule will be published on OCP website
- What is the mode of OCP workshop - virtual
- SONiC/ SAI Workshop - Tech Talk / Contribution / Proposal / Innovations - Let's plan for immediate after /next day OCP Nov 9th-10th
Thanks,-MadhuOn Tue, Sep 28, 2021 at 9:01 AM MS Reddy <msreddy...@gmail.com> wrote:Topics discussed.Host Interface counters - MLNX/Chen
- Can this design support packet drop counters due to DDOS attacks? a few options: 1. You can attach a policer to the drop counter. You can get flow counters from policer stats>> Chen will look into it.
- PR is out for review - https://github.com/Azure/SONiC/pull/858
Guidelines for reference proprietary code - John/Metaswitch
- Is this design proposal applicable only to Metaswitch? >> No
- How about every company wanting to add their own routing stack, what is the recommendation? >> application extension model would be a great fit here.
- Reference to check - https://github.com/vadymhlushko-mlnx/SONiC/blob/fdb2cae32421affba8a3cec3fda0fee40c091708/doc/cli_auto_generation/cli_auto_generation.md
- PR is out for review - https://github.com/Azure/SONiC/pull/860
Thanks,-MadhuOn Tue, Sep 21, 2021 at 8:58 AM MS Reddy <msreddy...@gmail.com> wrote:MPLS TC_to_TC_map HLD - Alexander (Metaswitch)
- MoM of today's OCP SONiC call 9/21/2021.
Topics discussed.SONiC TACACS+ HLD - Hua Liu (IPAM)
- Is the design supported to show the list of authorised commands ? >> No, the list of commands managed by tacacs server.
- Is the audit support
- How does the design work when a remote TACACS+ failover? >>> as the local server don't know the list of commands? >> this is an issue, in SONiC, users can login and run using local permissions.
- How do I block the commands using bash(/bin/sh)/python etc?
- PR is out for review - https://github.com/liuh-80/SONiC/blob/master/doc/aaa/TACACS%2B%20Design.md
- DB schema should be aligned with Yang model.
- PR is out for review - https://github.com/Azure/SONiC/blob/96a65f0a4d67dc3b0949d5798be51ab10da99c07/doc/qos/mpls_tc_to_tc_map.md
Thanks,
-MadhuOn Tue, Sep 14, 2021 at 6:36 PM MS Reddy <msreddy...@gmail.com> wrote:
- MoM of today's OCP SONiC call 9/14/2021.
Topics discussed.ECMP Overlay BFD support - Prince / MSFT
- Does this feature support BUM traffic ? No - Since Vnet routes are based on unicast routes, it supports Unicast only.
- Does the feature support control plan BFD / FRR BFD ? No, this BFD offload to ASIC
- What is the motivation/use case endpoint monitor ip is different from the actual endpoint running BFD ? The use case is - There are devices support data vs control plane on different ports for monitor purpose
- Can the design support query ASIC BFD capabilities before writing the BFD session ? could be possible, will add into the HLD
- Can the control plan & hardware offload BFD coexists on the same device ? yes/No - Need to brainstorm complexity - Will split the HLD into two, one dedicated to BFD to describe all the scenarios.
- What are the default BFD timers used for offload ? WIll be included into HLD
- Is there any global session bfd table for default values? No
- Community suggested to have BFD into a separate HLD ? Ex: coexistence may cause issues, need to discuss more in details
- How about end user CLI to control BFD session either hardware offload or FRR BFD ? Next phase
- Do you have a BFD state db schema mapped to the Rfc BFD schema ? Will be included in HLD
- How do handle or notify BFD sessions from HW?
- Is it possible to remove BFD routes from the ECMP group ? yes, vNetOrch
- PR is out, please leave comments here - https://github.com/Azure/SONiC/pull/861/files
General Comments
- SUggest that use HLD PR as tracking PRs for related Code PRs. REference - - https://github.com/Azure/SONiC/pull/806
- 202111 feature release Deadline - Oct 1st
- Some features will be delayed to next release - No list yet identified
- Feature owners with HLD ready - if you want to schedule the review - reach out
Yanzhao Zhang
Thanks,-MadhuOn Thu, Sep 2, 2021 at 7:25 AM MS Reddy <msreddy...@gmail.com> wrote:
- MoM of today's OCP SONiC call 8/31/2021.
Topics discussed.SHow Running Command Enhancement - - EdgeCore/MaxChen
- What are the use cases of this feature ? Customers familiar with CISCO like CLI, goal is to make it more convenient or trivial to understand the current running command .
- How does the design handle the maintainability of these improvements?
- How does Yang show the running command ?
- It looks like manual work, shouldn't it be duplicated to the sonic-mgmt-framework which is yang driven/auto generated? >> Not really!
- PR is out, please leave comments here - https://github.com/Azure/SONiC/pull/838
Routed Subinterfaces Enhancement - Preetham/BRCM
- What are the use cases of shorter sub interface naming ?
- Why does it required to bring in the short names? >> Kernel naming limited to 15.
- Where do you store the mapping short to long name child to parent ?
- Suggestion - for consistency reason keep vlan interface mandate.
- Is this config change only for routed subinterfaces ? How do they differentiate ?
- There must be a short name convention in SONiC already, is this design considered that?
- Why can't this short vs long name conversion hide in intrfaceMgrD? >> it looks like the changes are widespread
- PR is out for review - please leave comments here - https://github.com/Azure/SONiC/pull/833
Thanks,-MadhuThanks,
- MoM of today's OCP SONiC call 8/24/2021.
Topics discussed.SAG - Static Anycast Gateway - EdgeCore/MaxChen
- Is the SAG feature enabled by default? >> No, the feature is default disabled.
- Why do I need a knob for SAG? >> Please list down in HLD sections, is there any implications for taking care in data path handling of SAG vs macvlan interfaces.
- Unless it's absolutely necessary, it's not required to have a global knob. A global knob will introduce a lot of complexity and cases to handle. >> So is it necessary to have a knob for SAG?? can the gateway & ip address list can't be sufficient ? >> It seems to be true.
- SagMgrD is not required when we plan to use SVI instead of maxvlan interfaces.
- Is there my hardware resource limit on SAG interfaces ? can it be referred to in CRM?
- Please list down what are the complexities when SAG is enabled along with SVI/macvlan interfaces in data path routing?
- CLI SAG command can be part of interface command.
- PR is out - https://github.com/Azure/SONiC/pull/837
Show running enhancement - EdgeCoreGeneral comments1. Feature owners speed up as the deadline approaches2. Test quality is highest important for Community Features-Madhu
- MoM of today's OCP SONiC call 8/17/2021.
Topics discussed.PINS - P4 Integrated Network Stack - Google/Intel/ONF
- Why don't the design leverage the existing Error DB framework for feedback loop? >> It seems the PINS team is closely working on the Error framework team to address the gaps.
- Is this design different from FlexSAI ? Using PINS, you can model the entire SAI pipeline, not the case with FlexSAI
- What are the advantages of exposing the entire SAI pipeline using PINS? >>> we can do Fuzzing, Automation testing the entire pipeline being exposed.
- What kind of intelligence does the design provide to the applications in terms of network/application/resource errors? >> there is new HLD work in progress.
- What is the plan to support vendor SAI extensions? Can vendors SAI extension be added without recompiling libSAI? >> yes, HLD described it
- How does the design handle the missing redis pub/sub response path / notifications ?
- How about the PINS migration plan in terms of software upgrades vs ASIC upgrades? >>>Please add a section in HLD.
- Can this design work on packet I/O performance improvement? >> So far, the numbers are promising, and will be looked into.
- Can admins run SONiC without P4RT? yes.
- PR is out for review, please provide comments offline - https://github.com/pins/SONiC/blob/pins-hld/doc/pins/pins_hld.md
Thanks,-MadhuThanks,
- MoM of today's OCP SONiC call 8/10/2021.
Topics discussed.SONiC_SFP_refactoring HLD - Arista/MSFT
- How does the design support backward compatibility of existing sfp modules ?
- What are the guidelines for Vendors to implement the common sfp refactor packages? Please list out few examples for vendors to embrace it
- Can the design provide sfp data as Dict including all the sfp fields?
- PR is out for review please leave comments here - https://github.com/Azure/SONiC/blob/bf657839e521fb71e407df18e566a3e09c7e6958/doc/sfp-refactor/sfp-refactor.md
What next:
- Routed Subinterface Enhancement HLD review by Preetham Singh from BRCM – 30 mins
- PINS Main HLD review by Bhagat Janarthannan and team from Google, Intel, ONF – 30 mins
-MadhuOn Sat, Aug 7, 2021 at 1:19 PM MS Reddy <msreddy...@gmail.com> wrote:
- MoM of today's OCP SONiC call 8/03/2021.
Topics discussed.
- Class based Forwarding HLD - MSFT /Tom
- Here is the PR to provide review comments - https://github.com/Azure/SONiC/pull/796
- Q & A
- How does any SONiC user consume this feature? What is the plan? How to enable this feature
- Is the regularSONiC release not enabled by default? yes Tom please confirm
- It doesn't provide any CLI and asks users directly to play around with app_db tables. What is the guidance? Please list down the instructions in the HLD section
- Shouldn't the design be limited to the number of FC ? platform specific
- Can it be the DSCP values more than FC values right? Yes, it is
- Can the design expose the DSCP/EXP values to applications? yes
- Is there any plan to introduce click commands / CLI ? >>> Tom
- A separate table for DSCP_to_FC, and refer to those table names in CLASS_BASED_NEXT_HOP_GROUP_TABLE ?
Thanks,-MadhuOn Sun, Aug 1, 2021 at 5:02 PM MS Reddy <msreddy...@gmail.com> wrote:Thanks,
- MoM of today's OCP SONiC call 7/27/2021.
Topics discussed.
- Tech support dump improvements - Nvidia (Vivek) - Please share the HLD here
- Q & A - How does it handle if the device end up with continuous coredumps? Is there way to ship the core files external?
- 202111 release plan (HLD & Code PR's reviewers)- will be posted shortly by Yanzhao Zhang
- July 23 - call for Paper OCP / OCP website to submit abstract - Select Networking Track - https://www.opencompute.org/summit/global-summit/call-for-papers
- OCP Updates for call for papers.
-MadhuOn Tue, Jul 13, 2021 at 8:58 AM MS Reddy <msreddy...@gmail.com> wrote:Thanks,
- MoM of today's OCP SONiC call 7/13/2021.
Topics discussed.
- 202111 release plan (HLD & Code PR's reviewers)- will be posted shortly by Yanzhao Zhang
- July 23 - call for Paper OCP / OCP website to submit abstract - Select Networking Track - https://www.opencompute.org/summit/global-summit/call-for-papers
- Paper Selection - Aug
-Madhu
- MoM of today's OCP SONiC call 6/22/2021.
Topics discussed.
- 202106 release status & discussed code & PR status
- Wiki will be posted shortly.
- 202106 release cut - 06/30
Thanks,-MadhuOn Tue, Jun 15, 2021 at 8:55 AM MS Reddy <msreddy...@gmail.com> wrote:
- MoM of today's OCP SONiC call 6/15/2021.
Topics discussed.CMIS-C-CMIS [Coherent - Common Management Interface] - Chuan Qin/MSFT
- What is the goal of CMIS? >>>It is to configure and monitor optics/ transceivers.
- Why do need a special daemon, shouldn't be used xrvd or transceiver? .>> It is extending the xrvd
- How does SONiC consume these interfaces? Is it REst interface or CLI? will be discussed
- What are the plans to integrate with SONIC, need more detailed steps w.r.t CMIS interfaces as well as CMD firmware upgrades?
It would be very helpful if there would be a list of APIs which need to be implemented by vendors and how these are used. We are looking to understand which daemons are using it, CLI, etc.
General updates
202106 release updates
202111 release planning
Few timelines
- 202111 feature contribution submission end by 6/25/2021
- 202111 feature roadmap review in community on 7/6/2021
- 202111 release roadmap finalization 7/15/2021
Thanks,-MadhuOn Tue, Jun 1, 2021 at 8:42 AM MS Reddy <msreddy...@gmail.com> wrote:
- MoM of today's OCP SONiC call 6/01/2021.
Topics discussed.SAI Failure Handling - Shi-su
- Can this design extend the error feedback loop to CLI? not in the scope - Shi will come back on it.
- Can this design work only with app_db or can be extended to other db's as well? It could, not in the scope.
- How does it work in conjunction with multi redis db?
- There is Error handling HLD out in the community, please make sure this design should be augmented to it.
- What is the life cycle of error db entries? How do the entries be consumed and cleaned up?
- PR is out for review - https://github.com/Azure/SONiC/blob/312e885c3c19f3e9506cfd10fcc86dbb8eac0309/doc/SAI_failure_handling/SAI_failure_handling.md
Thanks,-MadhuThanks,
- MoM of today's OCP SONiC call 5/18/2021.
Topics discussed.Sonic Dump Utility - Vivek from NVDA
Can this HLD support multi ASIC DB? yes. Need VS image to test the utility, please share a multi ASIC VS image to NVDA team.
Can this utility be used for counters db? yes.
Is there a code PR raised? Not yet.
How is this utility different from redis-dump tools ?
HLD PR is out for review - https://github.com/Azure/SONiC/blob/791a6a22d989ec7d3daa8efd3a45a56fdc3fa156/doc/Dump-Utility.md#overview
-MadhuOn Tue, May 11, 2021 at 5:41 PM MS Reddy <msreddy...@gmail.com> wrote:
- MoM of today's OCP SONiC call 5/11/2021.
Topics discussed.SRv6 HLD - Houdi from Alibaba + Intel
How did you arrive at this requirement ex: SRv6 sidList per Policy = 4 ? Is this specific to Alibaba deployment?
How deep can transit nodes be ? Why is this limited to 3 in HLD? - Again, deployment specific - more headers results in SRH header compression.
How does the design support tying up the policy with routes? right now it supports policy tie up with prefixes. can be enhanced.
The workflow diagram is a little confusing, needs an update and should discuss it once again - pl focus on w.r.t ownership of the route data and who does what modifications + in terms of consumers/producers.
Alibaba/Intel will share the PR for review...
Thanks,-MadhuOn Fri, Apr 30, 2021 at 6:52 PM Madhu Pal <mad...@aviznetworks.com> wrote:Hi Srinadh,Please find answers line:Thanks,-MadhuI would like to understand more about "Don't overload stateDB for events & alarms? It should be advised to use separate redis DB for events & alarms?" comment. Appreciate some responses from the community. I hope I am using the right channel.Eventd is planning to use stateDB to house event history table, alarm table and stats tables. They get updated every time an event/alarm is raised.These tables are of fixed size: event history table size is customizable with maximum size being 40k or time limit of 30 days - at which time, eventd deletes older records. Stats table is of fixed size with a handful of records. Alarm table only contains a record when an alarm is raised and record is removed when the alarm is cleared.How does stateDB gets overloaded?Madhu>> As the operational data stored in state db today, by adding events, alarms and stats (frequent data) into it makes state db easily out of limits. In addition to that a software (eventD open source)/design flaw makes it worse. Unlike APP_DB, State DB is more read friendly, more writes/updates due to events/alarms/stats become performane issues? Btw, do you get a chance to estiamte on state DB with new data / what is plan to mesure performance of state db with ne data writes?Is it because of db writes? OR 40k for history table is too much in a DB?Madhu>> Today each redis instansce setup with multile redis db's ex: APP_DB, STATE_DB etc.. Not 100% sure REDIS memory limits and how does it set for each redis db or any redis profile?? MSFT team can help on this.Using serparate redis DB means, I need to create a new redis instance and create a DB with that insance? OR create a DB off existing redis, redis2, redis3 instances?Madhu>>I'd suggest to use new redis db similar to state db ex: event db or altogether new redis instance (redis1, redis2 - ) ex: https://github.com/Azure/SONiC/blob/master/doc/database/multi_namespace_db_instances.mdOn Tuesday, April 27, 2021, 09:15:18 PM PDT, MS Reddy <msreddy...@gmail.com> wrote:
- MoM of today's OCP SONiC call 4/27/2021.
Topics discussed.Event & Alarm Framework HLD- Srinath - Dell
- How does the design handle an alarm storm ? event cache
- Which component does the event Cache? events
- What is the plan to upstream fixes into eventD?
- What exactly does eventD do? Why can't this be done with redis DB + lua scripts ?
- PR is out for review - https://github.com/Azure/SONiC/blob/ce60b64ee1560d0e6f9f4f19b4e860292a235bad/doc/event-alarm-framework/event-alarm-framework.md
Generalizing config.bcm support all brcm platforms - BRCM
- Who will maintain the common config file? BRCM
- How does the design handle ODM config files vs common config file?
- PR is out for review - https://github.com/Azure/SONiC/pull/699
Thanks,-MadhuOn Tue, Apr 20, 2021 at 6:42 PM MS Reddy <msreddy...@gmail.com> wrote:
- MoM of today's OCP SONiC call 4/20/2021.
Topics discussed.Event & Alarm Framework HLD- Srinath - Dell
- Is this HLD supports SONiC CLi? No, It is supported by mgmt-framework.
- Why can't current syslog be utilized or enhanced?
- What is the motivation choosing eventD?
- How does this HLD integrate with Thermal design HLD which has similar eventing.
- How does this design handle event re-ordering? What suggestions if the events are out of order?
- Don't overload stateDB for events & alarms? It should be advised to use separate redis DB for events & alarms?
- How does the design handle the life cycle of an event/alarm? What are the eviction policies enforced on to the DB?
- Advised to use dynamic json event profile instead of using a static map?
- PR is out for review - https://github.com/Azure/SONiC/blob/ce60b64ee1560d0e6f9f4f19b4e860292a235bad/doc/event-alarm-framework/event-alarm-framework.md
- Review will be continued..
Thanks,-Madhu
- MoM of today's OCP SONiC call 4/13/2021.
Topics discussed.Policy based Hashing - Nvidia
- How does the design calculate hash resources ? There is no SAI API to calculate this. Please add a comment in the HLD.
- How does CRM resources handled with PbH?
- When port is part of the LAG, how does PbH rules PhB table - User/Orch agent should pass LAG to ASIC. - Validation should be taken care of in application.
- Add data flow sequence diagram explains the precedence or out of order handling of hash vs rule ?
- What is the behaviour if PBH table/resources is full ? there is no API - currently raise syslog/error to user and fallback to no PBH - hash will not be created
- Does SONiC track any thresholds ? Currently the ACL thresholds are being tracked but not the scarce resource like ALU / mirror sessions / no thresholds being tracked
- Hashing will be calculated only on inner frames . based on type ipv4 or ipv6 or vxlan user defined
- what are the fields expected to be configured for NVGRE? There is a reference example in HLD
- The PbH data model should be yang complaint? yes there is a section below.
- Is there any way to track ASIC hash resources today? No SOniC Infra support yet.
- PR is out for review - https://github.com/Azure/SONiC/blob/a3f2bde7f938c3db0b49d8acfe947a1320337bb8/doc/pbh/pbh-design.md
Thanks,MadhuThanks,
- MoM of today's OCP SONiC call 4/06/2021.
Topics discussed.SONiC BUM control support - Mohan S
- How does the design calculate CBR, as there is no support for user to configure? >> It is internally calculated by the application - BRCM
- How do you handle unknown multicast & unknown unicast storm control as there are no separate SAI policers? - Mohan will look into it
- Are there any statistics for drop counters due to storm control for each category - Not supported
- How does user stop storm control - By delete storm control config.
- Do you query strom control capability on asic ? How does the application know ASIC has this capability ? mgmt framework would support this feature ?? Mohan will check it
- Is there a sonic_yang support for storm control ? yes
- Share code PRs and Sonic Yang to the community for review - Mohan
- PR out for Review -https://github.com/Azure/SONiC/pull/441
SONiC Mgmt framework - ''show techsupport dump" - Kerry
- Can the design support flexibility of what content needs to be added to the tech support dump ? - yes
- can the design support download the tech support tarBall to clients? - Not yet, will be supported
- PR is out for review - https://github.com/Azure/SONiC/pull/756
-MadhuDHCPV6 Relay support in SONiC -Shlomi Bitton
- MoM of today's OCP SONiC call 3/30/2021.
Topics discussed.Recycle Port support for VOQ Chassis - Eshwaran
- Are these special ports supported by SAI?
- How does the recycle port be discovered and enabled for forwarding?
- Do you have any ASIC capability check? No
- Does it require a reboot ? currently yes.
- Does HLD support CLI for recycle ports - Not yet.
- Is there any strong reason behind the port naming ? Why do we choose the ReCycle port ? Recycle ports vs Recirculation ports? Ex: Recirculation ports already being referenced in p4
- Where do these ports exist? If it is in port-config.ini or platform.json? Pl update the HLD and add these config into platform.json as port_config.init may not be appropriate going forward.
- Are these recycle ports per core or per ASIC?
- DO you define a schema for these ports?
- Are these Recycle ports supported by configDB schema? If yes, should we develop a yang model for it? Yes, I will support it.
- PR is out - https://github.com/Azure/SONiC/blob/eaffd39b1a7887230b8e5f7213c1e12650d07537/doc/voq/recycle_port.md
- How does the design handle dhcpv6 relay w.r.t dynamic port/route changes ?
- Is there any limitation with isp-dhcp package that it requires static configuration? which is not the case with ipv4
- How does the design handle port down/up events wrt dhcp relay packets?
- Scale recommendation - 32 dhcp relay servers
- The Nvidia team will syncUp with Tamar on DHCPV6 Relay for any overlap.
- Nvidia will evaluate isp-dhcp vs dhcp-relay-6 package to find out a way to simplify the configuration.
- PR is out for review- https://github.com/Azure/SONiC/blob/04a14bdf2805b3d3ed5e07b65062958568886e44/doc/DHCPv6_Relay/DHCPv6_Relay_HLD.md
Thanks,-MadhuThanks,
- MoM of today's OCP SONiC call 3/16/2021.
Topics discussed.SONiC Generic Update and Rollback- Mohamad
- How does the design takes care of the JsonPatch ordering? >> there is a design doc linked to the HLD.
- You can do apply-patch any time? What is the gain of using checkpoints? Can these checkpoints leverage successful config rollbacks or only specific to failures? >> for both the cases you can use checkpoints to bring back the configs.
- Is there any data path disruption expected during JsonPatch apply? Yes, there will be minimal disruption.
- How many checkpoints does design support? Where does it store it? >> Checkpoints are stored in a device and there is no limit. There is CLI support for users to control it.
- PR is out for review - https://github.com/Azure/SONiC/blob/b71ebdb8ff959454f372bcda3446211b9a941de9/doc/config-generic-update-rollback/Json_Change_Application_Design.md#111-functional-requirements
- https://github.com/Azure/SONiC/blob/b71ebdb8ff959454f372bcda3446211b9a941de9/doc/config-generic-update-rollback/SONiC_Generic_Config_Update_and_Rollback_Design.md#3115-change-applier
-MadhuThanks,
- MoM of today's OCP SONiC call 3/16/2021.
Topics discussed.SONiC In_band mgmt_vrf - Venkat/Dell
- Is there any special/distinct trap that exists for mgmt_vrf today? Can you update HLD what needs to be configured for mgmt_vrf in order to prioritise and lift the mgmt traffic to the CPU.
- PR is out for review - https://github.com/Azure/SONiC/blob/7781c097a92d9fbac3fc2fe2f8c6ce175839f473/doc/vrf/SONiC_in_band_mgmt_via_mgmt_Vrf_HLD.md
-MadhuOn Tue, Mar 16, 2021 at 8:35 AM MS Reddy <msreddy...@gmail.com> wrote:
- MoM of today's OCP SONiC call 3/9/2021.
Topics discussed.MPLS HLD- Ann Pokora/Juniper
- Can you explain how Juniper crpD coexists with FRR routing stack in SONiC? >> using build time flags
- How does this feature can be unit tested without crpd ? Ex: Few thoughts - user can push directly to redisDB
- What is the use case are we targeting?
- How does the community can leverage this feature?
- For MPLS, why does it only works withJuniper stack? Is there any MPLS functionality missing in FRR? >> yes there are few fixes from Juniper need to push to FRR community
- Can Juniper open source crpd ? >>> There is internal discussion at Juniper.
- What is the reason fpm syncd in a bgp container instead of switch? >> Can be discussed!!
- PR is out for review - https://github.com/Azure/SONiC/pull/706
Thanks,-MadhuThanks,
- MoM of today's OCP SONiC call 3/2/2021.
Topics discussed.Weighted ECMP HLD- ZhenCAI/Juniper
- How does warm boot work with code versions with EMCP comes up Weighted ECMP on warm boot?
- How does applications handle if SAI doesn't support weighted ECMP?
- Can this design support static next hop groups?
- How does this design work in conjunction with Metaswitch nexthop group proposal?
- Is the design query SAI capability before programming SAI about the weight ECMP ?
- Is there any config knob to enable weighted ECMP feature?
- What is the default weight nextHop ?
- Is this feature tightly coupled with the routing stack?
- Is the design providing error handling to handle the weight mismatch ?
- How does the design treat If all the nexthop entries have no weight ?- Ex: treat them as ECMP
- How do know SAI supports weighted ECMP or not? If SAI doesn't support how does ASIC handle it?
- PR is out for review - https://github.com/Azure/SONiC/pull/738
-MadhuOn Tue, Feb 23, 2021 at 9:10 AM MS Reddy <msreddy...@gmail.com> wrote:
- MoM of today's OCP SONiC call 2/23/2021.
Topics discussed.SONiC Yang Models- Praveen/LNKD
- Do you enforce any yang model prerequisites before HLD approvals? yes, yang models should be discussed in sub group
- Is there any demo on how to run the yang tests? >> Not yet, can be provided check with Praveen
- Are there any alternatives to validate the yang models except build time?
- Did we think about the backward compatibility of sonic ynag models? >> will be discussed in yang model subgroup
- Do you have any guidelines for sonic yang model release migration ? >> will be discussed in the yang model subgroup
- Where should approval for the sonic models be ? >> sub group
- What about sonic yang models for FRR/bgp routing stack ? >>will be discussed in yang model subgroup
- PR is out for review - https://github.com/Azure/SONiC/pull/748/files
Thanks,
-MadhuOn Tue, Feb 16, 2021 at 9:28 AM MS Reddy <msreddy...@gmail.com> wrote:
- MoM of today's OCP SONiC call 2/16/2021.
Topics discussed.CPU queue stats- Prabhu/BRCM
- Don't SONiC support the CPU queue stats today? No, front panel ports have this support today, extending it for CPU ports.
- Can it be extended to show port counters & show port pfc instead of new CLI? yes
- SNMP/MIB OID support be considered? right now the design skips CPU ports, but we can consider it
- Telemetry use case supported - yes (counters written into flex db)
- Do support debug counters ? yes
- HLD listed only MC CPU queues, don't it support unicast packet counters? Yes supports, however BRCM all CPU queues are Multicast queues- based on ASIC support
- How does this feature work with Multi ASIC platform feature? will be added into HLD
- Can this feature be included in a system test? yes
- PR is out for review : https://github.com/Azure/SONiC/pull/743
Miscellaneous:
- Release notes 202012 branch - review in progress
- 202106 release planning - wiki will be published soon
Thanks,-MadhuOn Tue, Feb 9, 2021 at 9:04 AM MS Reddy <msreddy...@gmail.com> wrote:
- MoM of today's OCP SONiC call 2/09/2021.
Topics discussed.Fabric Port Support SONiC/BGP Setup for VoQ system - Arista
- Is the eBGP & iBGP treats the next hops the same? yes - there is a FRR patch (additional-path all) from Arista for it.
- How does it handle next hop group membership in addition to the EMCP groups?
- Is there HLD discussed about how the next hops picked up between eBGP & iBGP peers?
- PR's - https://github.com/Azure/SONiC/blob/30b27dc6b3287d477830a406754834a7e3b48620/doc/voq/fabric.md
- https://github.com/Azure/SONiC/blob/550fc24667ed41d25528fbae2a3577f6a5520b1a/doc/voq/bgp_voq_chassis.md
- https://github.com/Azure/SONiC/blob/master/doc/voq/architecture.md
- https://github.com/Azure/SONiC/blob/2f320430c8199132c686c06b5431ab93a86fb98f/doc/multi_asic/SONiC_multi_asic_hld.md
- F
_._,_._,_
Groups.io Links:You receive all messages sent to this group.
View/Reply Online (#459) | Reply To Group | Reply To Sender | Mute This Topic | New Topic
Your Subscription | Contact Group Owner | Unsubscribe [msreddy...@gmail.com]
_._,_._,_
Thanks,MoM of today's OCP SONiC call 3/29/2022Topics discussed.DSCP/TC Remapping for Tunnel traffic HLD - MSFT
- Yang models - the table names not used anymore, the yang model definitions need to update the HLD and commit.
- Is the design DSCP/TC remapping is vendor specific, as some vendors don't need to remap as the inner header will be used for mapping? Yes.
- HLD should be updated with various vendor support for remapping DSCP/TC remapping.
- PR is out for review - https://github.com/Azure/SONiC/pull/950
-Madhu
_._,_._,_
Thanks,MoM of today's OCP SONiC call 3/29/2022Topics discussed.DSCP/TC Remapping for Tunnel traffic HLD - MSFT
- Yang models - the table names not used anymore, the yang model definitions need to update the HLD and commit.
- Is the design DSCP/TC remapping is vendor specific, as some vendors don't need to remap as the inner header will be used for mapping? Yes.
- HLD should be updated with various vendor support for remapping DSCP/TC remapping.
- PR is out for review - https://github.com/Azure/SONiC/pull/950
-Madhu
_._,_._,_
- MoM of today's OCP SONiC call 4/19/2022
Topics discussed.Fast-reboot Flow enhancements - NVIDIA
- How is this feature different from existing warm/fast reboot sequences ? What is the performance/downtime improvement here?
- How does this design measure the control plane's downtime ? Ex: is it considered ports, lags, routes, VRF's etc?
- Does this feature work with existing control plane assist with warm reboot? Reference:https://github.com/Azure/sonic-utilities/blob/3ff68c4e5287ab2f5d23c23176ebd75a4f629bf0/scripts/neighbor_advertiser
- Do you have benchmarks to control/data planes or downtime w.r.t configurations?
- How does the design handles image upgrade use case w.r.t schema updates? >> Existing db_migrator does take care of config_db & app_db for image upgrade schema changes
- What is the restoration logic to be handled in orchagent? >> The logic makes sure all the do_tasks are completed without any items in the queue.
- Does the new fast fast-reboot design support bulk-api ASIC ? >> yes
- Is the bulk-api support from ASIC mandatory or choice to use fast fast reboot feature ? >> not mandatory that ASIC must support bulk API to get fast fast reboot feature
- PR is out review - https://github.com/Azure/SONiC/pull/980
Project announcements:
- SONiC moved to LF - Software for Open Networking in the Cloud (SONiC) Moves to the Linux Foundation - Linux Foundation
- Note: There is no change on our 202205 release plan.
Thanks,-Madhu
Thanks,MoM of today's OCP SONiC call 3/29/2022Topics discussed.DSCP/TC Remapping for Tunnel traffic HLD - MSFT
- Yang models - the table names not used anymore, the yang model definitions need to update the HLD and commit.
- Is the design DSCP/TC remapping is vendor specific, as some vendors don't need to remap as the inner header will be used for mapping? Yes.
- HLD should be updated with various vendor support for remapping DSCP/TC remapping.
- PR is out for review - https://github.com/Azure/SONiC/pull/950
-Madhu
_._,_._,_
- MoM of today's OCP SONiC call 4/26/2022
Topics discussed.DASH SAI PTF - INTEL
- How is this DASH proposal different from existing PTF frameworks Ex: Spy Test framework? >> this proposal for data plane testing based out of DASH (Disaggregated API for SONiC HOST)
- Where do you get the DASH proposal ? Is there any workgroup discussing it?
- What is the scope of the DASH SAI PTF proposal? Is this proposal for functionality testing or does it cover scale as well?
- How about underlay testing with DASH ? Is this proposal considered the underlay - >> Guohan suggested working with Prince on this item.
- Pdf is out for feedback - https://github.com/reshmaintel/DASH/blob/main/doc_SAI-Proposal-SAI-PTF.md
- Do you have a plan to raise PR to review this proposal?
Thanks,-Madhu
- MoM of today's OCP SONiC call 4/19/2022
Topics discussed.Fast-reboot Flow enhancements - NVIDIA
- How is this feature different from existing warm/fast reboot sequences ? What is the performance/downtime improvement here?
- How does this design measure the control plane's downtime ? Ex: is it considered ports, lags, routes, VRF's etc?
- Does this feature work with existing control plane assist with warm reboot? Reference:https://github.com/Azure/sonic-utilities/blob/3ff68c4e5287ab2f5d23c23176ebd75a4f629bf0/scripts/neighbor_advertiser
- Do you have benchmarks to control/data planes or downtime w.r.t configurations?
- How does the design handles image upgrade use case w.r.t schema updates? >> Existing db_migrator does take care of config_db & app_db for image upgrade schema changes
- What is the restoration logic to be handled in orchagent? >> The logic makes sure all the do_tasks are completed without any items in the queue.
- Does the new fast fast-reboot design support bulk-api ASIC ? >> yes
- Is the bulk-api support from ASIC mandatory or choice to use fast fast reboot feature ? >> not mandatory that ASIC must support bulk API to get fast fast reboot feature
- PR is out review - https://github.com/Azure/SONiC/pull/980
Project announcements:
- SONiC moved to LF - Software for Open Networking in the Cloud (SONiC) Moves to the Linux Foundation - Linux Foundation
- Note: There is no change on our 202205 release plan.
Thanks,-Madhu
Thanks,MoM of today's OCP SONiC call 3/29/2022Topics discussed.DSCP/TC Remapping for Tunnel traffic HLD - MSFT
- Yang models - the table names not used anymore, the yang model definitions need to update the HLD and commit.
- Is the design DSCP/TC remapping is vendor specific, as some vendors don't need to remap as the inner header will be used for mapping? Yes.
- HLD should be updated with various vendor support for remapping DSCP/TC remapping.
- PR is out for review - https://github.com/Azure/SONiC/pull/950
-Madhu
_._,_._,_