- MoM of today's OCP SONiC call 04/14/2020.
Topics discussed.D-BUS - Mike[DELL]
- Why Dbus used for ztp and why not control panel aclD?
- How to secure the users shouldn't mis use the critical host access to commands like reboot? Can you provide some guidelines around it?
- How does host communicate to ex: reboot/warm reboot execution status to clients (containers issued commands to host)? How does D-Bus helps here?
- Will the design provide some infrastructure level audit mechanism who does what w.r.t host commands like reboot/warm reboot etc.
- Why D-Bus used specific listed services/activities alone, why can't it use for all the services? Can you add some guidelines to the document?
Virtual Summit
- OCP virtual summit registration - https://www.opencompute.org/summit/global-summit/registration
- May 13-14 sonic hardware/software technical sessions - https://www.opencompute.org/summit/global-summit/schedule
- May 15 - student symposium
- Detail available Virtual Summit Schedule updated
On Fri, Apr 10, 2020 at 9:27 AM MS Reddy <msreddy...@gmail.com> wrote:
- MoM of today's OCP SONiC call 04/06/2020.
Topics discussed.AAA Enhancements - Martin [DELL]
- How to deal with Redis failures w.r.t account management?
- What is the target sonic customer deployments trying to address here?
- How do we protect the account creations from docker/applications like sonic-mgmt-framework or any other applications? Do you have any suggestions?
- Can this design support Radius based account management? Ans: It seems there is a PR from BRCM
- Today SONiC has Tacacs, what are the improvements expect from this HLD?
- How does the design handles multiple component failures? Ex: HAMD, REDIS, TACACS etc
- Can hamD decoupled from sonic-mgmt-framework?
202006 RoadmapThanks,-MadhuOn Tue, Mar 31, 2020 at 9:07 AM MS Reddy <msreddy...@gmail.com> wrote:
- MoM of today's OCP SONiC call 03/31/2020.
Topics discussed.Proposal for System Health for LED settings - Liat (MLNX)
- Can this design consider multiple system LED's? AI Liat
- How do you test and quality LED management? Ans: Unit testing
- Is it possible to integrate with Monit for docker/process critical monitoring? Ans : Yes
- What is the suggestions for BMC systems ? Ans: this design is not for BMS based systems
- Can this alert/syslog messages duplicate to SNMP traps? Ans: It seems SONiC don't have traps today
- Can this design support system ready status? AI Liat (ASIC/vendor specific APIs)
- Is this design provide any port level LED blink functionality? AI Liat (ASIC/Vendor specific)
- This design should take care the fact that syslog format shouldn't be break BMC messaging/ extra work for BMC based systems ?
Non Technical
- OCP virtual summit - May
- Pre-recorded talk - deadline by this friday.
- Virtual booths - video recording of Demo [OCP will play demos several multiple months.
On Tue, Mar 24, 2020 at 9:26 AM MS Reddy <msreddy...@gmail.com> wrote:
- MoM of today's OCP SONiC call 03/24/2020.
Topics discussed.Port Mirroring Rupesh Kumar (BRCM)1. Do you have a data model to support this feature? Please list out in HLD AI Rupesh2. Can this feature support OpenConfig data model? No3. How does the design handles both ACL & port based mirror enabled? Can we expect two copies ? AI Rupesh4. Can this design support config validations? AI Rupesh5. How does the design handles link events w.r.t to activate inactivate sessions?6. If platform doesn't support SPAN/ERSPAN features what is the behavior? Does this design consider SAI capability check? AI Rupesh/BRCM7. What are the tests written and How are these test cases organised / which repos will be used? AI - Rupesh8. What are the SAI attributes used in the feature.9. Can we police SPAN/ERSPAN sessions? AI RupeshThanks,-MadhuOn Tue, Mar 17, 2020 at 9:14 AM MS Reddy <msreddy...@gmail.com> wrote:
- MoM of today's OCP SONiC call 03/17/2020.
Topics discussed.Sonic Qos Schedular & Shaping - Michael (BRCM)
- Can shaping config support on sub interfaces/ port channels ? No
- Do we have a plan to support vlan/port channel as many DC supports this use case?
- Why do need SAI_SCHEDULER_GROUP_ATTR_SCHEDULAR_PROFILE_ID when SAI_QUEUE_SCHEDULAR_PROFILE_ID available ? Ans: schedular_group will help for finer granularity.
- Do you have plan to publish spytest cases with feature? yes
- How to configure these new shaping configuration? is this coexists with config_db? Ans : new shaping attributes supported through sonic-mgmt-framework open-config model
- What are the shaping constraints w.r.t DPB (dynamic port breakout) ?
- Can shaping applied to CPU queues? suggestion - CoPP is the right place to configure the CPU queues?
- Can it be possible to configure shaping in percentage so that DPB can dynamically change the shaping per port?
- Can this shaping applied on operation port (profile applied and under traffic) ? No CLI to configure that today
Thanks,-MadhuOn Tue, Mar 10, 2020 at 9:22 AM MS Reddy <msreddy...@gmail.com> wrote:
- MoM of today's OCP SONiC call 03/10/2020.
Topics discussed.Monitoring and auto-Mitigating unhealthy containers n SONiC - Joe
- How Monit service is different from Supervisord? Monit does track and alert every 5 min, so user will not miss any, where as supervisord alerts only once.
- Can Monit service monitor resource usage? >> possible
- How does Monit service detects dockers come online dynamically? >> right now it's driven by confiig file
- Can we make Thresholds human readable? Can it be possible to calculate threshold in % values ? >> Joe will look into itt
- How can we determine these throsholds ? ex: How much threshold shall we fix for a BGP container? >> Joe will look into it
- Can Monit service help user specify docker system quotas? >> no
- Can database is consistent after auto restart ? >> Not right now. Monit service provide provision to disable features from auto-restart ex: database
Thanks,-MadhuOn Wed, Feb 26, 2020 at 8:09 PM MS Reddy <msreddy...@gmail.com> wrote:Thanks,
- MoM of today's OCP SONiC call 02/25/2020.
Topics discussed.Gear Box Design - BRCM
- How does this design support for dynamic loading/discovering of external phy
- Why do pick multiple versions of phy-syncd? why can't glorify the existing syncd?
- Can vendors cherry-pick one synd instance out of all the instances/ is this possible switch off other syncd instances ?
- It is advised GearBox design should discuss with breakout/sub port work groups?
- Can this design supports dynamically updating eeprom / firmware ?
- How does this design treats system side vs line card link events ?
-MadhuOn Tue, Feb 11, 2020 at 9:27 AM MS Reddy <msreddy...@gmail.com> wrote:
- MoM of today's OCP SONiC call 02/11/2020.
Topics discussed.SONiC Line card Hot Swap - Inspur
- How does it detect line card is broken ? Line card status must have [present/not present/ abnormal]
- How does this line card event detected by sonic ? is it event or poll based?
- I/O module / Linecard type detection is HW based detection
- List out the I/O or line cards supported in the design?
- How do you initilize the ASIC to line card type
- It should be discussed w.r.t sku/ dynamic breakout and gear box features.
Inspur team work with Xin to have sub group meeting to discuss more on the interested parties.Pre-OCP registration remainderThanksMadhuOn Tue, Feb 4, 2020 at 9:09 AM <msreddy...@gmail.com> wrote:MoM of today's OCP SONiC call 02/04/2020.Topics discussed.Debian 10 buster kernel upgrade 202006 release
- 201911 release bugs triage - watch out for bug comments and provide your update.
- OCP pre-summit workshop
ThanksOn Tue, Jan 14, 2020 at 8:43 AM MSREDDY P <msreddy...@gmail.com> wrote:MoM of today's OCP SONiC call 01/14/2020.Topics discussed.
- OCP pre-summit workshop - watch out for invitation from EventBrite
- Hackathon - watch out for invitation
- New workgroups - watch out for workgroup page
- Round table for workgroup meetups
- Next release - 202006
Thanks,-MadhuOn Tue, Dec 17, 2019 at 8:57 AM MSREDDY P <msreddy...@gmail.com> wrote:MoM of today's OCP SONiC call 12/17/2019.Topics discussed:PCI_E Diagnostics:
- How does it help? the feature captures all the pci_e components in pcie.yaml config file and verify the components on demand.
- How is different from lspci? the feature aggregates the lspci command info and capture components list in pcie.yaml file for component check
- When does the pcie.yaml generated? Platform shall generate the file when there is any change.
- Is the feature generic for all the platforms ? yes, there is specific file for each platform and the platform owners should update it.
- Can the feature distinguish critical/non critical PCI_e components? can this be included in the design ?
- Next meeting - Jan 7th.
-MadhuOn Tue, Dec 10, 2019 at 9:18 AM MSREDDY P <msreddy...@gmail.com> wrote:- OCP planning :MoM of today's OCP SONiC call 12/10/2019.Topics discussed:Release 201910 tracking status:
- ZTP - Ready to merge #599 is failed to pass tests. (failed tests are vs related.)
- MGMT Framework - Should support mgmt docker can be turned off / not build at all. Ready to merge.
- PDDF - on review pending
- STP/PVSP - Pending on Jenkins jobs.
- L3 performance - Merged
- L2 performance - ready to merge
- BFD - one more review needed and sync up with FRR upstream / feature on or off capabilities(next releases) - BRCM will share test results for review
- NAT - SAI dependency, BRCM will work with SAI community to close on this next few months (next release) - BRCM will share test results for review (No automation tests)
MeetupHackathonTo be focussed Topics - Community should focus on topics such as test improvementsVenue - LNKDOn Tue, Dec 3, 2019 at 9:11 AM MSREDDY P <msreddy...@gmail.com> wrote:Discussions will be continued next week.MoM of today's OCP SONiC call 12/03/2019.Topics discussed:Release 201910 tracking status:
- ZTP - [Approvals Pending] - Guohan approved it
- BFD - one more review needed and sync up with FRR upstream / feature on or off capabilities(next releases) - BRCM will share test results for review
- NAT - SAI dependency, BRCM will work with SAI community to close on this next few months (next release)
- STP/PVST - Build Job in Zenkins should be included
- MGMT Framework - Short group to review the changes / Alibaba provided comments on gNMI bring together with mgmt container.
ThanksOn Tue, Nov 26, 2019 at 9:11 AM MSREDDY P <msreddy...@gmail.com> wrote:MoM of today's OCP SONiC call 11/26/2019.Topics discussed:Release 201910 tracking status:
- ZTP - Mostly ready BRCM & Dell will take a look PR and close it
- BFD - one more review needed and sync up with FRR upstream / feature on or off capabilities(next releases)
- NAT - SAI dependency, BRCM will work with SAI community to close on this next few months
- L2 enhancements - extra file should be removed from SAI REDIS PR.
- MultiDB - Need to raise PR with all new changes
- MGMT Framework - Short group to review the changes
Announcements:
- Deadline for OCP participation extended- Dec 5th.
On Tue, Nov 19, 2019 at 9:15 AM MSREDDY P <msreddy...@gmail.com> wrote:
- MoM of today's OCP SONiC call 11/19/2019.
Topics discussedThermal Control Design - MLNXReview (Q & A):
- Can the design support to integrate with thirdparty ? ex: BCM enabled CPU /thermal controls? yes
- Can this new daemon enabled/disabled? yes
- Is it possible to select few objects (fan/cpu/thermal) ? yes (all/none)
- How about supporting ASIC Internal Sensors - Dell will raise a PR on it
- Can this design supports to have flexibility use their own thermal/fan algorithm? yes
- A single daemon handles all the change events (fan/cpu/thermal)? there us high chance that some events should wait in the loop with the current design? Why can't we propose different change events for different cpu/fan/optics?
- Will the design supports more policies? yes
- Verbose on API definition on threshold levels about Average/Max/ Snapshot
- The current design discussed about 60 sec polling interval ? Is there a way to include timestamp in polling mechanism?
- Is there any API exposed for fanTray contain more than one fan?
Miscellaneous:OCP deadline - Nov 29thThanks,-MadhuOn Sun, Nov 17, 2019 at 6:23 PM MSREDDY P <msreddy...@gmail.com> wrote:
- MoM of today's OCP SONiC call 11/12/2019.
Topics discussedDPKG Caching Framework - BRCMReview (Q & A):
- How does the framework track/calculate the GIT hash for new files and dependencies pulling from the Internet?
- Where does the debug cache stored?
- Can this framework increase build time (first time, how much does it take to populate the cache)?
- How much memory the debug cache takes ? ~600MB
- The framework discussed about two SHA hases? Does it have any significance? can this solved using single hash?
- How does the framework capture the file modified or not ?
- How does the framework deals with sonic patch files get applied to dpkg cache framework?
- Does the sha calculated for the depends files or depend files content ?
- How does commit and content SHA utilized in the framework? Why can't content-sha alone should solve the local modified files ?
- Do need to reevaluate/ recommend memory requirements with the DPKG feature?
-ThanksMadhuOn Tue, Nov 5, 2019 at 9:53 AM MSREDDY P <msreddy...@gmail.com> wrote:MoM of today's OCP SONiC call 11/05/2019.Topics discussed
- DPKG Caching Framework - BRCM
- Review (Q & A):
- How do track/calculate the GIT hash for new files and dependencies from the internet?
- Where does the deebug cache stored?
- Can this framework takes more time?
- How much memory the debug cache takes ? ~600MB
On Tue, Oct 29, 2019 at 9:39 AM MSREDDY P <msreddy...@gmail.com> wrote:
- MoM of today's OCP SONiC call 10/29/2019.
Topics discussed
- RADIUS - BRCM
Review (Q & A):
- Where does the Cached MPL (management-privilege-level)stored ? It stored at protected file /var/run/radius
- Can the framework support user change from tacacs+ to radius?
- What is the radius agent planning to use ? pam-radius
- How about user login's on device reboots? does it expect login failed/success? No, as soon as MPL cache preserved, users can logged in.
- Can the MPL cache associate with TTL? No, right now we refresh the session on every user logging in.
- There are 3 radius options [many-to-one = Y/N/A] discussed, what is appropriate for SONiC usage?
DPKG Caching Framework - BRCM
- How do track/calculate the GIT hash for new files and dependencies from the internet?
- Where does the deebug cache stored?
- Can this framework takes more time?
- How much memory the debug cache takes ? ~600MB
PR is available, discussion will continue next week.Thanks,On Tue, Oct 22, 2019 at 10:00 AM MSREDDY P <msreddy...@gmail.com> wrote:Announcement
- MoM of today's OCP SONiC call 10/22/2019.
Topics discussed
- VRRP (Virtual Router Redundancy Protocol)- BRCM
Review (Q & A):
- What is preventing not to support vrrp3?
- How is it different from FRR VRRP support? do you get a chance to evaluate FRR VRRP stack?
- It would be good to list the out the possible use cases/deployments for the sonic user to enable to this feature? Can this feature work with data center MLAG kind of deployments ?
- How does the uplink tracking works? for instance let's say there are more than 8 uplink interfaces how do we does it effects on mastership?
- How to handle split-brain scenarios?
- What are the supported VRID ranges ?
Sub group on test framework proposal - starting tomorrow 8-9 AM PSTOn Tue, Oct 15, 2019 at 6:50 PM MSREDDY P <msreddy...@gmail.com> wrote:
- MoM of today's OCP SONiC call 10/15/2019.
Topics discussed
- Tech Support export Service
Review (Q & A):
- Can custom scripts will be added to tech support service ? This is minimal service, can be added.
- Can the Journal data should be part of tech support ? will be included
- What kind of granular level support provided by tech support service? Minimal service for now, can be added filters, custom plugins to ex: to export to cloud etc.
- It seems tech support keep adding repeated data? How to avoid this storage back pressure on remote server? will look it
- do you have per process core support ? Can the user cherry pick on few process instead all ? With current schema of things with containers seems not possible right now. BRCM will look into it.
- Core File Manager
Review (Q & A):
- Can File Manager do automatic analysis on core dumps ? yes
- Will it be possible to export only analysis reports to tech support ? yes
- Is the Core file uploaded with back traces ? yes
- Does the systemD increase footprint - little bit
Thanks,MadhuOn Tue, Oct 8, 2019 at 10:27 AM MSREDDY P <msreddy...@gmail.com> wrote:
- MoM of today's OCP SONiC call 10/08/2019.
Topics discussed
- Checkout for OCP summit https://www.opencompute.org/events/past-summits
- Test sub group will be back next week [mid of OCT]
- SONiC Document work group - news-letter bi-weekly [end of October]
- 201908 Code PR reviews - target next 2 weeks.
- 201908 Code complete - by Oct 31st
- 201908 QA start - Nov 1st
Thanks,-Madhu
- MoM of today's OCP SONiC call 9/24/2019.
Topics discussed
- MGMT Framework - BRCM & DELL
Review (Q & A):
- List out examples where does the developers/users need transLib hints?
- W.r.t CVL library, do you have any performance numbers ex: add-del-add config objects work flow ex: vlan ? Do you see any performance hit? what are the improveements?
On Tue, Sep 24, 2019 at 9:22 AM MSREDDY P <msreddy...@gmail.com> wrote:
- MoM of today's OCP SONiC call 9/24/2019.
Topics discussed
- Dynamic Port BreakOut - LKND
- This talk is extension of previous discussion.
Review (Q & A):
- Can the design incorporate port groups ? offline discussion with Dell, LKND.
- Can the design support to add port persona ex: FC/FCoE or Ethernet?
- What is the default admin status of fanned out ports ? admin staus is DOWN by default.
- How does the design guarantee the sequencing of delete/add configurations?
Thanks,-MadhuOn Tue, Sep 17, 2019 at 9:49 AM MSREDDY P <msreddy...@gmail.com> wrote:
- MoM of today's OCP SONiC call 9/17/2019.
Topics discussed
- Firmware Utils - MLNX
Review (Q & A):
- Why don't leverage ONIE updater, what is the design rational behind the fwUtils?
- What is the significance of chassis? Does Sonic supports multiple chassis?
- Can the design supports module level installations?
- Is the design support remote image path? yes
- What are the supported methods to download images? remote url http/https
- What about the image validations ? ex: compatibility between CPLD/BIOS etc..
- Can user skip/install specific image version using the fwUtils? - you should use it manually [skip the fwUpgrade]
- Can fwUtils supports scheduling of reloads after component updates?
2019 Oct ReleaseCheckout below for release trackingThanks,-MadhuOn Tue, Sep 10, 2019 at 9:10 AM MSREDDY P <msreddy...@gmail.com> wrote:
- MoM of today's OCP SONiC call 9/10/2019.
Topics discussed
- Drop Counters HLD - MSFT
Review (Q & A):
- Does the design preserve the counters on warm reboots? No
- Can the design reports the user if the drop counter is not supported with platforms? yes
- List out the caveats with warm reboot cases. Ex: if the device went wrong after warm reboots, does the drop counters distinguish the failure reasons?
- Do we have default settings for the debug counters with the device? No
- Can the design provide any templates for the debug counters to configure it?
- Can the lifecycle (ex: clear)of these counters will not effect the existing counters? No
- Can the design support logical/aggregate debug counters?
- Does these counters are ASIC independent? what platforms do you guys cover it?
- Can this integrate with mgmt framework?
Thanks,-MadhuOn Tue, Sep 3, 2019 at 9:22 AM MSREDDY P <msreddy...@gmail.com> wrote:
- MoM of today's OCP SONiC call 9/3/2019.
Topics discussed
- BGP Error handling - BRCM
Review (Q & A):
- Is there any perf impact on disable this feature: No
- Data shows the RIB-in convergence performance degradation is 44%, it should be linear, but why is 44%? can it be improved?
- What is the scope of the QuickTests? Is it covered only happy paths alone? do you have numbers with non-happy path scenarios?
- does the QuickTest covers both Ipv4 or Ipv6 ? QuickTest supports mix scenario of ipv4 & ipv6 ? not yet done for pure Ipv6 routes, will be explored.
- do you have any special handling for default route ? No
- Does it supports any debug commands check the failed route ? yes
- What is the reconciliation on daemon crashes (Ex: BGP)- how to reconcile the routes? Please list out the scenarios in HLD.
- Can this feature turn-off on demand ? is yes, can this affect the system stability?
- PR - https://github.com/Azure/SONiC/pull/424#pullrequestreview-283110975
Error Handling - BRCMReview (Q & A):
- Overall framework is thinking about two approaches - 1) Introduce Opaque ID to track the add-delete-add kind of error handling scenarios 2) Introduce an Sync SAI API in addition to current Async SAI API.
- HLD is out for the community review. https://github.com/Azure/SONiC/pull/391
Thanks,MadhuOn Wed, Aug 28, 2019 at 7:19 AM MSREDDY P <msreddy...@gmail.com> wrote:
- MoM of today's OCP SONiC call 8/27/2019.
Topics discussed
- Dynamic Port BreakOut HLD - LNKD
Review (Q & A):
- Can't SONiC query SAI API to fetch the break out capabilities?
- Generic question : Why breakout support only per interface? why can' t it per device ? Platform's don't allow certain ports due to silicon issues or the feature is not ready to use the breakout port on this relesase
- Can the breakout feature supports range of ports together?
- Can ASIC vendors support breakout on range/group of ports?
- What does platform vendors do to support this feature? It seems vendors should provide platforms.ini file.
- Can this feature support the list of breakout supported to the user?
- Can breakout feature enforce lanes and aliases to the sonic application?
- Why can't we define platform files per HWSKU?
- Can this HLD covers Platform LED feature?
- How about the configuration validations during port-breakout ? Can this integrate with MGMT framework CVL lib? yes.
Thanks,Madhu