Zhixiong Chi 134d5d2fbd Add the pci reboot quirk in DMI table for Dell PowerEdge R750
Problem:
The Dell PowerEdge R750 hangs after the following command is executed:
$sudo -i /bin/bash -c 'echo b > /proc/sysrq-trigger'
The issue can usually be reproduced within about 5 test cycles.

The active controller sends a reboot command to mtcClient on the
standby controller because of the SM failure (missed heartbeat), and
mtcClient then tries to reboot the system gracefully. If the standby
controller has not rebooted within 120s, mtcClient tries to force a
reboot with the command "echo b > /proc/sysrq-trigger".
Unfortunately, the Dell PowerEdge R750 gets stuck at that point and the
BMC console shows nothing.
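
The graceful-then-forced sequence described above can be sketched as
follows. This is only an illustration, not the mtcClient code: the
function names, the injected callables and the way the wait is
implemented are all assumptions. Only the 120s limit and the write of
"b" to /proc/sysrq-trigger come from the description.

```python
import time

SYSRQ_TRIGGER = "/proc/sysrq-trigger"   # path taken from the commit message
GRACEFUL_TIMEOUT_S = 120.0              # 120s limit taken from the commit message


def force_reboot(path=SYSRQ_TRIGGER):
    """Write 'b' to the sysrq trigger, i.e. request an immediate reboot (needs root)."""
    with open(path, "w") as trigger:
        trigger.write("b")


def reboot_with_fallback(graceful, force=force_reboot,
                         timeout_s=GRACEFUL_TIMEOUT_S, sleep=time.sleep):
    """Hypothetical helper: try a graceful reboot, then force one.

    If the graceful reboot works, the host goes down during the sleep and
    the force step is never reached.
    """
    graceful()
    sleep(timeout_s)
    force()   # only reached when the host is still up after the timeout
```

The callables are injected so the sequence can be exercised without
actually rebooting a machine.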

Solution:
A search for clues relevant to this machine turned up nothing except
the kernel parameter 'reboot=p', which changes the reboot type to
pci_reboot for the sysrq magic key. With this kernel option the issue
no longer occurred over multiple test cycles, and the system rebooted
properly, which is the expected behavior.
So this approach is helpful for resetting the Dell R750.
Because this kernel option should not be applied to all target
machines, we instead change the reboot type only for the R750 by means
of a DMI table quirk. Other machines keep the default reboot type; this
commit affects only the R750.

Based on the above, we add a pci reboot quirk to the DMI table that
changes reboot_type to pci_reboot, so that the kernel on the Dell
PowerEdge R750 reboots properly.
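
As a rough userspace model of how such a quirk table selects the reboot
type, consider the sketch below. It is not the kernel code and not the
patch itself: RebootQuirk, REBOOT_QUIRKS and select_reboot_type are
hypothetical names, the field names mirror the sysfs DMI attributes, the
"acpi" default is an assumption about the x86 default reboot type, and
substring matching is assumed (as done by the kernel's DMI_MATCH macro;
the patch may match differently).

```python
from dataclasses import dataclass
from typing import Dict, List


@dataclass(frozen=True)
class RebootQuirk:
    """Hypothetical model of one DMI quirk entry."""
    ident: str
    matches: Dict[str, str]   # DMI field name -> substring that must be present
    reboot_type: str


# Hypothetical table with the single entry described in this commit.
REBOOT_QUIRKS: List[RebootQuirk] = [
    RebootQuirk(
        ident="Dell PowerEdge R750",
        matches={"sys_vendor": "Dell Inc.", "product_name": "PowerEdge R750"},
        reboot_type="pci",
    ),
]

DEFAULT_REBOOT_TYPE = "acpi"   # assumption: default reboot type on x86


def select_reboot_type(dmi, quirks=REBOOT_QUIRKS, default=DEFAULT_REBOOT_TYPE):
    """Return the reboot type of the first quirk whose fields all match."""
    for quirk in quirks:
        if all(sub in dmi.get(field, "") for field, sub in quirk.matches.items()):
            return quirk.reboot_type
    return default
```

Under the substring assumption, a product name that merely contains
"PowerEdge R750" would also select the pci reboot type; every other
machine falls through to the default.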

On the R750 target we can see the following dmidecode information:
$sudo dmidecode |grep 'Product Name'
	Product Name: PowerEdge R750
$sudo dmidecode |grep 'Vendor'
	Vendor: Dell Inc.
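
For scripted checks, the two fields shown above can be pulled out of
dmidecode output with a small parser. This is a minimal sketch: the
function name parse_dmidecode_fields is hypothetical, and it assumes the
plain "Key: Value" line layout seen in the output above.

```python
def parse_dmidecode_fields(output, wanted=("Product Name", "Vendor")):
    """Collect the first value seen for each wanted 'Key: Value' line."""
    fields = {}
    for line in output.splitlines():
        key, sep, value = line.strip().partition(":")
        # Keep only the first occurrence; dmidecode may print the same key
        # in several sections (for example system and baseboard).
        if sep and key in wanted and key not in fields:
            fields[key] = value.strip()
    return fields
```

The text would typically come from running "sudo dmidecode" and
capturing its standard output.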

TestPlan:
PASS: downloader && build-pkgs && build-image
PASS: Jenkins Installation on R750 machine and the other labs.
PASS: Execute the following testing cycle more than 20 times:
       $sudo -i /bin/bash -c 'echo b > /proc/sysrq-trigger'
       The system rebooted properly in every test cycle.
       The hang after reset was no longer observed.

Closes-Bug: 2041606

Signed-off-by: Zhixiong Chi <zhixiong.chi@windriver.com>
Change-Id: I05467cc6d5105aa813852dca0c935278741b043f
2023-10-30 22:30:42 -04:00

0001-Notification-of-death-of-arbitrary-processes.patch
0002-PCI-Add-ACS-quirk-for-Intel-Fortville-NICs.patch
0003-affine-compute-kernel-threads.patch
0004-Affine-irqs-and-workqueues-with-kthread_cpus.patch
0005-Make-kernel-start-eth-devices-at-offset.patch
0006-intel-iommu-allow-ignoring-Ethernet-device-RMRR-with.patch
0007-turn-off-write-same-in-smartqpi-driver.patch
0008-Allow-dmar-quirks-for-broken-bioses.patch
0009-tpm-ignore-burstcount-to-improve-tpm_tis-send-perfor.patch
0010-bpf-cgroups-Fix-cgroup-v2-fallback-on-v1-v2-mixed-mo.patch
0011-scsi-smartpqi-Enable-sas_address-sysfs-for-SATA-dev.patch
0012-workqueue-Affine-rescuer-threads-and-unbound-wqs.patch
0015-Revert-scsi-sd-Inline-sd_probe_part2.patch
0016-Revert-commit-f049cf1a7b.patch
0017-genirq-Export-affinity-setter-for-modules.patch
0018-genirq-Provide-new-interfaces-for-affinity-hints.patch
0019-ixgbe-Use-irq_update_affinity_hint.patch
0020-Add-auxiliary-bus-support.patch
0021-driver-core-auxiliary-bus-move-slab.h-from-include-f.patch
0022-driver-core-auxiliary-bus-make-remove-function-retur.patch
0023-driver-core-auxiliary-bus-minor-coding-style-tweaks.patch
0024-driver-core-auxiliary-bus-Fix-auxiliary-bus-shutdown.patch
0025-driver-core-auxiliary-bus-Fix-calling-stage-for-auxi.patch
0026-driver-core-auxiliary-bus-Remove-unneeded-module-bit.patch
0027-driver-core-auxiliary-bus-Fix-memory-leak-when-drive.patch
0028-driver-core-auxiliary-bus-Enable-by-default.patch
0029-Enable-CONFIG_PAGE_POOL-by-default.patch
0030-x86-Enumerate-AVX512-FP16-CPUID-feature-flag.patch
0031-KVM-x86-Expose-AVX512_FP16-for-supported-CPUID.patch
0032-tools-headers-cpufeatures-Sync-with-the-kernel-sourc.patch
0033-rcu-Avoid-running-boost-kthreads-on-isolated-CPUs.patch
0035-xfs-open-code-ioend-needs-workqueue-helper.patch
0036-xfs-drop-unused-ioend-private-merge-and-setfilesize-.patch
0037-xfs-drop-unnecessary-setfilesize-helper.patch
0038-samples-bpf-use-kprobe-and-urandom_read_iter.patch
0039-Revert-sched-idle-Move-quiet_vmstate-into-the-NOHZ-c.patch
0040-kernfs-move-revalidate-to-be-near-lookup.patch
0041-kernfs-add-a-revision-to-identify-directory-node-cha.patch
0042-kernfs-use-VFS-negative-dentry-caching.patch
0043-kernfs-switch-kernfs-to-use-an-rwsem.patch
0044-kernfs-use-i_lock-to-protect-concurrent-inode-update.patch
0045-kernfs-dont-call-d_splice_alias-under-kernfs-node-lo.patch
0046-kernfs-also-call-kernfs_set_rev-for-positive-dentry.patch
0047-kernfs-don-t-create-a-negative-dentry-if-inactive-no.patch
0048-kernfs-switch-global-kernfs_rwsem-lock-to-per-fs-loc.patch
0049-kernfs-prevent-early-freeing-of-root-node.patch
0050-kernfs-remove-redundant-kernfs_rwsem-declaration.patch
0051-kernfs-fix-NULL-dereferencing-in-kernfs_remove.patch
0052-cpufreq-intel_pstate-Add-Icelake-servers-support-in-.patch
0053-cpufreq-intel_pstate-Add-Ice-Lake-server-to-out-of-b.patch
0054-cpufreq-intel_pstate-Support-Sapphire-Rapids-OOB-mod.patch
0055-cpufreq-intel_pstate-Add-Sapphire-Rapids-support-in-.patch
0056-Port-negative-dentries-limit-feature-from-3.10.patch
0057-platform-x86-intel-uncore-freq-Add-Sapphire-Rapids-s.patch
0058-intel_idle-add-SPR-support.patch
0059-intel_idle-add-preferred_cstates-module-argument.patch
0060-intel_idle-add-core-C6-optimization-for-SPR.patch
0061-intel_idle-Fix-the-preferred_cstates-module-paramete.patch
0062-intel_idle-Fix-SPR-C6-optimization.patch
0063-intel_idle-make-SPR-C1-and-C1E-be-independent.patch
0064-perf-x86-cstate-Add-ICELAKE_X-and-ICELAKE_D-support.patch
0065-perf-x86-cstate-Add-SAPPHIRERAPIDS_X-CPU-support.patch
0066-powercap-intel_rapl-support-new-layout-of-Psys-Power.patch
0067-perf-x86-rapl-Add-msr-mask-support.patch
0068-perf-x86-rapl-Only-check-lower-32bits-for-RAPL-energ.patch
0069-perf-x86-rapl-Fix-psys-energy-event-on-Intel-SPR-pla.patch
0070-perf-x86-rapl-Use-standard-Energy-Unit-for-SPR-Dram-.patch
0071-perf-core-Don-t-pass-task-around-when-ctx-sched-in.patch
0072-perf-core-Use-perf_cgroup_info-active-to-check-if-cg.patch
0073-perf-core-Fix-perf_cgroup_switch.patch
0074-perf-core-Always-set-cpuctx-cgrp-when-enable-cgroup-.patch
0075-Add-the-pci-reboot-quirk-in-DMI-table-for-Dell-Power.patch