robin.thoni/ipxe - git rthoni.com

Commit Graph

Author	SHA1	Message	Date
Michael Brown	639632b059	[hyperv] Assume that VMBus xfer page ranges correspond to RNDIS messages The (undocumented) VMBus protocol seems to allow for transfer page-based packets where the data payload is split into an arbitrary set of ranges within the transfer page set. The RNDIS protocol includes a length field within the header of each message, and it is known from observation that multiple RNDIS messages can be concatenated into a single VMBus message. iPXE currently assumes that the transfer page range boundaries are entirely arbitrary, and uses the RNDIS header length to determine the RNDIS message boundaries. Windows Server 2012 R2 generates an RNDIS_INDICATE_STATUS_MSG for an undocumented and unknown status code (0x40020006) with a malformed RNDIS header length: the length does not cover the StatusBuffer portion of the message. This causes iPXE to report a malformed RNDIS message and to discard any further RNDIS messages within the same VMBus message. The Linux Hyper-V driver assumes that the transfer page range boundaries correspond to RNDIS message boundaries, and so does not notice the malformed length field in the RNDIS header. Match the behaviour of the Linux Hyper-V driver: assume that the transfer page range boundaries correspond to the RNDIS message boundaries and ignore the RNDIS header length. This avoids triggering the "malformed packet" error and also avoids unnecessary data copying: since we now have one I/O buffer per RNDIS message, there is no longer any need to use iob_split(). Signed-off-by: Michael Brown <mcb30@ipxe.org>	11 years ago
Michael Brown	a69c9953ac	[hyperv] Increase TX ring size Empirical observation suggests that 32 is a sensible size to minimise the number of deferred packet transmissions without overflowing the VMBus transmit ring buffer. Signed-off-by: Michael Brown <mcb30@ipxe.org>	11 years ago
Michael Brown	4e6821662c	[hyperv] Receive all VMBus messages in a poll Allow for elision of transmitted TCP ACKs by handling all received VMBus messages in each network device poll operation. Signed-off-by: Michael Brown <mcb30@ipxe.org>	11 years ago
Michael Brown	cd68d93b6e	[hyperv] Tear down NetVSC RX buffer GPADL after closing VMBus device On Windows Server 2012 R2, the receive buffer teardown completion message seems to occasionally be deferred until after the VMBus channel has been closed. This happens even if there are no packets currently in the receive buffer. Work around this problem by separating the revocation and teardown of the receive buffer, and deferring the teardown until after the VMBus channel has been closed. Signed-off-by: Michael Brown <mcb30@ipxe.org>	11 years ago
Michael Brown	ef16d0d2b3	[hyperv] Add support for NetVSC paravirtual network devices Signed-off-by: Michael Brown <mcb30@ipxe.org>	11 years ago
Michael Brown	d08547309b	[vmxnet3] Add profiling code to exclude time spent in the hypervisor Signed-off-by: Michael Brown <mcb30@ipxe.org>	11 years ago
Michael Brown	a937615151	[intel] Use autoloaded MAC address instead of EEPROM MAC address The i350 (and possibly other Intel NICs) have a non-trivial correspondence between the PCI function number and the external physical port number. For example, the i350 has a "LAN Function Sel" bit within the EEPROM which can invert the mapping so that function 0 becomes port 3, function 1 becomes port 2, etc. Unfortunately the MAC addresses within the EEPROM are indexed by physical port number rather than PCI function number. The end result is that when anything other than the default mapping is used, iPXE will use the wrong address as the base MAC address. Fix by using the autoloaded MAC address if it is valid, and falling back to reading the MAC address directly from the EEPROM only if no autoloaded address is available. Signed-off-by: Michael Brown <mcb30@ipxe.org>	11 years ago
Michael Brown	af17abf67f	[efi] Include NII driver within "snp" and "snponly" build targets End users almost certainly don't care whether the underlying interface is SNP or NII/UNDI. Try to minimise surprise and unnecessary documentation by including the NII driver whenever the SNP driver is requested. Signed-off-by: Michael Brown <mcb30@ipxe.org>	11 years ago
Michael Brown	feb3a0f7d5	[efi] Check for presence of UNDI in NII protocol iPXE itself exposes a dummy NII protocol with no UNDI. Avoid potentially dereferencing a NULL pointer by checking for a non-zero UNDI address. Signed-off-by: Michael Brown <mcb30@ipxe.org>	11 years ago
Michael Brown	047baaba38	[efi] Add NII / UNDI driver Some UEFI network drivers provide a software UNDI interface which is exposed via the Network Interface Identifier Protocol (NII), rather than providing a Simple Network Protocol (SNP). The UEFI platform firmware will usually include the SnpDxe driver, which attaches to NII and provides an SNP interface. The SNP interface is usually provided on the same handle as the underlying NII device. This causes problems for our EFI driver model: when efi_driver_connect() detaches existing drivers from the handle it will cause the SNP interface to be uninstalled, and so our SNP driver will not be able to attach to the handle. The platform firmware will eventually reattach the SnpDxe driver and may attach us to the SNP handle, but we have no way to prevent other drivers from attaching first. Fix by providing a driver which can attach directly to the NII protocol, using the software UNDI interface to drive the network device. Signed-off-by: Michael Brown <mcb30@ipxe.org>	11 years ago
Michael Brown	b9a5ff2b03	[efi] Generalise snpnet_dev_info() to efi_device_info() Signed-off-by: Michael Brown <mcb30@ipxe.org>	11 years ago
Michael Brown	b71e93bb71	[efi] Free transmit ring entry before calling netdev_tx_complete() The snpnet driver uses netdev_tx_defer() and so must ensure that space in the (single-entry) transmit descriptor ring is freed up before calling netdev_tx_complete(). Signed-off-by: Michael Brown <mcb30@ipxe.org>	11 years ago
Anton D. Kachalov	86285d1d54	[intel] Add 8086:1557 card (Intel 82599 10G ethernet mezz) Signed-off-by: Anton D. Kachalov <mouse@yandex-team.ru> Signed-off-by: Michael Brown <mcb30@ipxe.org>	11 years ago
Jan Kiszka	0df7ce9641	[intel] Add I217-LM PCI ID Add the ID for the LM variant and differentiate it from the I217-V. Signed-off-by: Jan Kiszka <jan.kiszka@web.de> Signed-off-by: Michael Brown <mcb30@ipxe.org>	12 years ago
Michael Brown	0ce3c97095	[efi] Allow for non-PCI snpnet devices We currently require information about the underlying PCI device to populate the snpnet device's name and description. If the underlying device is not a PCI device, this will fail and prevent the device from being registered. Fix by falling back to populating the device description with information based on the EFI handle, if no PCI device information is available. Signed-off-by: Michael Brown <mcb30@ipxe.org>	12 years ago
Michael Brown	4c5b7945c3	[efi] Use the SNP protocol instance to match the SNP chainloading device Some systems will install a child of the SNP device and use this as our loaded image's device handle, duplicating the installation of the underlying SNP protocol onto the child device handle. On such systems, we want to end up driving the parent device (and disconnecting any other drivers, such as MNP, which may be attached to the parent device). Fix by recording the SNP protocol instance at initialisation time, and using this to match against device handles (rather than simply comparing the handles themselves). Reported-by: Jarrod Johnson <jarrod.b.johnson@gmail.com> Signed-off-by: Michael Brown <mcb30@ipxe.org>	12 years ago
Michael Brown	ead70bf920	[intel] Apply PBS/PBA errata workaround only to ICH8 PCI device IDs ICH8 devices have an errata which requires us to reconfigure the packet buffer size (PBS) register, and correspondingly adjust the packet buffer allocation (PBA) register. The "Intel I/O Controller Hub ICH8/9/10 and 82566/82567/82562V Software Developer's Manual" notes for the PBS register that: 10.4.20 Packet Buffer Size - PBS (01008h; R/W) Note: The default setting of this register is 20 KB and is incorrect. This register must be programmed to 16 KB. Initial value: 0014h 0018h (ICH9/ICH10) It is unclear from this comment precisely which devices require the workaround to be applied. We currently attempt to err on the side of caution: if we detect an initial value of either 0x14 or 0x18 then the workaround will be applied. If the workaround is applied unnecessarily, then the effect should be just that we use less than the full amount of the available packet buffer memory. Unfortunately this approach does not play nicely with other device drivers. For example, the Linux e1000e driver will rewrite PBA while assuming that PBS still contains the default value, which can result in inconsistent values between the two registers, and a corresponding inability to transmit or receive packets. Even more unfortunately, the contents of PBS and PBA are not reset by anything less than a power cycle, meaning that this error condition will survive a hardware reset. The Linux driver (written and maintained by Intel) applies the PBS/PBA errata workaround only for devices in the ICH8 family, identified via the PCI device ID. Adopt a similar approach, using the PCI_ROM() driver data field to indicate when the workaround is required. Reported-by: Donald Bindner <dbindner@truman.edu> Debugged-by: Donald Bindner <dbindner@truman.edu> Signed-off-by: Michael Brown <mcb30@ipxe.org>	12 years ago
Michael Brown	d461b8ddf2	[intel] Display before and after values for both PBS and PBA Signed-off-by: Michael Brown <mcb30@ipxe.org>	12 years ago
Michael Brown	c845740b88	[intel] Display PBS value when applying ICH errata workaround Signed-off-by: Michael Brown <mcb30@ipxe.org>	12 years ago
Michael Brown	3953ddd2ac	[smc9000] Avoid using CONFIG as a preprocessor macro Signed-off-by: Michael Brown <mcb30@ipxe.org>	12 years ago
Michael Brown	8b2942a7db	[xen] Cope with unexpected initial backend states Under some circumstances (e.g. if iPXE itself is booted via iSCSI, or after an unclean reboot), the backend may not be in the expected InitWait state when iPXE starts up. There is no generic reset mechanism for Xenbus devices. Recent versions of xen-netback will gracefully perform all of the required steps if the frontend sets its state to Initialising. Older versions (such as that found in XenServer 6.2.0) require the frontend to transition through Closed before reaching Initialising. Add a reset mechanism for netfront devices which does the following: - read current backend state - if backend state is anything other than InitWait, then set the frontend state to Closed and wait for the backend to also reach Closed - set the frontend state to Initialising and wait for the backend to reach InitWait. Signed-off-by: Michael Brown <mcb30@ipxe.org>	12 years ago
Michael Brown	be79ca535a	[xen] Use version 1 grant tables by default Using version 1 grant tables limits guests to using 16TB of grantable RAM, and prevents the use of subpage grants. Some versions of the Xen hypervisor refuse to allow the grant table version to be set after the first grant references have been created, so the loaded operating system may be stuck with whatever choice we make here. We therefore currently use version 2 grant tables, since they give the most flexibility to the loaded OS. Current versions (7.2.0) of the Windows PV drivers have no support for version 2 grant tables, and will merrily create version 1 entries in what the hypervisor believes to be a version 2 table. This causes some confusion. Avoid this problem by attempting to use version 1 tables, since otherwise we may render Windows unable to boot. Play nicely with other potential bootloaders by accepting either version 1 or version 2 grant tables (if we are unable to set our requested version). Note that the use of version 1 tables on a 64-bit system introduces a possible failure path in which a frame number cannot fit into the 32-bit field within the v1 structure. This in turn introduces additional failure paths into netfront_transmit() and netfront_refill_rx(). Signed-off-by: Michael Brown <mcb30@ipxe.org>	12 years ago
Michael Brown	5c4f1da2ce	[efi] Generalise snpnet_pci_info() to efi_locate_device() Signed-off-by: Michael Brown <mcb30@ipxe.org>	12 years ago
Curtis Larsen	2ce1c70166	[efi] Try various possible SNP receive filters The behavior observed in the Apple EFI (1.10) RecieveFilters() call is: - failure if any of the PROMISCUOUS or MULTICAST filters are included - success if only UNICAST is included, however the result is UNICAST\|BROADCAST - success if only UNICAST and BROADCAST are included - if UNICAST, or UNICAST\|BROADCAST are used, but the previous call tried (and failed) to set UNICAST\|BROADCAST\|MULTICAST, then the result is UNICAST\|BROADCAST\|MULTICAST Work around this apparently broken SNP implementation by trying RecieveFilterMask, then falling back to UNICAST\|BROADCAST\|MULTICAST, then UNICAST\|BROADCAST, and finally UNICAST. Modified-by: Michael Brown <mcb30@ipxe.org> Tested-by: Curtis Larsen <larsen@dixie.edu> Signed-off-by: Michael Brown <mcb30@ipxe.org>	12 years ago
Michael Brown	7b3cc18462	[efi] Open device path protocol only at point of use Some EFI 1.10 systems (observed on an Apple iMac) do not allow us to open the device path protocol with an attribute of EFI_OPEN_PROTOCOL_BY_DRIVER and so we cannot maintain a safe, long-lived pointer to the device path. Work around this by instead opening the device path protocol with an attribute of EFI_OPEN_PROTOCOL_GET_PROTOCOL whenever we need to use it. Debugged-by: Curtis Larsen <larsen@dixie.edu> Signed-off-by: Michael Brown <mcb30@ipxe.org>	12 years ago
Michael Brown	3b42ed477f	[efi] Provide centralised definitions of commonly-used GUIDs Signed-off-by: Michael Brown <mcb30@ipxe.org>	12 years ago
Michael Brown	471fdfab79	[efi] Reset multicast filter list when setting SNP receive filters According to the UEFI specification, the MCastFilter parameter (which we currently pass as NULL, along with a zero MCastFilterCnt) is optional only if ResetMCastFilter is true. Signed-off-by: Michael Brown <mcb30@ipxe.org>	12 years ago
Michael Brown	1f7b269954	[efi] Add ability to dump SNP device mode information Originally-implemented-by: Curtis Larsen <larsen@dixie.edu> Signed-off-by: Michael Brown <mcb30@ipxe.org>	12 years ago
Michael Brown	16d99cc8ef	[efi] Dump existing openers when we are unable to open a protocol Dump the existing openers of a protocol whenever we are unable to open a protocol using attributes of BY_DEVICE, EXCLUSIVE, or BY_CHILD_CONTROLLER. Signed-off-by: Michael Brown <mcb30@ipxe.org>	12 years ago
Michael Brown	60891f699a	[efi] Use efi_handle_name() instead of efi_devpath_text() where applicable Using efi_devpath_text() is marginally more efficient if we already have the device path protocol available, but the mild increase in efficiency is not worth compromising the clarity of the pattern: DBGC ( device, "THING %p %s ...", device, efi_handle_name ( device ) ); Signed-off-by: Michael Brown <mcb30@ipxe.org>	12 years ago
Michael Brown	2e0821b9ed	[efi] Use efi_handle_name() instead of efi_handle_devpath_text() Signed-off-by: Michael Brown <mcb30@ipxe.org>	12 years ago
Michael Brown	793a806611	[xen] Add support for Xen netfront virtual NICs Signed-off-by: Michael Brown <mcb30@ipxe.org>	12 years ago
Michael Brown	3a02409fc8	[natsemi] Check for ioremap() failures Signed-off-by: Michael Brown <mcb30@ipxe.org>	12 years ago
Michael Brown	720ae17aa4	[myson] Check for ioremap() failures Signed-off-by: Michael Brown <mcb30@ipxe.org>	12 years ago
Michael Brown	022ef91984	[skel] Check for ioremap() failures Signed-off-by: Michael Brown <mcb30@ipxe.org>	12 years ago
Michael Brown	7ab3035749	[vmxnet3] Check for ioremap() failures Signed-off-by: Michael Brown <mcb30@ipxe.org>	12 years ago
Michael Brown	857e4f56a7	[realtek] Check for ioremap() failures Signed-off-by: Michael Brown <mcb30@ipxe.org>	12 years ago
Michael Brown	9ce2b56af6	[intel] Check for ioremap() failures Debugged-by: Anton D. Kachalov <mouse@yandex-team.ru> Signed-off-by: Michael Brown <mcb30@ipxe.org>	12 years ago
Michael Brown	d0cfbd01f5	[efi] Rewrite SNP NIC driver Rewrite the SNP NIC driver to use non-blocking and deferrable transmissions, to provide link status detection, to provide information about the underlying (PCI) hardware device, and to avoid unnecessary I/O buffer allocations during receive polling. Signed-off-by: Michael Brown <mcb30@ipxe.org>	12 years ago
Michael Brown	c7051d826b	[efi] Allow network devices to be created on top of arbitrary SNP devices Signed-off-by: Michael Brown <mcb30@ipxe.org>	12 years ago
Hannes Reinecke	bb5a4a111b	[igbvf] Allow changing of MAC address The VF might not have assigned a MAC address upon startup, and will end up with a random MAC address during probe(). With this patch the MAC address can be changed later on. Signed-off-by: Hannes Reinecke <hare@suse.de> Signed-off-by: Michael Brown <mcb30@ipxe.org>	12 years ago
Hannes Reinecke	f63ec19dca	[igbvf] Assign random MAC address if none is set If the VF doesn't have a MAC address assigned we should create a random MAC address. Signed-off-by: Hannes Reinecke <hare@suse.de> Modified-by: Michael Brown <mcb30@ipxe.org> Signed-off-by: Michael Brown <mcb30@ipxe.org>	12 years ago
Michael Brown	d5cf058994	[iscsi] Include IP address origin in iBFT The iBFT includes an "origin" field to indicate the source of the IP address. We use the heuristic of assuming that the source should be "manual" if the IP address originates directly from the network device settings block, and "DHCP" otherwise. This is an imperfect guess, but is likely to be correct in most common situations. Originally-implemented-by: Hannes Reinecke <hare@suse.de> Signed-off-by: Michael Brown <mcb30@ipxe.org>	12 years ago
Michael Brown	059adae434	[iscsi] Read IPv4 settings only from the relevant network device Signed-off-by: Michael Brown <mcb30@ipxe.org>	12 years ago
Michael Brown	e047811c85	[scsi] Improve sense code parsing Parse the sense data to extract the reponse code, the sense key, the additional sense code, and the additional sense code qualifier. Originally-implemented-by: Hannes Reinecke <hare@suse.de> Signed-off-by: Michael Brown <mcb30@ipxe.org>	12 years ago
Michael Brown	4e4fc678c2	[intel] Increase receive ring fill level As of commit d28bb51 ("[tcp] Defer sending ACKs until all received packets have been processed"), increasing the RX ring size will increase the number of received packets per transmitted ACK (since each poll will process up to one complete receive ring). Under KVM, this can make a substantial (up to ~200%) difference to the overall download speed, since transmissions are very expensive. Increase the ring fill level from four to eight packets: this increases the download speed by around 50% at a cost of around 8kB of heap space. Further speedups are possible by increasing the ring size further, but it would be preferable to find alternative methods which do not use noticeable amounts of heap space. Tested-by: Robin Smidsrød <robin@smidsrod.no> Signed-off-by: Michael Brown <mcb30@ipxe.org>	12 years ago
Michael Brown	abf875a2e5	[intel] Exclude time spent in hypervisor from profiling When profiling, exclude any time spent inside the hypervisor responding to our MMIO accesses. This substantially reduces the variance accumulated on many other profilers. Signed-off-by: Michael Brown <mcb30@ipxe.org>	12 years ago
Michael Brown	b2c7b6a85e	[intel] Push new RX descriptors in batches Inside a virtual machine, writing the RX ring tail pointer may incur a substantial overhead of processing inside the hypervisor. Minimise this overhead by writing the tail pointer once per batch of descriptors, rather than once per descriptor. Profiling under qemu-kvm (version 1.6.2) shows that this reduces the amount of time taken to refill the RX descriptor ring by around 90%. Signed-off-by: Michael Brown <mcb30@ipxe.org>	12 years ago
Michael Brown	8a3dcefc0c	[intel] Profile common virtual machine operations Operations which are negligible on physical hardware (such as issuing a posted write to the transmit ring tail register) may involve substantial amounts of processing within the hypervisor if running in a virtual machine. Signed-off-by: Michael Brown <mcb30@ipxe.org>	12 years ago
Michael Brown	27884298a3	[intel] Avoid completely filling the TX descriptor ring It is unclear from the datasheets whether or not the TX ring can be completely filled (i.e. whether writing the tail value as equal to the current head value will cause the ring to be treated as completely full or completely empty). It is very plausible that this edge case could differ in behaviour between real hardware and the many implementations of an emulated Intel NIC found in various virtual machines. Err on the side of caution and always leave at least one ring entry empty. Signed-off-by: Michael Brown <mcb30@ipxe.org>	12 years ago

First Previous ... 3 4 5 6 7 ... Next Last

1081 Commits (15759e539e2fd0d33ef4f68c92942807f2048a29)