US8380882B2 - Packet validation in virtual network interface architecture - Google Patents



Publication number
US8380882B2
Authority
US
United States
Prior art keywords
network interface
interface device
transmit
network
transmit queue
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active, expires
Application number
US12/612,078
Other versions
US20100049876A1 (en)
Inventor
Steve L. Pope
David J. Riddoch
Ching Yu
Derek Roberts
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Xilinx Inc
Original Assignee
Solarflare Communications Inc
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Solarflare Communications Inc
Priority to US12/612,078 (US8380882B2)
Publication of US20100049876A1
Assigned to COMERICA BANK, AS AGENT, A TEXAS BANKING ASSOCIATION. SECURITY AGREEMENT. Assignors: SOLARFLARE COMMUNICATIONS, INC., A DELAWARE CORPORATION
Assigned to SOLARFLARE COMMUNICATIONS INC. CHANGE OF ADDRESS OF THE ASSIGNEE. Assignors: SOLARFLARE COMMUNICATIONS INC.
Priority to US13/765,579 (US9912665B2)
Application granted
Publication of US8380882B2
Assigned to LEVEL 5 NETWORKS, INC. ASSIGNMENT OF ASSIGNORS INTEREST (SEE DOCUMENT FOR DETAILS). Assignors: POPE, STEVE; RIDDOCH, DAVID; ROBERTS, DEREK; YU, CHING
Assigned to SOLARFLARE COMMUNICATIONS, INC. ASSIGNMENT OF ASSIGNORS INTEREST (SEE DOCUMENT FOR DETAILS). Assignors: LEVEL 5 NETWORKS, INC.
Assigned to SOLARFLARE COMMUNICATIONS, INC. RELEASE OF SECURITY INTEREST. Assignors: COMERICA BANK
Assigned to SOLARFLARE COMMUNICATIONS, INC. RELEASE BY SECURED PARTY (SEE DOCUMENT FOR DETAILS). Assignors: COMERICA BANK
Assigned to PACIFIC WESTERN BANK. SECURITY INTEREST (SEE DOCUMENT FOR DETAILS). Assignors: SOLARFLARE COMMUNICATIONS, INC.
Assigned to ALLY BANK. SECURITY INTEREST (SEE DOCUMENT FOR DETAILS). Assignors: SOLARFLARE COMMUNICATIONS, INC.
Assigned to SOLARFLARE COMMUNICATIONS, INC. RELEASE BY SECURED PARTY (SEE DOCUMENT FOR DETAILS). Assignors: PACIFIC WESTERN BANK
Priority to US15/888,498 (US10924483B2)
Assigned to SOLARFLARE COMMUNICATIONS, INC. RELEASE OF INTELLECTUAL PROPERTY SECURITY AGREEMENT. Assignors: ALLY BANK
Assigned to XILINX, INC. ASSIGNMENT OF ASSIGNORS INTEREST (SEE DOCUMENT FOR DETAILS). Assignors: SOLARFLARE COMMUNICATIONS, INC.
Active
Adjusted expiration

Classifications

    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04L TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L63/00 Network architectures or network communication protocols for network security
    • H04L63/10 Network architectures or network communication protocols for network security for controlling access to devices or network resources
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04L TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L47/00 Traffic control in data switching networks
    • H04L47/50 Queue scheduling
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04L TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L49/00 Packet switching elements
    • H04L49/90 Buffering arrangements
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04L TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L49/00 Packet switching elements
    • H04L49/90 Buffering arrangements
    • H04L49/901 Buffering arrangements using storage descriptor, e.g. read or write pointers
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04L TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L49/00 Packet switching elements
    • H04L49/90 Buffering arrangements
    • H04L49/9031 Wraparound memory, e.g. overrun or underrun detection
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04L TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L49/00 Packet switching elements
    • H04L49/90 Buffering arrangements
    • H04L49/9063 Intermediate storage in different physical parts of a node or terminal

Definitions

  • the invention relates to network interfaces, and more particularly to mechanisms for validating network traffic sent or received by user level libraries in a virtual network architecture.
  • a typical computer system includes a processor subsystem (including one or more processors), a memory subsystem (including main memory, cache memory, etc.), and a variety of “peripheral devices” connected to the processor subsystem via a peripheral bus.
  • Peripheral devices may include, for example, keyboard, mouse and display adapters, disk drives and CD-ROM drives, network interface devices, and so on.
  • the processor subsystem communicates with the peripheral devices by reading and writing commands and information to specific addresses that have been preassigned to the devices.
  • the addresses may be preassigned regions of a main memory address space, an I/O address space, or another kind of configuration space.
  • Communication with peripheral devices can also take place via direct memory access (DMA), in which the peripheral device (or another agent on the peripheral bus) transfers data directly between the memory subsystem and one of the regions of address space preassigned to the peripheral device.
  • Most modern computer systems are multitasking, meaning they allow multiple different application programs to execute concurrently on the same processor subsystem.
  • Most modern computer systems also run an operating system which, among other things, allocates time on the processor subsystem for executing the code of each of the different application programs.
  • One difficulty that might arise in a multitasking system is that different application programs may wish to control the same peripheral device at the same time.
  • another job of the operating system is to coordinate control of the peripheral devices.
  • Only the operating system can access the peripheral devices directly; application programs that wish to access a peripheral device must do so by calling routines in the operating system.
  • the placement of exclusive control of the peripheral devices in the operating system also helps to modularize the system, obviating the need for each separate application program to implement its own software code for controlling the hardware.
  • the placement of exclusive control of the peripheral devices in the operating system also permits management of another potential difficulty, that of improper control or handling of the peripheral device.
  • improper or inappropriate control of the devices could compromise other applications running in the computer system, or could compromise or otherwise negatively impact operation of the network to which the device is connected.
  • the software code in the operating system has therefore developed a certain level of trust: users, network administrators, network architects and other network devices can presume that the great majority of packets originating from this software code will conform to network protocol specifications.
  • Additional code for controlling each particular peripheral device is incorporated into the operating system in the form of a device driver specific to the particular peripheral device. Device drivers are usually written by or in association with the manufacturer of the particular peripheral device, so they too are afforded a certain level of trust.
  • the part of the operating system that controls the hardware is usually the kernel.
  • The kernel performs hardware initializations, sets and resets the processor state, adjusts the processor internal clock, initializes the network interface device, and makes other direct accesses to the hardware.
  • The kernel executes in kernel mode, also sometimes called trusted mode or a privileged mode, whereas application level processes execute in a user mode.
  • The processor subsystem hardware itself ensures that only trusted code, such as the kernel code, can access the hardware directly.
  • the processor enforces this in at least two ways: certain sensitive instructions will not be executed by the processor unless the current privilege level is high enough, and the processor will not allow user level processes to access memory locations (including memory mapped addresses associated with specific hardware resources) which are outside of a user-level physical or virtual address space already allocated to the process.
  • kernel space or “kernel address space” refers to the address and code space of the executing kernel. This includes kernel data structures and functions internal to the kernel. The kernel can access the memory of user processes as well, but “kernel space” generally means the memory (including code and data) that is private to the kernel and not accessible by any user process.
  • user space refers to the address and code space allocated by a code that is loaded from an executable and is available to a user process, excluding kernel private code data structures.
  • all four terms are intended to accommodate the possibility of an intervening mapping between the software program's view of its own address space and the physical memory locations to which it corresponds.
  • the software program's view of its address space is contiguous, whereas the corresponding physical address space may be discontiguous and out-of-order, and even potentially partly on a swap device such as a hard disk drive.
  • Address spaces are sometimes referred to herein as “virtual” address spaces, in order to emphasize the possibility of such mappings.
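  • The mapping between a contiguous virtual address space and scattered physical memory can be illustrated with a minimal sketch. The page size, the `page_table` contents and the `translate` helper are all hypothetical and chosen only for illustration; a real system performs this translation in hardware with operating system support.

```python
PAGE_SIZE = 4096  # assumed page size for this illustration

# Hypothetical page table: virtual page number -> physical frame number,
# or None when the page is not resident (e.g. it is on a swap device).
page_table = {0: 7, 1: 2, 2: None, 3: 5}


def translate(virtual_addr):
    """Map a virtual address to a physical address, or fail if not resident."""
    vpn, offset = divmod(virtual_addr, PAGE_SIZE)
    frame = page_table.get(vpn)
    if frame is None:
        raise LookupError(f"virtual page {vpn} is not resident in physical memory")
    return frame * PAGE_SIZE + offset
```

  • Consecutive virtual pages 0 and 1 land in frames 7 and 2 here, which shows how a contiguous program view can correspond to discontiguous, out-of-order physical memory.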
  • A user process invokes a kernel routine by executing a system call, which is a function that causes the kernel to execute some code on behalf of the process.
  • the “current process” is still the user process, but during system calls it is executing “inside of the kernel”, and therefore has access to kernel address space and can execute in a privileged mode.
  • Kernel code is also executed in response to an interrupt issued by a hardware device, since the interrupt handler is found within the kernel.
  • the kernel also, in its role as process scheduler, switches control between processes rapidly using the clock interrupt (and other means) to trigger a switch from one process to another.
  • When a kernel routine is invoked, the current privilege level increases to kernel mode in order to allow the routine to access the hardware directly.
  • When the kernel relinquishes control back to a user process, the current privilege level returns to that of the user process.
  • the operating system implements a system level protocol processing stack which performs protocol processing on behalf of the application, and also performs certain checks to make sure outgoing data packets have authorized characteristics and are not malformed.
  • an application wishing to transmit a data packet using TCP/IP calls the operating system API (e.g. using a send( ) call) with data to be transmitted.
  • This call causes a context switch to invoke kernel routines to copy the data into a kernel data buffer and perform TCP send processing.
  • protocol is applied and fully formed TCP/IP packets are enqueued with the interface driver for transmission. Another context switch takes place when control is returned to the application program.
  • kernel routines for network protocol processing may be invoked also due to the passing of time.
  • One example is the triggering of retransmission algorithms.
  • the operating system provides all OS modules with time and scheduling services (driven by the hardware clock interrupt), which enable the TCP stack to implement timers on a per-connection basis.
  • the operating system performs context switches in order to handle such timer-triggered functions, and then again in order to return to the application.
  • network transmit and receive operations can involve excessive context switching, and this can cause significant overhead.
  • the problem is especially severe in networking environments in which data packets are often short, causing the amount of required control work to be large as a percentage of the overall network processing work.
  • TCP and other protocols are implemented twice: once built into the kernel and once built into a user level transport library accessible to application programs.
  • an application issues API (application programming interface) calls.
  • Some API calls may be handled by the user level transport libraries, and the remainder can typically be passed on through the interface between the application and the operating system to be handled by the libraries that are available only to the operating system.
  • For implementation with many operating systems it is convenient for the transport libraries to use existing Ethernet/IP based control-plane structures: e.g. SNMP and ARP protocols via the OS interface.
  • One particular problem with user-level transport libraries is that in bypassing many of the routines normally performed in the kernel, they also lose the trust normally accorded those routines. This is because the kernel no longer has control of the user-level routines and cannot enforce their identity with those in the kernel. Users or application programs are able to modify the user-level transport routines, or replace them with others provided by a third party. As a result, the support of user-level transport libraries to bypass kernel routines and avoid context switches increases the risk of malformed or even malicious traffic being driven onto the network.
  • Part of the risk of permitting user-level transport libraries can be overcome by virtualizing the network interface device in such a way that each process is aware of only its own resources.
  • the hardware can be virtualized in such a way that one process cannot transmit or receive data on behalf of another, nor can one process see the data belonging to another process. But this kind of virtualization does not prevent a process from transmitting problematic data packets out onto the network through its own assigned resources; hence trust is still not ensured.
  • a network interface device receiving data packets from a computing device for transmission onto a network, the data packets having a certain characteristic, transmits the packet only if the sending queue has authority to send packets having that characteristic.
  • the data packet characteristics can include transport protocol number, source and destination port numbers, source and destination IP addresses, for example.
  • Authorizations can be programmed into the NIC by a kernel routine upon establishment of the transmit queue, based on the privilege level of the process for which the queue is being established. In this way, a user process can use an untrusted user-level protocol stack to initiate data transmission onto the network, while the NIC protects the remainder of the system from certain kinds of compromise.
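  • The per-queue check described above can be sketched as follows. This is a toy model under stated assumptions, not the structure of the authorizations database in the figures: the `PacketHeader` and `TransmitValidator` names are hypothetical, and the choice to match on protocol number, source IP address and source port only is an assumption made to keep the example short.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class PacketHeader:
    protocol: int   # IP protocol number, e.g. 6 for TCP, 17 for UDP
    src_ip: str
    src_port: int
    dst_ip: str
    dst_port: int


class TransmitValidator:
    """Toy model of a per-queue authorizations database held on the NIC."""

    def __init__(self):
        self._allowed = {}  # queue id -> set of (protocol, src_ip, src_port)

    def authorize(self, queue_id, protocol, src_ip, src_port):
        """Models the trusted kernel routine programming the NIC at queue setup."""
        self._allowed.setdefault(queue_id, set()).add((protocol, src_ip, src_port))

    def may_transmit(self, queue_id, header):
        """Return True only if this queue may send packets with this header."""
        key = (header.protocol, header.src_ip, header.src_port)
        return key in self._allowed.get(queue_id, set())
```

  • In this sketch a packet from an authorized queue that carries a source address the queue was never granted is rejected, as is any packet from a queue with no authorizations at all. The untrusted user-level stack never calls `authorize`; only the kernel-side setup path does.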
  • FIG. 1 is a simplified block diagram of a typical computer system incorporating features of the invention.
  • FIG. 2 is another view of the computer system of FIG. 1 .
  • FIG. 2A illustrates an example bus mapping table of FIG. 2 .
  • FIG. 3 is a flowchart example of steps that an application process might follow in the system of FIGS. 1 and 2 for the creation and transmission of IP packets onto a network.
  • FIG. 4 is a flowchart of steps that might be performed during the library initialization step of FIG. 3 .
  • FIG. 5 is a block diagram of data structures used by the system of FIGS. 1 and 2 in order to support separate transmit and receive queues for different VNICs.
  • FIG. 6 is a flowchart illustrating significant steps performed by the kernel resource allocation routine of FIG. 4 .
  • FIGS. 7 and 8 illustrate alternative example authorizations databases maintained in the NIC of FIG. 1 .
  • FIG. 9 is a flowchart detail of the user socket routine in FIG. 3 .
  • FIG. 10 is a flowchart detail of the bind routine in FIG. 3 .
  • FIG. 11 is a flowchart detail of the sendTo routine in FIG. 3 .
  • FIG. 12 is a functional block diagram of certain structures on a NIC which can be used to perform packet validation for transmission of data packets.
  • FIG. 13 is a flowchart of typical steps taken on the NIC 116 for transmission of data packets arriving from multiple queues.
  • FIG. 1 is a simplified block diagram of a typical computer system 110 which can communicate via a network 112 with other computer systems such as 130 , 132 and 134 .
  • Computer system 110 includes a network interface card (NIC) 116 communicating via a communication channel 118 with a host subsystem 114 .
  • the host subsystem 114 includes a processor subsystem 120 which includes at least one processor, a host memory subsystem 122 , and a core logic subsystem 124 .
  • the core logic subsystem 124 provides bridges among the processor subsystem 120 , the host memory subsystem 122 and the communication channel 118 .
  • the host subsystem 114 may also include other devices 126 in communication with the communication channel 118 .
  • the host subsystem 114 is considered to be one type of “computing device”. Other types of computing devices include non-reconfigurable processing devices for which network connectivity is desired.
  • the network interface card 116 provides an interface to outside networks, including an interface to the network 112 , and is coupled via network 112 to corresponding interface devices in other computer systems.
  • The physical hardware components of network interfaces are referred to herein as network interface cards (NICs), although they need not be in the form of cards: for instance they could be in the form of integrated circuits (ICs) and connectors fitted directly onto a motherboard, or in the form of macrocells fabricated on a single integrated circuit chip with other components of the computer system.
  • Network 112 may comprise many interconnected computer systems and communication links. These communication links may be wireline links, optical links, wireless links or any other mechanism for communication of information. While in one embodiment network 112 is the Internet, in other embodiments, network 112 may be any suitable computer network or combination of networks. In an embodiment described herein, network 112 supports an Ethernet protocol.
  • Host memory subsystem 122 typically includes a number of memories including a main random access memory (RAM) for storage of instructions and data during program execution, and a read only memory (ROM) in which fixed instructions and data are stored.
  • One or more levels of cache memory may also be included in the host memory subsystem 122 .
  • the host memory subsystem 122 is sometimes referred to herein simply as “host memory”.
  • virtual memory is considered part of the host memory subsystem even though part of it may be stored physically at various times on a peripheral device.
  • the communication channel 118 provides a mechanism for allowing the various components and subsystems of computer system 110 to communicate with each other.
  • the communication channel 118 comprises a PCI bus.
  • Other embodiments may include other buses, and may also include multiple buses.
  • Computer system 110 itself can be of varying types including a personal computer, a portable computer, a workstation, a computer terminal, a network computer, a television, a mainframe, a server or any other data processing system or user device. Due to the ever-changing nature of computers and networks, the description of computer system 110 depicted in FIG. 1 is intended only as a specific example for purposes of illustrating an embodiment of the present invention. Many other configurations of computer system 110 are possible, having more or fewer components than, and configured similarly to or differently from, the computer system depicted in FIG. 1 .
  • FIG. 2 is another view of the computer system 110 , presented to better illustrate software and functional features.
  • the computer runs an operating system 221 which is capable of supporting application processes 222 also running on the computer.
  • a “process” is a program, for example an application program, in execution.
  • a process may run multiple “threads”, which can also be thought of as lightweight processes.
  • a library 223 of instructions is stored by the computer and available to the applications. The part of the library usable for communications with the NIC 116 is termed a transport library 224 .
  • Included in the operating system kernel is a driver component 225 : a dedicated set of instructions which allow for data transfer with the NIC 116 .
  • Each application would normally take the form of a software program running on the computer, but it could be embedded in firmware.
  • Some operating systems include applications in addition to fundamental operating system code. Aspects of the system that are not described herein may be as set out in PCT publication No. WO2004/025477, incorporated herein by reference in its entirety.
  • the NIC 116 can support resources of a number of types: i.e. resources having capabilities of different natures. Examples include DMA queues, event queues, timers and support resources for remote apertures of the type described in WO2004/025477.
  • Each type of resource ( 231 - 234 ) is provided from a dedicated hardware resource pool which can support numerous instances of resources of the respective type. In order for such an instance to be made operational it must be configured by means of instructions from the computing device 114 , as described in more detail below.
  • the NIC 116 communicates with the computing device 114 over the bus 118 .
  • the bus is a PCI bus, but the invention is not limited to such a bus.
  • Data transmitted over the PCI bus 118 is associated with a destination address and is received by whichever entity connected to the bus has had that address allocated to it.
  • the addresses are allocated in pages of 4 or 8 kB. One or more of these pages may be allocated to the NIC 116 .
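  • As a small worked example of page-based addressing, a bus address can be split into a page number and an offset within that page. The `split_bus_address` helper is hypothetical; only the 4 kB and 8 kB page sizes come from the text.

```python
PAGE_SIZE = 4 * 1024  # 4 kB pages; the text also allows 8 kB


def split_bus_address(addr, page_size=PAGE_SIZE):
    """Return (page number, offset within page) for a bus address."""
    return divmod(addr, page_size)
```

  • With 4 kB pages, address 0x3010 falls in page 3 at offset 0x10. With 8 kB pages the same address falls in page 1 at offset 0x1010.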
  • Blocks 251 and 252 represent allocated pages on the PCI bus 118 .
  • the NIC 116 has a bus interface controller 235 , a resource configuration unit 236 and a bus mapping table 237 .
  • the resource configuration unit processes communications received from the computer that provide instructions on the allocation, re-allocation and de-allocation of resources on the NIC 116 , and configures the resources in accordance with such instructions.
  • the kernel driver 225 stores a record of which resources on the NIC 116 are allocated. When a resource is to be allocated the driver 225 identifies a suitable free resource of the required type on the NIC 116 and transmits an allocation instruction to the NIC 116 .
  • the instruction identifies the resource and specifies the details of how it is to be allocated, including details of the internal configuration of the resource (e.g.
  • That instruction is passed to the resource configuration unit.
  • the resource configuration unit then loads the specified configuration into the identified resource.
  • the instruction also includes an ownership string, which may be an identification of which application or process on the computer is using the resource.
  • the resource configuration unit stores these in a row of the bus mapping table.
  • An example of entries in the bus mapping table is shown in FIG. 2A and is described in more detail below.
  • one or more pages ( 251 , 252 ) on the bus 118 are allocated to the NIC 116 .
  • Part of this address space page 251
  • Other pages e.g. page 252
  • the resource configuration unit 236 stores a record of the pages that are allocated to the NIC 116 for use by resources. Note that in some embodiments, some or all of the functions of the resource configuration unit 236 may alternatively be provided by the kernel driver 225 itself.
  • Standard types of network connection require standard sets of resources; for example: an event queue, transmit (TX) and receive (RX) DMA queues, and a set of direct memory accessible (DMA'able) memory buffers.
  • the user level transport library 224 includes routines that can be called directly by the application process 222 and that initiate the allocation of such standard sets of resources, including set numbers of resources of different types.
  • the transport library also includes routines that allow a resource of each type to be allocated, re-allocated or de-allocated individually.
  • the presence of both these types of instruction means that standard connections can be set up efficiently, and yet non-standard groups of resources can be created, and existing connections can be reconfigured on a resource-by-resource basis.
  • a “user level stack” is any protocol processing software that runs in unprotected mode.
  • a “protocol stack” is the set of data structures and logical entities associated with the networking interfaces. This includes sockets, protocol drivers, and the media device drivers.
  • The routines for allocation, re-allocation and de-allocation of resources require access to restricted memory mapped addresses, such as page 251 for sending configuration instructions to the NIC 116 . Since the user level transport library 224 lacks the necessary privilege level to perform these accesses, these routines in the user level transport library 224 make calls to the kernel driver 225 . In a Unix environment, for example, such calls might take the form of IOCtl( ) system calls. These calls cause an initial context switch to a kernel level process, which in turn communicates the instructions to the NIC 116 for the allocation of the resources as specified in the routines. Those instructions specify the identity of the application or process with which the resources are to be associated, and the nature of the resources. The instructions are processed by the resource configuration unit 236 of the NIC 116 .
  • a feature of the system of FIG. 2 is that the space on the bus 118 that is allocated to the NIC 116 can be split dynamically between the resources on the bus 118 . Once one or more pages 252 have been allocated to the NIC 116 for use by resources those resources can be allocated one or more individual sub-page addresses within that page, corresponding to locations as illustrated at 253 , 254 . Thus each resource can have a part of the total space allocated to it. A record of which part of the total space is allocated to which resource is stored in the bus mapping table 237 . The effect is that a single page of the bus can be used for communication to resources of multiple types and/or resources that relate to multiple connections and/or resources that are associated with multiple applications or processes on the computer 114 . As a result, the total bus space can be used relatively efficiently.
  • the usage of the allocated bus space 252 is managed by the kernel driver 225 .
  • When a resource is to be allocated, the kernel driver identifies, using a data store whose content it manages, an unused block in the space on the bus that has already been allocated for use by resources of the NIC 116 , the space being of the size required for the resource. It then stores in that data store the identity of the resource (“resource ID”), the address of the block within the allocated space (“sub-page ID”), and the identity of the application or process that is to use the resource (“process tag”); and sends a message to the resource configuration unit (RCU) 236 to cause it to store corresponding data in the bus mapping table 237 (as shown in FIG. 2A ).
  • If the RCU finds that table 237 indicates the address to be already occupied then it returns an error code to the driver.
  • the sub-page address may need to be supplemented with the address of the page in which the sub-page lies if that cannot be inferred as a result of only a single page having been allocated for use by the resources. If the total space allocated for use by resources is insufficient then the kernel driver allocates it more space. Having allocated the resources, the RCU returns a success message to the kernel driver.
  • the allocated page and sub-page addresses are returned to and mapped into the virtual address space of the user level process that requested the resources in order that it can access them by means of that data. Another context switch then takes place back to the user level calling process.
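  • The allocation handshake can be sketched as below. This is a simplified model that assumes the kernel driver chooses the free block and the RCU records the row; the class names, the fixed number of equal-sized slots per page, and the string status codes are all hypothetical.

```python
class BusMappingTable:
    """Toy bus mapping table: one row per allocated sub-page address."""

    def __init__(self):
        self.rows = {}  # sub-page ID -> (resource ID, process tag)

    def store(self, sub_page_id, resource_id, process_tag):
        """RCU side: record the row, or report that the address is occupied."""
        if sub_page_id in self.rows:
            return "error: address occupied"
        self.rows[sub_page_id] = (resource_id, process_tag)
        return "success"


class KernelDriver:
    """Toy driver that tracks which sub-page slots it has handed out."""

    def __init__(self, table, slots_per_page=8):
        self.table = table
        self.slots_per_page = slots_per_page  # assumed fixed-size slots
        self.in_use = set()                   # the driver's own data store

    def allocate(self, resource_id, process_tag):
        """Pick the first unused slot and ask the RCU to record it."""
        for sub_page_id in range(self.slots_per_page):
            if sub_page_id in self.in_use:
                continue
            status = self.table.store(sub_page_id, resource_id, process_tag)
            if status == "success":
                self.in_use.add(sub_page_id)
                return sub_page_id
        raise MemoryError("no free sub-page; more bus space would be needed")
```

  • The returned sub-page ID stands in for the address that would be mapped into the requesting process's virtual address space. The sketch raises an error where the text says the kernel driver would instead obtain more space.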
  • An application that has had resources allocated to it can access them by sending data (e.g. by means of load/store cycles through a virtual memory mapping) to the relevant bus page, at the sub-page address corresponding to the respective resource. Since these addresses are part of the application's virtual address space, no context switch to any kernel level process is required in order to perform these accesses. Any data sent to pages allocated to resources is picked off the bus 118 by the bus interface controller 235 . It directs that data to the appropriate one of the resources 231 - 234 by performing a look-up in the table 237 to determine the identity of the resource to which the sub-page address has been allocated.
  • An application can also access a resource by means other than a bus write: for example by means of direct memory access (DMA).
  • the NIC 116 checks that the identity of the application/process from which the access has been received matches the identity indicated in the table 237 for the resource. If it does not match, the data is ignored. If it matches, it is passed to the relevant resource. This adds to security and helps to prevent corruption of the resources by other applications.
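  • The look-up and ownership check can be sketched as a single function. The `route_write` name and the dictionary layout of the table are hypothetical; how the NIC actually learns the identity of the sender is not modelled here and is simply passed in as `sender_tag`.

```python
def route_write(table, sub_page_id, sender_tag, data):
    """Deliver data to the resource that owns sub_page_id, or drop it.

    table maps sub-page ID -> (resource ID, process tag).
    Returns (resource ID, data) on success, or None when the write is ignored.
    """
    row = table.get(sub_page_id)
    if row is None:
        return None  # nothing is allocated at this address
    resource_id, owner_tag = row
    if sender_tag != owner_tag:
        return None  # identity mismatch: the data is ignored
    return resource_id, data
```

  • A write from the owning process reaches the resource, while a write to the same sub-page address from any other process is silently dropped.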
  • the set of resources allocated to an application or process may be considered to constitute a virtual network interface (VNIC).
  • Once a virtual interface has been composed, it may be reconfigured dynamically. As one example of dynamic reconfiguration, a resource that is no longer required may be freed-up. To achieve this the application using the resource calls a de-allocation routine in the user level transport library 223 .
  • the de-allocation routine calls the kernel driver 225 , which instructs the RCU to de-allocate the resource by disabling it, clearing its status and deleting its row in the table 237 .
  • additional resources may be added to the VNIC.
  • the process is analogous to that described above for initial composition of the VNIC.
  • resources may be passed from one application or process to another. This is most useful in the situation where a single application has multiple processes and wants to pass control of a resource from one process to another, for example if data from the network is to be received into and processed by a new process.
  • the application using the resource calls a re-allocation routine in the transport library 223 .
  • the re-allocation routine calls the kernel driver 225 , which instructs the RCU to re-allocate the resource modifying its row in the table 237 to specify the identity of the application or process that is taking over its control.
  • data received from the network 240 may be being passed to an application 222 for processing.
  • the application has a queue 226 in a memory 227 connected to the bus 118 .
  • the queue is managed in part by the transport library 223 , which provides a DMA queue resource 231 on the NIC 116 with an up-to-date pointer to the next available location on the queue 226 . This is updated as the application reads data from the queue 226 .
  • When data is received from the network it is passed to an event queue resource 232 , which writes it to the location identified by the pointer and also triggers an event such as an interrupt on the computing device 114 to indicate that data is available on the queue. In order for this to happen the event queue resource 232 must learn the pointer details from the DMA queue resource 231 . This requires data to be passed from the DMA queue resource to the event queue resource.
  • the “process tag” column of the table 237 can be treated more generally as an ownership tag, and can link the DMA queue to the related event queue.
  • the ownership tag of the event queue can be set to the identity of the related DMA queue.
  • When the DMA queue needs to pass data to the related event queue it can identify the event queue from the table 237 by performing a look-up on its own identity in the ownership tag column.
  • Data intended to be passed from one resource to another can be checked by the bus controller 235 to ensure that it is compatible with the settings in the table 237 .
  • the bus controller checks that there is a row in the table 237 that has the identity of the resource that is the source of the data in the ownership tag field, and the identity of the resource that is the intended destination of the data in the resource ID field. If there is no match then the data is prevented from reaching its destination. This provides additional security and protection against corruption.
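  • The use of the ownership tag to link resources, and the bus controller's check on resource-to-resource transfers, might be modelled as below. The row contents and function names are hypothetical and chosen only to illustrate the two look-ups.

```python
# Hypothetical rows of table 237: (resource ID, ownership tag).
TABLE_237 = [
    ("dma_queue_231", "process_1001"),
    ("event_queue_232", "dma_queue_231"),  # the event queue is tagged with its DMA queue
]

def find_linked_resource(source_id):
    """Look up the resource whose ownership tag is the source's own identity."""
    for resource_id, owner_tag in TABLE_237:
        if owner_tag == source_id:
            return resource_id
    return None

def transfer_allowed(source_id, dest_id):
    """Allow a transfer only if a row has dest_id as resource and source_id as owner."""
    return any(r == dest_id and o == source_id for r, o in TABLE_237)
```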
  • the identities of resources linked in this way can also be reconfigured dynamically by means of the re-configuration routines in the transport library.
  • FIG. 3 is an example broad outline of steps that an application process might follow in the system of FIGS. 1 and 2 for the creation and transmission of IP packets onto the network 112 .
  • This flowchart covers only the TCP and UDP transport level protocols; other embodiments can support other protocols, including but not limited to SCTP, RTP, ICMP and IGMP.
  • In a step 310 , when the application first starts up, its libraries are initialized. This includes the user level transport library 224 , which is initialized into the application's virtual address space.
  • Step 312 begins an example sequence of steps in which the application process uses a UDP transport protocol.
  • the application makes a call to the socket( ) routine of the user level transport library 224 , specifying that it would like a UDP socket.
  • In step 314 , the application process binds the socket to a port using a call to the bind( ) routine of the user level transport library 224 , and in step 316 , it begins writing its transmit data into the application's buffers in the application's virtual address space.
  • In step 318 , after sufficient data has been written into the buffers for one or more data packets, the application process makes a call to the sendTo( ) routine of the user level transport library 224 , specifying the socket handle, the buffer or buffers, the destination IP address and the destination port to which the packet is to be sent. Steps 316 and 318 are repeated many times, most likely interspersed with many other functions performed by the application process.
  • the application makes a call to the close( ) routine of the user level transport library 224 in order to close the socket.
  • Step 322 begins an example sequence of steps in which the application process uses a TCP transport protocol.
  • In step 322 , instead of calling the socket routine of the user level transport library 224 to specify the UDP protocol, it calls the socket routine to specify the TCP protocol.
  • In step 324 the application process calls the bind( ) routine similarly to step 314 , in order to bind the socket to a port.
  • In step 326 , since the transport protocol is now TCP, the application process calls the connect( ) routine of the user level transport library 224 , in order to form a TCP connection with a specified destination IP address and port.
  • In step 328 the application process writes transmit data into buffers in the application program's virtual address space, similarly to step 316 , and in step 330 , when ready, the application process calls the send( ) routine of the user level transport library 224 in order to have the data packet processed according to the TCP protocol and transmitted out to the network via network interface card 116 .
  • Steps 328 and 330 can be repeated many times, and when the application process has finished with the socket, it calls the close( ) routine of the user level transport library 224 (step 332 ).
  • Only step 310 , initializing the transport library 224 , need involve a context switch to a kernel level process.
  • all of the remaining steps can be performed by the user level transport library 224 without involvement of the kernel driver 225 . While this feature can help improve performance markedly, it also creates a risk that non-standard or third-party transport libraries will be installed for the application program in place of trusted code.
  • the network interface card 116 itself protects against one or more of the risks that might arise.
  • FIG. 4 is a flowchart of steps that might be performed during the library initialization step 310 of FIG. 3 .
  • the user level transport library 224 is loaded into the application's virtual address space.
  • the transport library loaded in step 410 might be a very thin layer (or incorporated within another general purpose library such as glibc), which does little more than make calls to the kernel driver 225 and return any results to the application.
  • the transport library loaded in step 410 is more sophisticated in that it includes the user level routines described herein. To the extent that the user level transport library loaded in step 410 bypasses or performs functions differently than as described herein, the system can protect against certain kinds of errors.
  • a resource allocation routine in the kernel driver 225 is invoked.
  • the kernel level routine is required for allocating resources in the network interface card and the host memory subsystem 122 , since these resources are outside the virtual address space of the application, or involve direct hardware accesses that advisedly are restricted to kernel processes.
  • the user level driver initialization routine 310 may perform a number of other steps before it returns to the application in step 414 .
  • the kernel resource allocation routine 412 allocates memory and an initial set of resources for the application program, and maps these into the application's virtual address space. Before discussing the particular steps performed by the kernel resource allocation routine 412 , it will be useful to understand some of the formats in which the system maintains its queue structures.
  • FIG. 5 is a block diagram of various data structures used by the system of FIGS. 1 and 2 in order to support separate transmit and receive queues for each of the VNIC.
  • the diagram indicates which structures exist in host memory 122 and which exist on the NIC 116 .
  • the transmit and receive data buffers, the transmit and receive DMA descriptor queues, as well as one or more event queues, are all resident in host memory 122 and made up of generalized buffers which can be discontiguous and interspersed with each other in host memory 122 .
  • the buffers being used as transmit data buffers are identified as “TX DATA BUF #n”, and the buffers being used for a transmit queue are identified as “TX QUEUE BUF #n”.
  • the buffers being used for the event queue are identified as “EV QUEUE BUF #n”. Additional buffers in host memory 122 , not shown explicitly in FIG. 5 , are used for receive data buffers and for a receive queue.
  • One process may have any number of transmit, receive and event queues, and all of them share the pool of generalized buffers that have been mapped into that process's virtual address space. Other processes use a different pool of generalized buffers mapped into their respective virtual address spaces.
  • Individual buffers may be either 4 k or 8 k bytes long in one embodiment, and they are chained together into logically contiguous sequences by means of physically contiguous descriptors in a buffer descriptor table 510 stored in the NIC 116 .
  • one transmit queue might occupy buffers 512 , 514 and 516 in host memory 122 , which are discontiguous and possibly out-of-order regions of memory. They are chained together into a single logically contiguous space by the physically contiguous entries 518 , 520 and 522 in the buffer descriptor table 510 .
  • the entries 518 , 520 and 522 are written and managed by the host 114 and are viewed as a wrap-around ring.
  • the host will allocate a physically contiguous sequence of 16 entries in buffer descriptor table 510 for this transmit buffer list.
  • one event queue might occupy buffers 526 , 528 and 530 in host memory 122 .
  • These buffers are discontiguous and possibly out-of-order in host memory, but are chained together into a single logically contiguous wrap-around space by the physically contiguous entries 532 , 534 and 536 in the buffer descriptor table 510 .
  • the buffer descriptor table 510 is indexed by “buffer ID”, and each of its entries identifies, among other things, the base address of the corresponding buffer in host memory 122 .
  • the NIC 116 includes a transmit queue descriptor table 540 , a receive queue descriptor table 541 , and an event queue descriptor table 542 .
  • Each transmit queue has a corresponding transmit queue ID, which is used as an index into the transmit queue descriptor table 540 .
  • the designated entry in the transmit queue descriptor table 540 is the starting point for describing the state and other characteristics of that particular transmit queue, as viewed by the NIC 116 . Each such entry identifies, among other things:
  • In order to retrieve current transmit data from a particular transmit queue in host memory 122 , the NIC 116 first uses the ID of the particular transmit queue to look up, in the transmit queue descriptor table 540 , the buffer ID of the base buffer containing the transmit descriptor queue. The NIC 116 also obtains from the same place, the current device centric buffer list read pointer into that transmit descriptor queue. It then uses the base buffer ID as a base, and the device centric buffer list read pointer high order bits as an offset, into the buffer descriptor table 510 , to obtain the base address in host memory 122 of the buffer that contains the particular transmit buffer list 312 . The NIC then uses that base address as a base, and the device centric buffer list read pointer low order bits times the number of bytes taken up per descriptor as an offset, to retrieve from host memory 122 the current entry in the particular transmit descriptor queue.
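  • The two-level address calculation described above can be illustrated with a short sketch. It assumes 4 KB buffers and 8-byte descriptors (so the low-order 9 bits of the read pointer select a descriptor within a buffer); those sizes, the dictionary standing in for the buffer descriptor table 510 and the function name are assumptions for illustration.

```python
BUFFER_SIZE = 4096       # assumed buffer size in bytes
DESCRIPTOR_SIZE = 8      # assumed size of one transmit descriptor in bytes
PER_BUFFER = BUFFER_SIZE // DESCRIPTOR_SIZE   # descriptors held by one buffer (512)
LOW_BITS = PER_BUFFER.bit_length() - 1        # low-order bits of the pointer (9)

def descriptor_address(buffer_descriptor_table, base_buffer_id, read_ptr):
    """Return the host-memory address of the descriptor at read_ptr."""
    high = read_ptr >> LOW_BITS            # selects the buffer descriptor table entry
    low = read_ptr & (PER_BUFFER - 1)      # selects the descriptor within that buffer
    buffer_base = buffer_descriptor_table[base_buffer_id + high]
    return buffer_base + low * DESCRIPTOR_SIZE
```

  • Because only the buffer descriptor table entries need to be physically contiguous, the buffers themselves can sit anywhere in host memory and still be addressed as one logical queue.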
  • the current entry in the particular transmit descriptor queue contains, among other things:
  • the system handles receive queues in a similar manner.
  • the transmit queue descriptor table 540 entry designated by the transmit queue ID also contains the ID of the transmit event queue associated with the particular transmit queue.
  • the receive queue descriptor table 541 entry designated by the receive queue ID contains the ID of the event queue associated with the particular receive queue. All of the event queues for all the applications 222 are described by respective entries in the event queue descriptor table 542 .
  • the entry in the event queue descriptor table 542 identified by a queue ID from the transmit or receive queue descriptor table 540 or 541 is the starting point for describing the state and other characteristics of that particular event queue, as viewed by the NIC 116 .
  • each slot (e.g. 532 , 534 , 518 ) shown in the buffer descriptor table 510 represents a single descriptor
  • each slot (e.g. 526 , 528 , 514 ) in the host memory 122 represents a memory “page” of information.
  • a page might be 4 k or 8 k bytes long for example, so if a transmit data buffer descriptor in a transmit queue occupies either 4 or 8 bytes, then each slot 512 , 514 or 516 as shown in FIG. 5 might hold 512 , 1 k or 2 k transmit data buffer descriptors.
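  • The descriptor counts quoted above follow from dividing the page size by the descriptor size, as the following small calculation shows (the function name is illustrative only).

```python
def descriptors_per_page(page_bytes, descriptor_bytes):
    """Number of whole descriptors that fit in one page."""
    return page_bytes // descriptor_bytes

# All combinations of 4 KB / 8 KB pages and 4-byte / 8-byte descriptors.
counts = {(p, d): descriptors_per_page(p, d)
          for p in (4096, 8192) for d in (4, 8)}
```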
  • FIG. 6 is a flowchart illustrating significant steps performed by the kernel resource allocation routine 412 . If the user level transport library 224 fails to call this routine, then the addresses to which the user level transport library 224 would have to access in order to bypass subsequent trusted kernel routines will not be accessible to the user level process since they will not have been mapped into the application's virtual address space.
  • the kernel begins by allocating memory for the generalized buffers that will be used to hold the transmit, receive and event queues. It then maps the buffers into the application's virtual address space so that the application can read and write to them directly.
  • the kernel routine installs descriptors for these buffers in the buffer descriptor table 510 .
  • the kernel routine allocates a minimum set of the buffers for each of the transmit, receive and event queues requested, and programs their buffer IDs into the transmit, receive and event queue descriptor tables 540 , 541 and 542 .
  • the kernel routine determines the “doorbell” address in the NIC 116 for each of the transmit and receive queues, and maps them as well into the application's virtual address space.
  • the doorbell address is the address to which the application will write a value, in order to notify the NIC either that a transmit buffer is ready or that a receive buffer can be released.
  • the doorbell address is the address of the device centric transmit queue read pointer in the transmit queue descriptor table 540 entry for the particular transmit queue.
  • the doorbell address is the address of the device centric receive queue write pointer in the receive queue descriptor table 541 entry for the particular receive queue.
  • the kernel routine programs into the NIC 116 certain access rights (authorization rights) that are to be associated with the particular transmit queue. These are the authorization rights to which the NIC 116 will look in order to determine whether a particular sending process has authority to send packets having certain characteristics, and the kernel routine programs them in dependence upon the privilege level of the process that made the kernel resource allocation call. Note that although the privilege level of an application process running in the computer system 114 is maintained on a per-process basis, the authorization rights are maintained on the NIC 116 on a per-queue basis. This enables the NIC 116 to validate outgoing transmit packets without having to know anything about the particular operating system running in the host computer or the privilege mechanisms that it uses.
  • the NIC can refer to different characteristics of a transmit data packet in order to determine whether the transmit queue has sufficient authority to send it.
  • the NIC checks only whether the packet is formed according to an allowed transport protocol. For example, in one embodiment user level processes may be permitted to send packets using only the TCP or UDP transport protocols, and no others.
  • Each data packet has a header field which identifies the transport protocol according to which it was formed, and the NIC can compare that protocol number with those that have been programmed into the NIC 116 as being allowed for the transmit queue from which the data packet was retrieved.
  • a field may be allocated in each entry of the transmit queue descriptor table 540 for identifying the allowed protocols. For example if only eight protocols are supported, an eight-bit field might be used, with each bit representing one of the protocols. If a bit is active, then the corresponding transport protocol is allowed; if it is inactive, then it is not.
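  • A per-queue bit field of this kind might be modelled as follows. The mapping of bit positions to protocols is a hypothetical choice made for the example; the keys are the standard IP protocol numbers for TCP (6), UDP (17), SCTP (132), ICMP (1) and IGMP (2).

```python
# Hypothetical assignment of bit positions to IP protocol numbers.
PROTOCOL_BIT = {6: 0, 17: 1, 132: 2, 1: 3, 2: 4}

def make_mask(allowed_protocols):
    """Build the bit field stored in the queue's descriptor table entry."""
    mask = 0
    for proto in allowed_protocols:
        mask |= 1 << PROTOCOL_BIT[proto]
    return mask

def protocol_allowed(mask, proto):
    """True if the bit for proto is active; unsupported protocols are never allowed."""
    bit = PROTOCOL_BIT.get(proto)
    return bit is not None and bool(mask & (1 << bit))
```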
  • the allowed protocols may be listed in a separate authorizations “database” maintained in the NIC 116 , such as that shown in FIG. 7 .
  • the authorizations database takes the form of a table in which each entry contains a queue ID and an indication of an allowed protocol for that queue. If several different protocols are allowed for a particular queue, then the queue ID appears in several different entries in the table.
  • When the NIC 116 is checking the validity of a transmit packet from a particular transmit queue, it searches the table for an entry that contains both the transmit queue ID and the transport protocol according to which the packet was formed (retrieved from the packet header). If the table does contain such an entry, then the packet is valid. If not, then the packet is rejected.
  • (The term “database” does not necessarily imply any unity of structure. For example, two or more separate databases, when considered together, still constitute a “database” as that term is used herein.)
  • the NIC can validate other characteristics of a transmit data packet. For example it can validate the source IP address, the source port number, the destination IP address and destination port number, either instead of or additionally to the allowed protocols.
  • FIG. 8 illustrates an example authorizations database that can support validating all such characteristics. As shown in FIG. 8 , each entry in the database table contains six fields: source IP address, source port number, destination IP address, destination port number, queue ID and allowed protocol number. In order to keep the table short, some of these fields may be filled with indications of numeric ranges, rather than only a specific number. For example, in some systems only privileged processes can transmit packets indicating that they were sourced from a port number in the range 0-1023.
  • The “source port” field of the table of FIG. 8 might contain only a single bit indicating whether source port numbers within the range 0-1023 are permitted.
  • When the NIC 116 is checking the validity of a transmit packet from a particular transmit queue, it searches the table of FIG. 8 for a single entry that contains (or includes, if numeric ranges are specified) the source IP address, the source port number, destination IP address, destination port number and the allowed protocol (all taken from the packet header), as well as the ID of the transmit queue from which the packet was retrieved. If the table does contain such an entry, then the packet is valid. If not, then it is rejected.
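  • A software model of the six-field search might look like the sketch below. It is not the hardware implementation: the class and function names, the use of CIDR networks for address ranges and the example table contents are all assumptions introduced for illustration.

```python
from dataclasses import dataclass
from ipaddress import ip_address, ip_network

@dataclass(frozen=True)
class AuthEntry:
    queue_id: int
    protocol: int
    src_net: str        # source addresses, expressed as a CIDR range
    src_ports: range
    dst_net: str        # destination addresses, expressed as a CIDR range
    dst_ports: range

    def matches(self, queue_id, protocol, src_ip, src_port, dst_ip, dst_port):
        """True only if every packet characteristic falls within this one entry."""
        return (queue_id == self.queue_id
                and protocol == self.protocol
                and ip_address(src_ip) in ip_network(self.src_net)
                and src_port in self.src_ports
                and ip_address(dst_ip) in ip_network(self.dst_net)
                and dst_port in self.dst_ports)

# Hypothetical entry: queue 3 may send UDP from 10.0.0.5, unprivileged ports only.
TABLE = [
    AuthEntry(3, 17, "10.0.0.5/32", range(1024, 65536), "0.0.0.0/0", range(0, 65536)),
]

def packet_valid(table, **pkt):
    """The packet is valid if a single entry matches all of its characteristics."""
    return any(entry.matches(**pkt) for entry in table)
```

  • Requiring a single entry to match all fields, rather than matching each field against a separate list, prevents a queue from combining permissions granted for different flows.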
  • After the kernel resource allocation routine programs the authorization rights for the transmit queue into the NIC 116 , it returns to the application with handles for the resources allocated, with the base virtual addresses of the transmit, receive and event queues, and virtual memory addresses corresponding to the doorbells allocated in the transmit and receive queue descriptor tables 540 and 541 (step 620 ).
  • FIG. 9 is a flowchart of the user level routine 900 in the transport library 224 for establishing a socket.
  • the socket routine is called in both steps 312 and 322 of FIG. 3 .
  • the routine first determines whether the user level transport library 224 contains the code necessary to support the specified protocol. If not, then in step 912 , it makes a system call to pass the request on to the kernel to handle. If it does support the specified protocol, then in step 914 it allocates its internal state for the new socket. In step 916 the routine returns to the application program with a handle for the new socket.
  • the kernel may check at this time whether the calling process has authority to use the specified protocol number. This call, and the accompanying context switches, are unnecessary in the system of FIGS. 1 and 2 since any illegal protocols will be detected and rejected downstream by the NIC 116 .
  • FIG. 10 is a flowchart of the user level routine in the transport library 224 for binding a port number to a socket.
  • the bind routine is called in both steps 314 and 324 of FIG. 3 .
  • the routine first determines whether the caller provided a port number. If not, then in step 1012 , a port number is assigned. If the caller did specify a port number, then in step 1014 , the routine determines whether the port number is legal for the current user process.
  • At least four different mechanisms might be used in the same or different embodiments to ensure that different processes do not interfere with each other's use of particular IP address/port number combinations, and that user processes do not improperly operate through physical or logical port numbers or other resources that should be reserved for the kernel.
  • a system-wide policy exists which allocates all port numbers within a particular range to the user stack only.
  • the user level bind( ) routine can be designed to immediately accept only those requests from a user level process to bind to one of such port numbers, or to immediately pass such requests on to the kernel to handle.
  • the kernel allocates a unique IP address for that instance to use as it wishes. If each instance of the user level driver has its own exclusively assigned IP address, then the instance can manage the available port numbers for use with that IP address without risking interference with any other process. This mechanism is useful only if there are sufficient numbers of IP addresses available to the computer system to allocate to the various requesting processes. Again, this mechanism can be used in conjunction with the first, to reject or pass on to the kernel all user level requests to bind to a kernel-only port number, regardless of the exclusivity of an assigned IP address.
  • the initialization routine makes a number of anticipatory bind( ) calls to the kernel in order to form a pool of port numbers that the user level driver instance can later allocate to the application program upon receipt of bind( ) calls to the user level driver.
  • This mechanism can succeed with far fewer IP addresses available to the computer system, but also undesirably involves a context switch (during library initialization) for each port number to be added to the pool.
  • no IP address/port number combinations are pre-allocated to the particular instance of the user level driver. Instead, the user level bind( ) routine invokes the kernel bind( ) routine for each user level bind( ) call received.
  • This mechanism utilizes IP address/port number combinations most conservatively, but may require more context switches than any of the first, second and third mechanisms. In an embodiment, this fourth mechanism is used only as a backup, for example if the user level process requires more port numbers than were made available using the anticipatory bind( ) calls in the third mechanism.
  • If in step 1014 the user level bind( ) routine determines that the requested port number is not available to the current instance of the user level driver, or otherwise cannot determine whether it is available, then in step 1016 the routine makes a call to the kernel bind( ) routine to pass the request on to the kernel to handle. If the fourth mechanism above is the only way that the particular embodiment avoids conflicting or illegal allocation of address/port number combinations, then step 1016 will be taken during every user level call to the bind( ) routine 1000 . Otherwise, step 1016 will be taken only as a backup if pre-allocated port numbers have been exhausted, or if the routine otherwise cannot determine that the requested port number is available.
  • step 1018 the routine updates the application's state internally, to bind the port number with the specified socket.
  • the routine returns to the caller in step 1020 .
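  • The combination of a pre-allocated port pool (the third mechanism) with a kernel fallback (the fourth mechanism) can be sketched as follows. The class, the `fake_kernel_bind` stand-in for the kernel bind( ) routine and the port numbers are hypothetical and serve only to show the control flow.

```python
class UserLevelBinder:
    def __init__(self, port_pool, kernel_bind):
        self.free_ports = set(port_pool)  # ports obtained by anticipatory kernel bind() calls
        self.kernel_bind = kernel_bind    # stand-in for the kernel bind() routine
        self.bound = {}                   # socket handle -> bound port

    def bind(self, sock, port=None):
        if port is None:
            if self.free_ports:
                port = min(self.free_ports)            # step 1012: assign a port
            else:
                port = self.kernel_bind(sock, None)    # pool exhausted: ask the kernel
        elif port not in self.free_ports:
            port = self.kernel_bind(sock, port)        # step 1016: pass to the kernel
        self.free_ports.discard(port)
        self.bound[sock] = port                        # step 1018: update internal state
        return port

def fake_kernel_bind(sock, port):
    """Hypothetical kernel routine: refuses privileged ports for user processes."""
    if port is not None and port < 1024:
        raise PermissionError("port reserved for privileged processes")
    return port if port is not None else 40000
```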
  • Although the user level bind routine of FIG. 10 attempts to prevent the allocation of problematical port numbers to user level processes as described above, it will be appreciated that a different implementation of the user level bind routine may not be as careful. It is a feature of the invention that the NIC 116 can be designed to detect and reject transmit packets which do designate a source port number that is illegal, or for which the sending process lacks sufficient privilege to use. Therefore, no damage will occur if transport library routines are used which do not follow the steps set forth, as long as any errors they cause are of a type that the NIC 116 is designed to detect downstream.
  • FIG. 11 is a flowchart of the user level sendTo routine called in step 318 of FIG. 3 .
  • This routine is called for UDP protocol packets for which no connection to a destination IP address or port is established.
  • the routine therefore takes as arguments not only the socket handle and the data buffer address, but also the destination IP address and destination port number.
  • In step 1106 , the routine first determines whether there are known improper arguments. If so, then in step 1108 , it invokes the kernel or returns an error. If not, then in step 1110 , the routine applies the transport protocol, forming a UDP header and an IP header. It then forms a transmit descriptor pointing to the headers and to the data buffer identified by the caller.
  • In step 1112 the routine pushes the transmit descriptor onto the transmit queue by writing the transmit descriptor into the next available entry in transmit queue buffers 512 , 514 or 516 ( FIG. 5 ).
  • The routine then updates its own host centric transmit queue write pointer.
  • In step 1114 the routine writes its updated transmit queue write pointer into the NIC 116 at the doorbell address that had been previously associated with the current transmit queue.
  • In step 1116 the routine returns to the caller.
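  • The descriptor push and doorbell write of steps 1112 and 1114 might be modelled as below. The ring is a plain list standing in for the transmit queue buffers, and the doorbell is a callable standing in for a store to the mapped doorbell address; both, along with the class name, are assumptions for illustration. A real implementation would also need to check that the queue is not full, which this sketch omits.

```python
class TransmitQueue:
    def __init__(self, size, doorbell):
        self.ring = [None] * size   # stands in for transmit queue buffers 512, 514, 516
        self.write_ptr = 0          # host centric transmit queue write pointer
        self.doorbell = doorbell    # models a write to the mapped doorbell address

    def push(self, descriptor):
        # Step 1112: write the descriptor into the next available entry.
        self.ring[self.write_ptr] = descriptor
        # Advance the host centric write pointer, wrapping around the ring.
        self.write_ptr = (self.write_ptr + 1) % len(self.ring)
        # Step 1114: notify the NIC by writing the updated pointer to the doorbell.
        self.doorbell(self.write_ptr)
```

  • Since the doorbell address is mapped into the application's virtual address space, the notification is an ordinary store and involves no system call.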
  • FIG. 12 is a functional block diagram of certain structures on the NIC 116 which can be used to perform packet validation for transmission of data packets.
  • the structures include a DMA controller 1210 which controls the retrieval of transmit data packets from the host memory subsystem 122 . Data packets, as they arrive, are enqueued into a transmit FIFO 1214 . From the head of the transmit FIFO 1214 , data packets are presented to a physical interface 1216 which performs any further protocol processing (such as appending an Ethernet header) and writes them out onto the network 112 .
  • the transmit FIFO 1214 also has associated therewith FIFO control logic 1218 , which includes read and write pointers into the transmit FIFO 1214 as well as other functions described hereinafter.
  • the NIC 116 also includes header validation logic 1220 which observes the contents of data packets as they arrive from the bus 118 and are placed into the transmit FIFO 1214 .
  • the header validation logic 1220 makes reference to the authorizations database 1222 described previously
  • the header validation logic 1220 compares the data packet characteristics to those in the authorizations database 1222 to determine whether the data packet arriving from the bus 118 is authorized. If it is not, then the header validation logic 1220 can cause the DMA controller 1210 to abort the current transfer, and can also cause the FIFO control logic 1218 to unwind its write pointer back to the end of the previous packet in the transmit FIFO 1214 .
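  • The abort-and-unwind behaviour can be illustrated with a simple model of the transmit FIFO. The class, the chunked transfer and the `header_ok` predicate are hypothetical; in particular, the sketch assumes for simplicity that the whole header arrives in the first chunk.

```python
class TransmitFifo:
    def __init__(self):
        self.data = bytearray()
        self.packet_start = 0   # write pointer position at the end of the previous packet

    def write(self, chunk):
        self.data += chunk

    def commit(self):
        """Accept the packet just written; later unwinds will not remove it."""
        self.packet_start = len(self.data)

    def unwind(self):
        """Move the write pointer back to the end of the previous packet."""
        del self.data[self.packet_start:]

def dma_packet(fifo, chunks, header_ok):
    """Stream a packet into the FIFO, aborting if its header is not authorized."""
    for i, chunk in enumerate(chunks):
        fifo.write(chunk)
        if i == 0 and not header_ok(chunk):
            fifo.unwind()   # abort the transfer and discard the partial packet
            return False
    fifo.commit()
    return True
```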
  • FIG. 13 is a flowchart of typical steps taken on the NIC 116 for transmission of data packets arriving from multiple queues. Initially, the NIC implements an algorithm for choosing among the multiple transmit queues for the next queue to service. This algorithm is unimportant for an understanding of the invention, and therefore is not described. In step 1310 , the algorithm selects one particular transmit queue to examine.
  • the NIC 116 determines whether the device centric write pointer for the current transmit queue modulo-exceeds the device centric read pointer for the current transmit queue. These values are available to the NIC in the transmit queue descriptor table 540 entry for the current transmit queue, and the test will be positive if one of the transport libraries 224 or 225 has updated the device centric write pointer to notify the NIC of the availability of the data packet for transmission.
  • The term “modulo-exceeds” is used herein to accommodate wrap-around (circular) queues. That is, the device centric write pointer “modulo-exceeds” the device centric read pointer for a queue if the write pointer exceeds the read pointer, modulo the queue length.
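  • One way to express the “modulo-exceeds” test in software is to compute the distance from the read pointer to the write pointer modulo the queue length and treat any non-zero distance as pending work. This is one interpretation offered as a sketch; the function names are illustrative.

```python
def pending_entries(write_ptr, read_ptr, queue_length):
    """Number of entries written but not yet read, allowing for wrap-around."""
    return (write_ptr - read_ptr) % queue_length

def modulo_exceeds(write_ptr, read_ptr, queue_length):
    """True if the write pointer is ahead of the read pointer in the circular queue."""
    return pending_entries(write_ptr, read_ptr, queue_length) > 0
```

  • For example, in a queue of length 8 a write pointer of 1 modulo-exceeds a read pointer of 6, because the writer has wrapped around and three entries are outstanding.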
  • step 1314 the NIC 116 proceeds to examine the next transmit queue according to its algorithm.
  • step 1312 the NIC 116 reads one or more transmit descriptors from the current transmit queue, beginning at the entry pointed to by the device centric read pointer.
  • step 1318 the NIC 116 programs the DMA controller 1210 to retrieve the packet from host memory 122 into transmit FIFO 1214 .
  • In step 1320 , during the retrieval process, the NIC examines the header information on the packet as it is being retrieved, and tests the current queue's authority to send packets having the characteristics of that being retrieved. If the NIC 116 determines that the packet is authorized (step 1322 ), then in step 1324 , after packet retrieval, the NIC 116 will update its device centric transmit queue read pointer.
  • the NIC 116 then writes a transmit completion event into the event queue associated with the current transmit queue, for eventual retrieval by the user level process.
  • the NIC 116 might wait to complete retrieval of a number of transmit data packets before writing a “batched” transmit completion event covering all of them.
  • the process then returns to step 1310 for the queue selection algorithm to select the same or another transmit queue.
  • the NIC 116 transmits the packet from the head of the transmit FIFO 1214 out onto the network 112 .
  • step 1326 the header validation logic 1220 ( FIG. 12 ) causes the DMA controller 1210 to abort the current transfer, thereby freeing up the bus 118 . It also notifies the FIFO control logic 1218 to unwind the transmit FIFO queue write pointer as previously described. The NIC 116 may also report an error back to the application program.
  • the NIC 116 transmits packets onto network 112 only if the sending transmit queue is authorized to transmit packets having the characteristics for which header validation logic 1220 checks. In some embodiments still other requirements might be necessary before the NIC will allow the packet to go out.
  • identification does not necessarily require the direct specification of that item of information.
  • Information can be “identified” in a field simply by referring to the actual information through one or more layers of indirection, or by identifying one or more items of different information which are together sufficient to determine the actual item of information.
  • indicate is used herein to mean the same as “identify”.

Abstract

Roughly described, a network interface device receiving data packets from a computing device for transmission onto a network, the data packets having a certain characteristic, transmits the packet only if the sending queue has authority to send packets having that characteristic. The data packet characteristics can include transport protocol number, source and destination port numbers, source and destination IP addresses, for example. Authorizations can be programmed into the NIC by a kernel routine upon establishment of the transmit queue, based on the privilege level of the process for which the queue is being established. In this way, a user process can use an untrusted user-level protocol stack to initiate data transmission onto the network, while the NIC protects the remainder of the system or network from certain kinds of compromise.

Description

REFERENCE TO RELATED APPLICATIONS
This application is a continuation of U.S. patent application Ser. No. 11/116,018, filed 27 Apr. 2005, incorporated herein by reference in its entirety.
BACKGROUND
1. Field of the Invention
The invention relates to network interfaces, and more particularly to mechanisms for validating network traffic sent or received by user level libraries in a virtual network architecture.
2. Description of Related Art
A typical computer system includes a processor subsystem (including one or more processors), a memory subsystem (including main memory, cache memory, etc.), and a variety of “peripheral devices” connected to the processor subsystem via a peripheral bus. Peripheral devices may include, for example, keyboard, mouse and display adapters, disk drives and CD-ROM drives, network interface devices, and so on. The processor subsystem communicates with the peripheral devices by reading and writing commands and information to specific addresses that have been preassigned to the devices. The addresses may be preassigned regions of a main memory address space, an I/O address space, or another kind of configuration space. Communication with peripheral devices can also take place via direct memory access (DMA), in which a peripheral device (or another agent on the peripheral bus) transfers data directly between the memory subsystem and one of the preassigned regions of address space assigned to the peripheral devices.
Most modern computer systems are multitasking, meaning they allow multiple different application programs to execute concurrently on the same processor subsystem. Most modern computer systems also run an operating system which, among other things, allocates time on the processor subsystem for executing the code of each of the different application programs. One difficulty that might arise in a multitasking system is that different application programs may wish to control the same peripheral device at the same time. In order to prevent such conflicts, another job of the operating system is to coordinate control of the peripheral devices. In particular, only the operating system can access the peripheral devices directly; application programs that wish to access a peripheral device must do so by calling routines in the operating system. The placement of exclusive control of the peripheral devices in the operating system also helps to modularize the system, obviating the need for each separate application program to implement its own software code for controlling the hardware.
The placement of exclusive control of the peripheral devices in the operating system also permits management of another potential difficulty, that of improper control or handling of the peripheral device. For network interface devices, for example, improper or inappropriate control of the devices could compromise other applications running in the computer system, or could compromise or otherwise negatively impact operation of the network to which the device is connected. In established operating systems, much of the software code for controlling these devices has evolved over a number of years and has been updated and improved in response to numerous tests by numerous people on numerous types of network interface devices. The software code in the operating system has therefore developed a certain level of trust: users, network administrators, network architects and other network devices can presume that the great majority of packets originating from this software code will conform to network protocol specifications. Additional code for controlling each particular peripheral device is incorporated into the operating system in the form of a device driver specific to the particular peripheral device. Device drivers are usually written by or in association with the manufacturer of the particular peripheral device, so they too are afforded a certain level of trust.
The part of the operating system that controls the hardware is usually the kernel. Typically it is the kernel which performs hardware initializations, setting and resetting the processor state, adjusting the processor internal clock, initializing the network interface device, and other direct accesses of the hardware. The kernel executes in kernel mode, also sometimes called trusted mode or a privileged mode, whereas application level processes execute in a user mode. Typically it is the processor subsystem hardware itself which ensures that only trusted code, such as the kernel code, can access the hardware directly. The processor enforces this in at least two ways: certain sensitive instructions will not be executed by the processor unless the current privilege level is high enough, and the processor will not allow user level processes to access memory locations (including memory mapped addresses associated with specific hardware resources) which are outside of a user-level physical or virtual address space already allocated to the process. As used herein, the term “kernel space” or “kernel address space” refers to the address and code space of the executing kernel. This includes kernel data structures and functions internal to the kernel. The kernel can access the memory of user processes as well, but “kernel space” generally means the memory (including code and data) that is private to the kernel and not accessible by any user process. The term “user space”, or “user address space”, refers to the address and code space allocated by a code that is loaded from an executable and is available to a user process, excluding kernel private code data structures. As used herein, all four terms are intended to accommodate the possibility of an intervening mapping between the software program's view of its own address space and the physical memory locations to which it corresponds. 
Typically the software program's view of its address space is contiguous, whereas the corresponding physical address space may be discontiguous and out-of-order, and even potentially partly on a swap device such as a hard disk drive. Address spaces are sometimes referred to herein as “virtual” address spaces, in order to emphasize the possibility of such mappings.
Although parts of the kernel may execute as separate ongoing kernel processes, much of the kernel is not actually a separate process running on the system. Instead it can be thought of as a set of routines, to some of which the user processes have access. A user process can call a kernel routine by executing a system call, which is a function that causes the kernel to execute some code on behalf of the process. The “current process” is still the user process, but during system calls it is executing “inside of the kernel”, and therefore has access to kernel address space and can execute in a privileged mode. Kernel code is also executed in response to an interrupt issued by a hardware device, since the interrupt handler is found within the kernel. The kernel also, in its role as process scheduler, switches control between processes rapidly using the clock interrupt (and other means) to trigger a switch from one process to another. Each time a kernel routine is called, the current privilege level increases to kernel mode in order to allow the routine to access the hardware directly. When the kernel relinquishes control back to a user process, the current privilege level returns to that of the user process.
When a user level process desires to communicate with the NIC, conventionally it can do so only through calls to the operating system. The operating system implements a system level protocol processing stack which performs protocol processing on behalf of the application, and also performs certain checks to make sure outgoing data packets have authorized characteristics and are not malformed. In particular, an application wishing to transmit a data packet using TCP/IP calls the operating system API (e.g. using a send( ) call) with data to be transmitted. This call causes a context switch to invoke kernel routines to copy the data into a kernel data buffer and perform TCP send processing. Here protocol is applied and fully formed TCP/IP packets are enqueued with the interface driver for transmission. Another context switch takes place when control is returned to the application program. Note that kernel routines for network protocol processing may be invoked also due to the passing of time. One example is the triggering of retransmission algorithms. Generally the operating system provides all OS modules with time and scheduling services (driven by the hardware clock interrupt), which enable the TCP stack to implement timers on a per-connection basis. The operating system performs context switches in order to handle such timer-triggered functions, and then again in order to return to the application.
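The conventional path described above can be illustrated from the application's side. The following is a minimal sketch only: it uses Python's standard `socket` module, and a locally connected socket pair stands in for a TCP/IP connection so that the example runs without network access (that substitution is an assumption of this sketch, not something the text describes). Each call that hands data to or takes data from the operating system crosses the user/kernel boundary.

```python
import socket

# A connected pair of stream sockets stands in for a TCP connection
# (assumption of this sketch; on many systems this is a local socket pair).
app_side, peer_side = socket.socketpair()

payload = b"hello"

# The send call enters the operating system, which copies the payload into a
# kernel buffer and performs the send processing on the application's behalf.
app_side.sendall(payload)

# Read until the whole payload has arrived; stream sockets may return
# fewer bytes than requested in a single call.
received = b""
while len(received) < len(payload):
    chunk = peer_side.recv(len(payload) - len(received))
    if not chunk:
        break
    received += chunk

app_side.close()
peer_side.close()
```

The application never touches the network hardware here; every transfer is mediated by a system call.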
It can be seen that network transmit and receive operations can involve excessive context switching, and this can cause significant overhead. The problem is especially severe in networking environments in which data packets are often short, causing the amount of required control work to be large as a percentage of the overall network processing work.
One solution that has been attempted in the past has been the creation of user level protocol processing stacks operating in parallel with those of the operating system. Such stacks can enable data transfers using standard protocols to be made without requiring data to traverse the kernel stack. In one implementation, TCP and other protocols are implemented twice: once built into the kernel and once built into a user level transport library accessible to application programs. In order to control and/or communicate with the network interface device an application issues API (application programming interface) calls. Some API calls may be handled by the user level transport libraries, and the remainder can typically be passed on through the interface between the application and the operating system to be handled by the libraries that are available only to the operating system. For implementation with many operating systems it is convenient for the transport libraries to use existing Ethernet/IP based control-plane structures: e.g. SNMP and ARP protocols via the OS interface.
There are a number of difficulties in implementing transport protocols at user level. Most implementations to date have been based on porting pre-existing kernel code bases to user level. Examples of these are Arsenic and Jet-stream. These have demonstrated the potential of user-level transports, but have not addressed a number of the problems required to achieve a complete, robust, high-performance commercially viable implementation.
One particular problem with user-level transport libraries is that in bypassing many of the routines normally performed in the kernel, they also lose the trust normally accorded those routines. This is because the kernel no longer has control of the user-level routines and cannot enforce their identity with those in the kernel. Users or application programs are able to modify the user-level transport routines, or replace them with others provided by a third party. As a result, supporting user-level transport libraries in order to bypass kernel routines and avoid context switches increases the risk of malformed or even malicious traffic being driven onto the network.
Part of the risk of permitting user-level transport libraries can be overcome by virtualizing the network interface device in such a way that each process is aware of only its own resources. The hardware can be virtualized in such a way that one process cannot transmit or receive data on behalf of another, nor can one process see the data belonging to another process. But this kind of virtualization does not prevent a process from transmitting problematic data packets out onto the network through its own assigned resources; hence trust is still not ensured.
In order to address issues like the latter, roughly described, a network interface device receiving data packets from a computing device for transmission onto a network, the data packets having a certain characteristic, transmits the packet only if the sending queue has authority to send packets having that characteristic. The data packet characteristics can include transport protocol number, source and destination port numbers, source and destination IP addresses, for example. Authorizations can be programmed into the NIC by a kernel routine upon establishment of the transmit queue, based on the privilege level of the process for which the queue is being established. In this way, a user process can use an untrusted user-level protocol stack to initiate data transmission onto the network, while the NIC protects the remainder of the system from certain kinds of compromise.
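The authorization check summarized above might be sketched as follows. This is a hedged illustration, not the disclosed hardware: the names `Header`, `Rule`, `Nic`, `program_queue` and `may_transmit` are hypothetical, and the convention that an unset rule field matches any value is an assumption made for the sketch.

```python
from dataclasses import dataclass
from typing import Dict, List, Optional


@dataclass(frozen=True)
class Header:
    """Packet characteristics named in the text."""
    protocol: int   # transport protocol number, e.g. 6 = TCP, 17 = UDP
    src_ip: str
    src_port: int
    dst_ip: str
    dst_port: int


@dataclass(frozen=True)
class Rule:
    """One authorization entry; None means any value (sketch assumption)."""
    protocol: Optional[int] = None
    src_ip: Optional[str] = None
    src_port: Optional[int] = None
    dst_ip: Optional[str] = None
    dst_port: Optional[int] = None

    def permits(self, h: Header) -> bool:
        pairs = (
            (self.protocol, h.protocol),
            (self.src_ip, h.src_ip),
            (self.src_port, h.src_port),
            (self.dst_ip, h.dst_ip),
            (self.dst_port, h.dst_port),
        )
        return all(want is None or want == got for want, got in pairs)


class Nic:
    """Hypothetical model of the per-queue authorizations held on the NIC."""

    def __init__(self) -> None:
        self._rules: Dict[int, List[Rule]] = {}

    def program_queue(self, queue_id: int, rules: List[Rule]) -> None:
        # Stands in for the kernel routine that programs authorizations
        # when the transmit queue is established.
        self._rules[queue_id] = list(rules)

    def may_transmit(self, queue_id: int, header: Header) -> bool:
        # A queue with no programmed rules is authorized for nothing.
        return any(r.permits(header) for r in self._rules.get(queue_id, []))
```

For instance, a queue programmed with `Rule(protocol=17, src_ip="10.0.0.5", src_port=5000)` would be allowed to send UDP packets from that address and port to any destination, and nothing else.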
BRIEF DESCRIPTION OF THE DRAWINGS
The invention will be described with respect to specific embodiments thereof, and reference will be made to the drawings, in which:
FIG. 1 is a simplified block diagram of a typical computer system incorporating features of the invention.
FIG. 2 is another view of the computer system of FIG. 1.
FIG. 2A illustrates an example bus mapping table of FIG. 2.
FIG. 3 is a flowchart example of steps that an application process might follow in the system of FIGS. 1 and 2 for the creation and transmission of IP packets onto a network.
FIG. 4 is a flowchart of steps that might be performed during the library initialization step of FIG. 3.
FIG. 5 is a block diagram of data structures used by the system of FIGS. 1 and 2 in order to support separate transmit and receive queues for different VNICs.
FIG. 6 is a flowchart illustrating significant steps performed by the kernel resource allocation routine of FIG. 4.
FIGS. 7 and 8 illustrate alternative example authorizations databases maintained in the NIC of FIG. 1.
FIG. 9 is a flowchart detail of the user socket routine in FIG. 3.
FIG. 10 is a flowchart detail of the bind routine in FIG. 3.
FIG. 11 is a flowchart detail of the sendTo routine in FIG. 3.
FIG. 12 is a functional block diagram of certain structures on a NIC which can be used to perform packet validation for transmission of data packets.
FIG. 13 is a flowchart of typical steps taken on the NIC 116 for transmission of data packets arriving from multiple queues.
DETAILED DESCRIPTION
The following description is presented to enable any person skilled in the art to make and use the invention, and is provided in the context of a particular application and its requirements. Various modifications to the disclosed embodiments will be readily apparent to those skilled in the art, and the general principles defined herein may be applied to other embodiments and applications without departing from the spirit and scope of the present invention. Thus, the present invention is not intended to be limited to the embodiments shown, but is to be accorded the widest scope consistent with the principles and features disclosed herein.
FIG. 1 is a simplified block diagram of a typical computer system 110 which can communicate via a network 112 with other computer systems such as 130, 132 and 134. Computer system 110 includes a network interface card (NIC) 116 communicating via a communication channel 118 with a host subsystem 114. The host subsystem 114 includes a processor subsystem 120 which includes at least one processor, a host memory subsystem 122, and a core logic subsystem 124. The core logic subsystem 124 provides bridges among the processor subsystem 120, the host memory subsystem 122 and the communication channel 118. The host subsystem 114 may also include other devices 126 in communication with the communication channel 118. As used herein, the host subsystem 114 is considered to be one type of “computing device”. Other types of computing devices include non-reconfigurable processing devices for which network connectivity is desired.
The network interface card 116 provides an interface to outside networks, including an interface to the network 112, and is coupled via network 112 to corresponding interface devices in other computer systems. The physical hardware components of network interfaces are referred to herein as network interface cards (NICs), although they need not be in the form of cards: for instance they could be in the form of integrated circuits (ICs) and connectors fitted directly onto a motherboard, or in the form of macrocells fabricated on a single integrated circuit chip with other components of the computer system.
Network 112 may comprise many interconnected computer systems and communication links. These communication links may be wireline links, optical links, wireless links or any other mechanism for communication of information. While in one embodiment network 112 is the Internet, in other embodiments, network 112 may be any suitable computer network or combination of networks. In an embodiment described herein, network 112 supports an Ethernet protocol.
Host memory subsystem 122 typically includes a number of memories including a main random access memory (RAM) for storage of instructions and data during program execution, and a read only memory (ROM) in which fixed instructions and data are stored. One or more levels of cache memory may also be included in the host memory subsystem 122. For simplicity of discussion, the host memory subsystem 122 is sometimes referred to herein simply as “host memory”. As used herein, virtual memory is considered part of the host memory subsystem even though part of it may be stored physically at various times on a peripheral device.
The communication channel 118 provides a mechanism for allowing the various components and subsystems of computer system 110 to communicate with each other. In one embodiment the communication channel 118 comprises a PCI bus. Other embodiments may include other buses, and may also include multiple buses.
Computer system 110 itself can be of varying types including a personal computer, a portable computer, a workstation, a computer terminal, a network computer, a television, a mainframe, a server or any other data processing system or user device. Due to the ever-changing nature of computers and networks, the description of computer system 110 depicted in FIG. 1 is intended only as a specific example for purposes of illustrating an embodiment of the present invention. Many other configurations of computer system 110 are possible having more or fewer components than, and configured similarly to or differently from, the computer system depicted in FIG. 1.
FIG. 2 is another view of the computer system 110, presented to better illustrate software and functional features. The computer runs an operating system 221 which is capable of supporting application processes 222 also running on the computer. As used herein, a “process” is a program, for example an application program, in execution. A process may run multiple “threads”, which can also be thought of as lightweight processes. A library 223 of instructions is stored by the computer and available to the applications. The part of the library usable for communications with the NIC 116 is termed a transport library 224. Included in the operating system kernel is a driver component 225: a dedicated set of instructions which allow for data transfer with the NIC 116. Each application would normally take the form of a software program running on the computer, but it could be embedded in firmware. Some operating systems include applications in addition to fundamental operating system code. Aspects of the system that are not described herein may be as set out in PCT publication No. WO2004/025477, incorporated herein by reference in its entirety.
The NIC 116 can support resources of a number of types: i.e. resources having capabilities of different natures. Examples include DMA queues, event queues, timers and support resources for remote apertures of the type described in WO2004/025477. Each type of resource (231-234) is provided from a dedicated hardware resource pool which can support numerous instances of resources of the respective type. In order for such an instance to be made operational it must be configured by means of instructions from the computing device 114, as described in more detail below.
The NIC 116 communicates with the computing device 114 over the bus 118. In this example the bus is a PCI bus, but the invention is not limited to such a bus. Data transmitted over the PCI bus 118 is associated with a destination address and is received by whichever entity connected to the bus has had that address allocated to it. In a typical PC implementation the addresses are allocated in pages of 4 or 8 kB. One or more of these pages may be allocated to the NIC 116. Blocks 251 and 252 represent allocated pages on the PCI bus 118.
The NIC 116 has a bus interface controller 235, a resource configuration unit 236 and a bus mapping table 237. The resource configuration unit processes communications received from the computer that provide instructions on the allocation, re-allocation and de-allocation of resources on the NIC 116, and configures the resources in accordance with such instructions. The kernel driver 225 stores a record of which resources on the NIC 116 are allocated. When a resource is to be allocated the driver 225 identifies a suitable free resource of the required type on the NIC 116 and transmits an allocation instruction to the NIC 116. The instruction identifies the resource and specifies the details of how it is to be allocated, including details of the internal configuration of the resource (e.g. in the case of a timer the amount of time it is to run for). That instruction is passed to the resource configuration unit. The resource configuration unit then loads the specified configuration into the identified resource. The instruction also includes an ownership string, which may be an identification of which application or process on the computer is using the resource. The resource configuration unit stores these in a row of the bus mapping table. An example of entries in the bus mapping table is shown in FIG. 2A and is described in more detail below. When a resource is to be re-allocated the relevant entries in the resource's own configuration store and in the bus mapping table are altered as necessary. When a resource is to be de-allocated it is disabled and any rows of the bus mapping table that relate to it are deleted.
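The allocation lifecycle described above can be sketched as two cooperating objects. This is an illustrative model under stated assumptions: the class names `KernelDriver` and `ResourceConfigUnit`, the dictionary layouts, and the sample configuration key `duration_us` are all hypothetical, and the bus mapping table is reduced here to a resource-to-owner map.

```python
class ResourceConfigUnit:
    """Hypothetical stand-in for the resource configuration unit on the NIC."""

    def __init__(self):
        self.config = {}             # resource id -> loaded configuration
        self.bus_mapping_table = {}  # resource id -> ownership string

    def allocate(self, resource_id, config, owner):
        # Load the specified configuration and record the owner in a table row.
        self.config[resource_id] = config
        self.bus_mapping_table[resource_id] = owner

    def reallocate(self, resource_id, owner):
        # Alter the existing row to name the new owner.
        self.bus_mapping_table[resource_id] = owner

    def deallocate(self, resource_id):
        # Disable the resource and delete its row.
        self.config.pop(resource_id, None)
        self.bus_mapping_table.pop(resource_id, None)


class KernelDriver:
    """Hypothetical stand-in for the driver's record of allocated resources."""

    def __init__(self, rcu, pool):
        self.rcu = rcu
        self.free = {rtype: list(ids) for rtype, ids in pool.items()}
        self.allocated = {}  # resource id -> resource type

    def allocate(self, rtype, config, owner):
        if not self.free.get(rtype):
            raise RuntimeError(f"no free resource of type {rtype!r}")
        resource_id = self.free[rtype].pop(0)  # a suitable free resource
        self.allocated[resource_id] = rtype
        self.rcu.allocate(resource_id, config, owner)
        return resource_id

    def deallocate(self, resource_id):
        rtype = self.allocated.pop(resource_id)
        self.rcu.deallocate(resource_id)
        self.free[rtype].append(resource_id)
```

In this sketch the driver alone decides which resource is free, while the unit on the NIC only applies what it is told, mirroring the division of labour in the paragraph above.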
The general operation of the system of FIGS. 1 and 2 for the transfer of data to and from the network will now be described.
During setup of the system one or more pages (251, 252) on the bus 118 are allocated to the NIC 116. Part of this address space (page 251) can be used by the kernel driver 225 to send instructions to the NIC 116. Other pages (e.g. page 252) can be used for communication between application processes such as application 222 and the resources 231-234. The resource configuration unit 236 stores a record of the pages that are allocated to the NIC 116 for use by resources. Note that in some embodiments, some or all of the functions of the resource configuration unit 236 may alternatively be provided by the kernel driver 225 itself.
When an application 222 wishes to open a data connection over the network it calls a routine in the user level transport library 224 to cause the NIC resources that are required for the connection to be allocated. Standard types of network connection require standard sets of resources; for example: an event queue, transmit (TX) and receive (RX) DMA queues, and a set of direct memory accessible (DMA'able) memory buffers. For example a typical set may contain one TX queue, one RX queue, two timers, and on the order of 100 DMA memory buffers.
The user level transport library 224 includes routines that can be called directly by the application process 222 and that initiate the allocation of such standard sets of resources, including set numbers of resources of different types. The transport library also includes routines that allow a resource of each type to be allocated, re-allocated or de-allocated individually. The presence of both these types of instruction means that standard connections can be set up efficiently, and yet non-standard groups of resources can be created, and existing connections can be reconfigured on a resource-by-resource basis. As used herein, a “user level stack” is any protocol processing software that runs in unprotected mode. A “protocol stack” is the set of data structures and logical entities associated with the networking interfaces. This includes sockets, protocol drivers, and the media device drivers.
The routines for allocation, re-allocation and de-allocation of resources require access to restricted memory mapped addresses, such as page 251 for sending configuration instructions to the NIC 116. Since the user level transport library 224 lacks the necessary privilege level to perform these accesses, these routines in the user level transport library 224 make calls to the kernel driver 225. In a Unix environment, for example, such calls might take the form of IOCtl( ) system calls. These calls cause an initial context switch to a kernel level process, which in turn communicates the instructions to the NIC 116 for the allocation of the resources as specified in the routines. Those instructions specify the identity of the application or process with which the resources are to be associated, and the nature of the resources. The instructions are processed by the resource configuration unit 236 of the NIC 116.
A feature of the system of FIG. 2 is that the space on the bus 118 that is allocated to the NIC 116 can be split dynamically between the resources on the bus 118. Once one or more pages 252 have been allocated to the NIC 116 for use by resources those resources can be allocated one or more individual sub-page addresses within that page, corresponding to locations as illustrated at 253, 254. Thus each resource can have a part of the total space allocated to it. A record of which part of the total space is allocated to which resource is stored in the bus mapping table 237. The effect is that a single page of the bus can be used for communication to resources of multiple types and/or resources that relate to multiple connections and/or resources that are associated with multiple applications or processes on the computer 114. As a result, the total bus space can be used relatively efficiently.
The usage of the allocated bus space 252 is managed by the kernel driver 225. When a resource is to be allocated, the driver identifies, using a data store whose content it manages, an unused block in the space on the bus that has already been allocated for use by resources of the NIC 116, the block being of the size required for the resource. It then stores in that data store the identity of the resource (“resource ID”), the address of the block within the allocated space (“sub-page ID”), and the identity of the application or process that is to use the resource (“process tag”); and sends a message to the resource configuration unit (RCU) 236 to cause it to store corresponding data in the bus mapping table 237 (as shown in FIG. 2A). If the RCU finds that table 237 indicates the address to be already occupied then it returns an error code to the driver. The sub-page address may need to be supplemented with the address of the page in which the sub-page lies if that cannot be inferred as a result of only a single page having been allocated for use by the resources. If the total space allocated for use by resources is insufficient then the kernel driver allocates it more space. Having allocated the resources, the RCU returns a success message to the kernel driver. The allocated page and sub-page addresses are returned to and mapped into the virtual address space of the user level process that requested the resources in order that it can access them by means of that data. Another context switch then takes place back to the user level calling process.
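The bookkeeping described above, in which each resource receives a block within an already allocated bus page, can be sketched as below. The class name `SubPageAllocator`, the first-fit search and the 4 kB default page size are assumptions of this sketch; the text does not specify a search strategy.

```python
PAGE_SIZE = 4096  # the text mentions 4 or 8 kB pages; 4 kB is assumed here


class SubPageAllocator:
    """Hypothetical record of which blocks of one bus page are in use."""

    def __init__(self, page_size=PAGE_SIZE):
        self.page_size = page_size
        # Each row: (resource_id, sub_page_offset, size, process_tag)
        self.rows = []

    def _overlaps(self, offset, size):
        # True if [offset, offset + size) intersects any recorded block.
        return any(offset < o + s and o < offset + size
                   for _, o, s, _ in self.rows)

    def allocate(self, resource_id, size, process_tag):
        # First-fit search at offsets that are multiples of the block size.
        offset = 0
        while offset + size <= self.page_size:
            if not self._overlaps(offset, size):
                self.rows.append((resource_id, offset, size, process_tag))
                return offset
            offset += size
        # In the text, the driver would allocate more bus space at this point.
        raise MemoryError("page full; more bus space would be needed")
```

The returned offset plays the role of the sub-page ID; together with the resource ID and process tag it forms the row that would be mirrored into the bus mapping table.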
An application that has had resources allocated to it can access them by sending data (e.g. by means of load/store cycles through a virtual memory mapping) to the relevant bus page, at the sub-page address corresponding to the respective resource. Since these addresses are part of the application's virtual address space, no context switch to any kernel level process is required in order to perform these accesses. Any data sent to pages allocated to resources is picked off the bus 118 by the bus interface controller 235. It directs that data to the appropriate one of the resources 231-234 by performing a look-up in the table 237 to determine the identity of the resource to which the sub-page address has been allocated. An application can also access a resource by means other than a bus write: for example by means of direct memory access (DMA). In those instances, the NIC 116 checks that the identity of the application/process from which the access has been received matches the identity indicated in the table 237 for the resource. If it does not match, the data is ignored. If it matches, it is passed to the relevant resource. This adds to security and helps to prevent corruption of the resources by other applications.
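The two access paths just described, a plain bus write routed by sub-page address and an access that is additionally checked against the owner recorded in the table, might be modelled as follows. The class and method names are hypothetical, and the table is simplified to a map from sub-page address to a (resource ID, process tag) pair.

```python
from typing import Dict, List, Optional, Tuple


class BusInterfaceController:
    """Hypothetical model of routing by look-up in the bus mapping table."""

    def __init__(self, table: Dict[int, Tuple[str, str]]) -> None:
        # sub-page address -> (resource_id, process_tag)
        self.table = dict(table)
        self.delivered: List[Tuple[str, bytes]] = []

    def bus_write(self, sub_page: int, data: bytes) -> Optional[str]:
        # Route a write to whichever resource owns the sub-page address.
        row = self.table.get(sub_page)
        if row is None:
            return None
        self.delivered.append((row[0], data))
        return row[0]

    def tagged_access(self, sub_page: int, data: bytes,
                      sender_tag: str) -> Optional[str]:
        # For accesses by other means, the sender's identity must match the
        # identity recorded for the resource; otherwise the data is ignored.
        row = self.table.get(sub_page)
        if row is None or row[1] != sender_tag:
            return None
        self.delivered.append((row[0], data))
        return row[0]
```

Returning `None` for a mismatch corresponds to the data being ignored rather than an error being raised, which is how the paragraph above describes the behaviour.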
The set of resources allocated to an application or process may be considered to constitute a virtual network interface (VNIC).
Once a virtual interface has been composed, it may be reconfigured dynamically. As one example of dynamic reconfiguration, a resource that is no longer required may be freed-up. To achieve this the application using the resource calls a de-allocation routine in the user level transport library 224. The de-allocation routine calls the kernel driver 225, which instructs the RCU to de-allocate the resource by disabling it, clearing its status and deleting its row in the table 237.
As another example of dynamic reconfiguration, additional resources may be added to the VNIC. The process is analogous to that described above for initial composition of the VNIC.
As yet another example of dynamic reconfiguration, resources may be passed from one application or process to another. This is most useful in the situation where a single application has multiple processes and wants to pass control of a resource from one process to another, for example if data from the network is to be received into and processed by a new process. To achieve this the application using the resource calls a re-allocation routine in the transport library 224. The re-allocation routine calls the kernel driver 225, which instructs the RCU to re-allocate the resource by modifying its row in the table 237 to specify the identity of the application or process that is taking over its control.
In some instances it may be desirable for resources of one type to communicate with resources of another type. For example, data received from the network 240 may be being passed to an application 222 for processing. The application has a queue 226 in a memory 227 connected to the bus 118. The queue is managed in part by the transport library 223, which provides a DMA queue resource 231 on the NIC 116 with an up-to-date pointer to the next available location on the queue 226. This is updated as the application reads data from the queue 226. When data is received from the network it is passed to an event queue resource 232, which writes it to the location identified by the pointer and also triggers an event such as an interrupt on the computing device 114 to indicate that data is available on the queue. In order for this to happen the event queue resource 232 must learn the pointer details from the DMA queue resource 231. This requires data to be passed from the DMA queue resource to the event queue resource.
To achieve this the “process tag” column of the table 237 can be treated more generally as an ownership tag, and can link the DMA queue to the related event queue. Specifically, the ownership tag of the event queue can be set to the identity of the related DMA queue. When the DMA queue needs to pass data to the related event queue it can identify the event queue from the table 237 by performing a look-up on its own identity in the ownership tag column.
Data intended to be passed from one resource to another can be checked by the bus controller 235 to ensure that it is compatible with the settings in the table 237. Specifically, when data is to be sent from one resource to another the bus controller checks that there is a row in the table 237 that has the identity of the resource that is the source of the data in the ownership tag field, and the identity of the resource that is the intended destination of the data in the resource ID field. If there is no match then the data is prevented from reaching its destination. This provides additional security and protection against corruption. Alternatively, or in addition, it may be permitted for one resource to transmit data to another if both are in common ownership: in this example if their resource ID fields indicate that they are owned by the same process, application or other resource.
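The check performed by the bus controller on resource-to-resource transfers might be sketched as follows. This is an illustrative assumption: the table is reduced to a mapping from resource ID to ownership tag, and the function name `transfer_allowed` and the flag `allow_common_owner` are hypothetical.

```python
def transfer_allowed(table, source_id, dest_id, allow_common_owner=False):
    """table maps resource ID -> ownership tag (a simplified table 237)."""
    # Primary rule: the destination's row carries the source's identity
    # in its ownership tag field.
    if table.get(dest_id) == source_id:
        return True
    # Optional alternative rule: both resources are in common ownership.
    if allow_common_owner and source_id in table and dest_id in table:
        return table[source_id] == table[dest_id]
    return False
```

For example, with `{"evq": "dmaq", "dmaq": "proc1"}` a transfer from `"dmaq"` to `"evq"` is permitted, because the event queue's ownership tag names the DMA queue.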
The identities of resources linked in this way can also be reconfigured dynamically by means of the re-configuration routines in the transport library.
FIG. 3 is an example broad outline of steps that an application process might follow in the system of FIGS. 1 and 2 for the creation and transmission of IP packets onto the network 112. This flowchart covers only the TCP and UDP transport level protocols; other embodiments can support other protocols, including but not limited to SCTP, RTP, ICMP and IGMP.
In a step 310, when the application first starts up, its libraries are initialized. This includes the user level transport library 224, which is initialized into the application's virtual address space.
Step 312 begins an example sequence of steps in which the application process uses a UDP transport protocol. In step 312 the application makes a call to the socket( ) routine of the user level transport library 224, specifying that it would like a UDP socket. In step 314, the application process binds the socket to a port using a call to the bind( ) routine of the user level transport library 224, and in step 316, it begins writing its transmit data into the application's buffers in the application's virtual address space. In step 318, after sufficient data has been written into the buffers for one or more data packets, the application process makes a call to the sendTo( ) routine of the user level transport library 224, specifying the socket handle, the buffer or buffers, the destination IP address and the destination port to which the packet is to be sent. Steps 316 and 318 are repeated many times, most likely interspersed with many other functions performed by the application process. When the application has finished with the socket that it had created in step 312, then in step 320, the application makes a call to the close( ) routine of the user level transport library 224 in order to close the socket.
Alternatively to the UDP sequence beginning with step 312, step 322 begins an example sequence of steps in which the application process uses a TCP transport protocol. In step 322, instead of calling the socket routine of the user level transport library 224 to specify the UDP protocol, it calls the socket routine to specify the TCP protocol. In step 324 the application process calls the bind( ) routine similarly to step 314, in order to bind the socket to a port. In step 326, since the transport protocol is now TCP, the application process calls the connect( ) routine of the user level transport library 224, in order to form a TCP connection with a specified destination IP address and port. In step 328 the application process writes transmit data into buffers in the application program's virtual address space, similarly to step 316, and in step 330, when ready, the application process calls the send( ) routine of the user level transport library 224 in order to have the data packet processed according to the TCP protocol and transmitted out to the network via network interface card 116. Again, steps 328 and 330 can be repeated many times, and when the application process has finished with the socket, it calls the close( ) routine of the user level transport library 224 (step 332).
As can be seen, of all the steps illustrated in FIG. 3, only the step 310 of initializing the transport library 224 need involve a context switch to a kernel level process. In many embodiments, all of the remaining steps can be performed by the user level transport library 224 without involvement of the kernel driver 225. While this feature can help improve performance markedly, it also creates a risk that non-standard or third-party transport libraries will be installed for the application program in place of trusted code. As will be seen, the network interface card 116 itself protects against one or more of the risks that might arise.
FIG. 4 is a flowchart of steps that might be performed during the library initialization step 310 of FIG. 3. In step 410, the user level transport library 224 is loaded into the application's virtual address space. In a conventional system, the transport library loaded in step 410 might be a very thin layer (or incorporated within another general purpose library such as glibc), which does little more than make calls to the kernel driver 225 and return any results to the application. In the system of FIGS. 1 and 2, however, the transport library loaded in step 410 is more sophisticated in that it includes the user level routines described herein. To the extent that the user level transport library loaded in step 410 bypasses or performs functions differently than as described herein, the system can protect against certain kinds of errors.
In step 412, as part of the initialization of the user level transport library, a resource allocation routine in the kernel driver 225 is invoked. The kernel level routine is required for allocating resources in the network interface card and the host memory subsystem 122, since these resources are outside the virtual address space of the application, or involve direct hardware accesses that advisedly are restricted to kernel processes. After resource allocation, the user level driver initialization routine 310 may perform a number of other steps before it returns to the application in step 414.
The kernel resource allocation routine 412 allocates memory and an initial set of resources for the application program, and maps these into the application's virtual address space. Before discussing the particular steps performed by the kernel resource allocation routine 412, it will be useful to understand some of the formats in which the system maintains its queue structures.
FIG. 5 is a block diagram of various data structures used by the system of FIGS. 1 and 2 in order to support separate transmit and receive queues for each of the VNICs. The diagram indicates which structures exist in host memory 122 and which exist on the NIC 116. The transmit and receive data buffers, the transmit and receive DMA descriptor queues, as well as one or more event queues, are all resident in host memory 122 and made up of generalized buffers which can be discontiguous and interspersed with each other in host memory 122. In FIG. 5, the buffers being used as transmit data buffers are identified as “TX DATA BUF #n”, and the buffers being used for a transmit queue are identified as “TX QUEUE BUF #n”. The buffers being used for the event queue are identified as “EV QUEUE BUF #n”. Additional buffers in host memory 122, not shown explicitly in FIG. 5, are used for receive data buffers and for a receive queue. One process may have any number of transmit, receive and event queues, and all of them share the pool of generalized buffers that have been mapped into that process's virtual address space. Other processes use a different pool of generalized buffers mapped into their respective virtual address spaces.
Individual buffers may be either 4 k or 8 k bytes long in one embodiment, and they are chained together into logically contiguous sequences by means of physically contiguous descriptors in a buffer descriptor table 510 stored in the NIC 116. For example, one transmit queue might occupy buffers 512, 514 and 516 in host memory 122, which are discontiguous and possibly out-of-order regions of memory. They are chained together into a single logically contiguous space by the physically contiguous entries 518, 520 and 522 in the buffer descriptor table 510. The entries 518, 520 and 522 are written and managed by the host 114 and are viewed as a wrap-around ring. So for example, if the host wishes to define a transmit buffer list 312 having 64 k entries for transmit data buffer descriptors, and each buffer is 4 k in size, then the host will allocate a physically contiguous sequence of 16 entries in buffer descriptor table 510 for this transmit buffer list. Similarly, one event queue might occupy buffers 526, 528 and 530 in host memory 122. These buffers are discontiguous and possibly out-of-order in host memory, but are chained together into a single logically contiguous wrap-around space by the physically contiguous entries 532, 534 and 536 in the buffer descriptor table 510. The buffer descriptor table 510 is indexed by “buffer ID”, and each of its entries identifies, among other things, the base address of the corresponding buffer in host memory 122.
In order to keep track of the state of each of the transmit, receive and event queues for the many user-level applications that might be in communication with NIC 116 at the same time, the NIC 116 includes a transmit queue descriptor table 540, a receive queue descriptor table 541, and an event queue descriptor table 542. Each transmit queue has a corresponding transmit queue ID, which is used as an index into the transmit queue descriptor table 540. The designated entry in the transmit queue descriptor table 540 is the starting point for describing the state and other characteristics of that particular transmit queue, as viewed by the NIC 116. Each such entry identifies, among other things:
    • whether the queue is a kernel queue, user queue or another kind of queue;
    • the size of the queue (number of transmit data buffer descriptors it can contain);
    • the ID of the event queue associated with this transmit queue;
    • buffer ID of base buffer for this transmit queue;
    • “device centric” read and write pointers into this transmit queue.
      The host 114 maintains “host centric” versions of the read and write pointers as well, and when it is ready to have transmit data transmitted, it so notifies the NIC 116 by writing its updated host centric transmit queue write pointer into the address on the NIC of the device centric transmit queue write pointer for the particular transmit queue.
In order to retrieve current transmit data from a particular transmit queue in host memory 122, the NIC 116 first uses the ID of the particular transmit queue to look up, in the transmit queue descriptor table 540, the buffer ID of the base buffer containing the transmit descriptor queue. The NIC 116 also obtains from the same place, the current device centric buffer list read pointer into that transmit descriptor queue. It then uses the base buffer ID as a base, and the device centric buffer list read pointer high order bits as an offset, into the buffer descriptor table 510, to obtain the base address in host memory 122 of the buffer that contains the particular transmit buffer list 312. The NIC then uses that base address as a base, and the device centric buffer list read pointer low order bits times the number of bytes taken up per descriptor as an offset, to retrieve from host memory 122 the current entry in the particular transmit descriptor queue.
The current entry in the particular transmit descriptor queue contains, among other things:
    • the buffer ID of the current transmit data buffer;
    • a byte offset into the current transmit data buffer; and
    • a number of bytes to be transmitted from the current transmit data buffer.
      The NIC 116 then uses the buffer ID of the current transmit data buffer as another index into buffer descriptor table 510 to retrieve the buffer descriptor for the buffer that contains the current transmit data. Note this buffer descriptor is an individual entry in buffer descriptor table 510; unlike the descriptors for buffers containing transmit queues or transmit event queues, this buffer descriptor is not part of a ring. The NIC 116 obtains the physical address in host memory 122 of the current transmit data buffer, and then using that physical address as a base, and the byte offset from the transmit descriptor queue entry as an offset, it determines the physical starting address in host memory 122 of the current data to be transmitted.
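The two address calculations just described, first locating the current transmit descriptor and then locating the data it points to, can be sketched as below. This is a simplified model under stated assumptions: 4 k buffers, 8-byte descriptors, a buffer descriptor table reduced to a mapping from buffer ID to host base address, and hypothetical function and field names.

```python
BUFFER_SIZE = 4096                             # assumed 4 k buffers
DESC_SIZE = 8                                  # assumed bytes per transmit descriptor
DESCS_PER_BUFFER = BUFFER_SIZE // DESC_SIZE    # 512 descriptors per buffer

def descriptor_address(buf_table, base_buffer_id, read_ptr):
    """Host address of the descriptor selected by the device centric read pointer."""
    high = read_ptr // DESCS_PER_BUFFER    # high order bits: which buffer of the ring
    low = read_ptr % DESCS_PER_BUFFER      # low order bits: which descriptor in it
    return buf_table[base_buffer_id + high] + low * DESC_SIZE

def data_address(buf_table, descriptor):
    """Host address of the first byte to transmit for one descriptor."""
    # The data buffer's entry is looked up individually; it is not part of a ring.
    return buf_table[descriptor["buffer_id"]] + descriptor["byte_offset"]
```

With a table `{4: 0x20000, 5: 0x8000}` and base buffer ID 4, read pointer 513 selects the second buffer of the ring and the second descriptor within it.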
The system handles receive queues in a similar manner.
The transmit queue descriptor table 540 entry designated by the transmit queue ID, as previously mentioned, also contains the ID of the transmit event queue associated with the particular transmit queue. Similarly, the receive queue descriptor table 541 entry designated by the receive queue ID contains the ID of the event queue associated with the particular receive queue. All of the event queues for all the applications 222 are described by respective entries in the event queue descriptor table 542. The entry in the event queue descriptor table 542 identified by a queue ID from the transmit or receive queue descriptor table 540 or 541 is the starting point for describing the state and other characteristics of that particular event queue, as viewed by the NIC 116.
Note that as illustrated in FIG. 5, whereas each slot (e.g. 532, 534, 518) shown in the buffer descriptor table 510 represents a single descriptor, each slot (e.g. 526, 528, 514) in the host memory 122 represents a memory “page” of information. A page might be 4 k or 8 k bytes long for example, so if a transmit data buffer descriptor in a transmit queue occupies either 4 or 8 bytes, then each slot 512, 514 or 516 as shown in FIG. 5 might hold 512, 1 k or 2 k transmit data buffer descriptors.
FIG. 6 is a flowchart illustrating significant steps performed by the kernel resource allocation routine 412. If the user level transport library 224 fails to call this routine, then the addresses that the user level transport library 224 would have to access in order to bypass subsequent trusted kernel routines will not be accessible to the user level process, since they will not have been mapped into the application's virtual address space. In step 610, the kernel begins by allocating memory for the generalized buffers that will be used to hold the transmit, receive and event queues. It then maps the buffers into the application's virtual address space so that the application can read and write to them directly. In step 612, the kernel routine installs descriptors for these buffers in the buffer descriptor table 510.
In step 614, the kernel routine allocates a minimum set of the buffers for each of the transmit, receive and event queues requested, and programs their buffer IDs into the transmit, receive and event queue descriptor tables 540, 541 and 542. In step 616, the kernel routine determines the “doorbell” address in the NIC 116 for each of the transmit and receive queues, and maps them as well into the application's virtual address space. The doorbell address is the address to which the application will write a value, in order to notify the NIC either that a transmit buffer is ready or that a receive buffer can be released. For transmit queues, the doorbell address is the address of the device centric transmit queue write pointer in the transmit queue descriptor table 540 entry for the particular transmit queue. For receive queues, the doorbell address is the address of the device centric receive queue write pointer in the receive queue descriptor table 541 entry for the particular receive queue.
In step 618, the kernel routine programs into the NIC 116 certain access rights (authorization rights) that are to be associated with the particular transmit queue. These are the authorization rights to which the NIC 116 will look in order to determine whether a particular sending process has authority to send packets having certain characteristics, and the kernel routine programs them in dependence upon the privilege level of the process that made the kernel resource allocation call. Note that although the privilege level of an application process running in the computer system 114 is maintained on a per-process basis, the authorization rights are maintained on the NIC 116 on a per-queue basis. This enables the NIC 116 to validate outgoing transmit packets without having to know anything about the particular operating system running in the host computer or the privilege mechanisms that it uses.
In different embodiments, the NIC can refer to different characteristics of a transmit data packet in order to determine whether the transmit queue has sufficient authority to send it. In one embodiment, the NIC checks only whether the packet is formed according to an allowed transport protocol. For example, in one embodiment user level processes may be permitted to send packets using only the TCP or UDP transport protocols, and no others. Each data packet has a header field which identifies the transport protocol according to which it was formed, and the NIC can compare that protocol number with those that have been programmed into the NIC 116 as being allowed for the transmit queue from which the data packet was retrieved.
If the total array of transport protocols supported by the NIC 116 is short enough, then a field may be allocated in each entry of the transmit queue descriptor table 540 for identifying the allowed protocols. For example, if only eight protocols are supported, an eight-bit field might be used, with each bit representing one of the protocols. If a bit is active, then the corresponding transport protocol is allowed; if it is inactive, then it is not.
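A bit-field of this kind might look like the following sketch. The assignment of bit positions to protocols is purely an assumption for illustration; the keys are the standard IP protocol numbers for TCP, UDP, ICMP, IGMP and SCTP.

```python
# Hypothetical bit positions, keyed by IP protocol number.
PROTOCOL_BIT = {6: 0, 17: 1, 1: 2, 2: 3, 132: 4}   # TCP, UDP, ICMP, IGMP, SCTP

def make_mask(allowed_protocols):
    """Build the per-queue field with one active bit per allowed protocol."""
    mask = 0
    for protocol in allowed_protocols:
        mask |= 1 << PROTOCOL_BIT[protocol]
    return mask

def protocol_allowed(mask, protocol):
    """True only if the protocol is supported and its bit is active."""
    bit = PROTOCOL_BIT.get(protocol)
    return bit is not None and bool(mask & (1 << bit))
```

A queue restricted to TCP and UDP would then carry the mask `make_mask([6, 17])`, and a packet whose protocol number has no assigned bit is treated as not allowed.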
Alternatively, the allowed protocols may be listed in a separate authorizations “database” maintained in the NIC 116, such as that shown in FIG. 7. In FIG. 7, the authorizations database takes the form of a table in which each entry contains a queue ID and an indication of an allowed protocol for that queue. If several different protocols are allowed for a particular queue, then the queue ID appears in several different entries in the table. When the NIC 116 is checking the validity of a transmit packet from a particular transmit queue, it searches the table for an entry that contains both the transmit queue ID and the transport protocol according to which the packet was formed (retrieved from the packet header). If the table does contain such an entry, then the packet is valid. If not, then the packet is rejected. (As used herein, the term “database” does not necessarily imply any unity of structure. For example, two or more separate databases, when considered together, still constitute a “database” as that term is used herein.)
In other embodiments, the NIC can validate other characteristics of a transmit data packet. For example it can validate the source IP address, the source port number, the destination IP address and destination port number, either instead of or additionally to the allowed protocols. FIG. 8 illustrates an example authorizations database that can support validating all such characteristics. As shown in FIG. 8, each entry in the database table contains six fields: source IP address, source port number, destination IP address, destination port number, queue ID and allowed protocol number. In order to keep the table short, some of these fields may be filled with indications of numeric ranges, rather than only a specific number. For example, in some systems only privileged processes can transmit packets indicating that they were sourced from a port number in the range 0-1023. In this case, the “source port” field of the table of FIG. 8 might contain only a single bit indicating whether source port numbers within the range 0-1023 are permitted. As with an embodiment using the FIG. 7 table, when the NIC 116 is checking the validity of a transmit packet from a particular transmit queue, it searches the table of FIG. 8 for a single entry that contains (or includes, if numeric ranges are specified) the source IP address, the source port number, destination IP address, destination port number and the allowed protocol (all taken from the packet header), as well as the ID of the transmit queue from which the packet was retrieved. If the table does contain such an entry, then the packet is valid. If not, then it is rejected.
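A table search of the kind described for FIG. 8 can be sketched as follows. The representation is an assumption made for illustration: address fields are expressed as network prefixes and port fields as ranges, and the names `AuthEntry` and `packet_authorized` are hypothetical.

```python
from dataclasses import dataclass
from ipaddress import ip_address, ip_network

@dataclass(frozen=True)
class AuthEntry:
    """One hypothetical row of a FIG. 8 style authorizations table."""
    queue_id: int
    protocol: int
    src_net: str        # e.g. "10.0.0.5/32" for one address
    src_ports: range
    dst_net: str        # e.g. "0.0.0.0/0" for any IPv4 address
    dst_ports: range

    def matches(self, queue_id, protocol, src_ip, src_port, dst_ip, dst_port):
        # Every field must match (or fall within the stated range).
        return (queue_id == self.queue_id
                and protocol == self.protocol
                and ip_address(src_ip) in ip_network(self.src_net)
                and src_port in self.src_ports
                and ip_address(dst_ip) in ip_network(self.dst_net)
                and dst_port in self.dst_ports)

def packet_authorized(table, **packet_fields):
    """The packet is valid only if a single entry matches all of its fields."""
    return any(entry.matches(**packet_fields) for entry in table)
```

An entry whose source port range begins at 1024 expresses the example given above, in which an unprivileged queue may not send from ports 0-1023.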
Returning to FIG. 6, after the kernel resource allocation routine programs the authorization rights for the transmit queue into the NIC 116, it returns to the application with handles for the resources allocated, with the base virtual addresses of the transmit, receive and event queues, and virtual memory addresses corresponding to the doorbells allocated in the transmit and receive queue descriptor tables 540 and 541 (step 620).
FIG. 9 is a flowchart of the user level routine 900 in the transport library 224 for establishing a socket. The socket routine is called in both steps 312 and 322 of FIG. 3. In step 910, the routine first determines whether the user level transport library 224 contains the code necessary to support the specified protocol. If not, then in step 912, it makes a system call to pass the request on to the kernel to handle. If it does support the specified protocol, then in step 914 it allocates its internal state for the new socket. In step 916 the routine returns to the application program with a handle for the new socket. Note that in a conventional system, in which the socket routine invokes a kernel level process, the kernel may check at this time whether the calling process has authority to use the specified protocol number. This call, and the accompanying context switches, are unnecessary in the system of FIGS. 1 and 2 since any illegal protocols will be detected and rejected downstream by the NIC 116.
FIG. 10 is a flowchart of the user level routine 1000 in the transport library 224 for binding a port number to a socket. The bind routine is called in both steps 314 and 324 of FIG. 3. In step 1010, the routine first determines whether the caller provided a port number. If not, then in step 1012, a port number is assigned. If the caller did specify a port number, then in step 1014, the routine determines whether the port number is legal for the current user process.
At least four different mechanisms might be used in the same or different embodiments to ensure that different processes do not interfere with each other's use of particular IP address/port number combinations, and that user processes do not improperly operate through physical or logical port numbers or other resources that should be reserved for the kernel. In one mechanism, a system-wide policy exists which allocates all port numbers within a particular range to the user stack only. The user level bind( ) routine can be designed to immediately accept only those requests from a user level process to bind to one of such port numbers, or to immediately pass such requests on to the kernel to handle.
In a second mechanism, during the resource allocation step 412, performed during initialization of a particular instance of the user level driver, the kernel allocates a unique IP address for that instance to use as it wishes. If each instance of the user level driver has its own exclusively assigned IP address, then the instance can manage the available port numbers for use with that IP address without risking interference with any other process. This mechanism is useful only if there are sufficient numbers of IP addresses available to the computer system to allocate to the various requesting processes. Again, this mechanism can be used in conjunction with the first, to reject or pass on to the kernel all user level requests to bind to a kernel-only port number, regardless of the exclusivity of an assigned IP address.
In a third mechanism, again during initialization of a particular instance of the user level driver, the initialization routine makes a number of anticipatory bind( ) calls to the kernel in order to form a pool of port numbers that the user level driver instance can later allocate to the application program upon receipt of bind( ) calls to the user level driver. This mechanism can succeed with far fewer IP addresses available to the computer system, but also undesirably involves a context switch (during library initialization) for each port number to be added to the pool.
In yet a fourth mechanism, no IP address/port number combinations are pre-allocated to the particular instance of the user level driver. Instead, the user level bind( ) routine invokes the kernel bind( ) routine for each user level bind( ) call received. This mechanism utilizes IP address/port number combinations most conservatively, but may require more context switches than any of the first, second and third mechanisms. In an embodiment, this fourth mechanism is used only as a backup, for example if the user level process requires more port numbers than were made available using the anticipatory bind( ) calls in the third mechanism.
If in step 1014 the user level bind( ) routine determines that the requested port number is not available to the current instance of the user level driver, or otherwise cannot determine whether it is available, then in step 1016, the routine makes a call to the kernel bind( ) routine to pass the request on to the kernel to handle. If the fourth mechanism above is the only way that the particular embodiment avoids conflicting or illegal allocation of address/port number combinations, then step 1016 will be taken during every user level call to the bind( ) routine 1000. Otherwise, step 1016 will be taken only as a backup if pre-allocated port numbers have been exhausted, or if the routine otherwise cannot determine that the requested port number is available.
If the specified port number is legal, or if a port number was assigned by the routine in step 1012, then in step 1018 the routine updates the application's state internally, to bind the port number with the specified socket. The routine returns to the caller in step 1020.
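The flow of FIG. 10, combined with the third mechanism (a pool of port numbers obtained in advance) and the fourth mechanism as a backup, might be sketched as follows. The class `UserLevelBind`, the `kernel_bind` callable and the choice of the lowest pooled port for unspecified requests are all assumptions made for illustration.

```python
class UserLevelBind:
    """Hypothetical user level bind( ) with a pre-allocated port pool."""

    def __init__(self, preallocated_ports, kernel_bind):
        self.pool = set(preallocated_ports)   # e.g. from anticipatory bind( ) calls
        self.kernel_bind = kernel_bind        # stand-in for the kernel bind( ) routine
        self.bound = {}                       # socket handle -> port number

    def bind(self, sock, port=None):
        if port is None:                      # step 1012: assign a port number
            if self.pool:
                port = min(self.pool)
                self.pool.remove(port)
            else:                             # pool exhausted: fall back to the kernel
                port = self.kernel_bind(sock, None)
        elif port in self.pool:               # step 1014: port is known to be legal
            self.pool.remove(port)
        else:                                 # step 1016: pass the request to the kernel
            port = self.kernel_bind(sock, port)
        self.bound[sock] = port               # step 1018: update internal state
        return port                           # step 1020
```

In this sketch a context switch to the kernel is incurred only when the requested port lies outside the pool or the pool has been exhausted.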
Although the user level bind routine of FIG. 10 attempts to prevent the allocation of problematical port numbers to user level processes as described above, it will be appreciated that a different implementation of the user level bind routine may not be as careful. It is a feature of the invention that the NIC 116 can be designed to detect and reject transmit packets which do designate a source port number that is illegal, or for which the sending process lacks sufficient privilege to use. Therefore, no damage will occur if transport library routines are used which do not follow the steps set forth, as long as any errors they cause are of a type that the NIC 116 is designed to detect downstream.
FIG. 11 is a flowchart of the user level sendTo routine called in step 318 of FIG. 3. This routine is called for UDP protocol packets for which no connection to a destination IP address or port is established. The routine therefore takes as arguments not only the socket handle and the data buffer address, but also the destination IP address and destination port number. Referring to FIG. 11, in step 1106, the routine first determines whether there are known improper arguments. If so, then in step 1108, it invokes the kernel or returns an error. If not, then in step 1110, the routine first applies the transport protocol, forming a UDP header and an IP header. It then forms a transmit descriptor pointing to the headers and to the data buffer identified by the caller. In step 1112, the routine pushes the transmit descriptor onto the transmit queue by writing the transmit descriptor into the next available entry in transmit queue buffers 512, 514 or 516 (FIG. 5). The routine then updates its own host centric transmit queue write pointer. In step 1114, the routine writes its updated transmit queue write pointer into the NIC 116 at the doorbell address that had been previously associated with the current transmit queue. In step 1116, the routine returns to the caller.
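Steps 1110 through 1114 can be sketched as below. This is a simplified model, not the library's implementation: the ring is a Python list standing in for the transmit queue buffers, the doorbell is an attribute standing in for the device centric write pointer on the NIC, and header formation and queue-full handling are omitted. All names are hypothetical.

```python
class TransmitQueue:
    """Hypothetical host-side view of one transmit queue."""

    def __init__(self, size):
        self.size = size
        self.ring = [None] * size      # stands in for the transmit queue buffers
        self.host_write_ptr = 0        # host centric write pointer
        self.device_write_ptr = 0      # stands in for the doorbell location on the NIC

    def send_to(self, headers, buffer_id, byte_offset, length):
        # Step 1110: form a descriptor pointing to the headers and the data buffer.
        descriptor = {"headers": headers, "buffer_id": buffer_id,
                      "byte_offset": byte_offset, "length": length}
        # Step 1112: push it into the next available entry, then advance the
        # host centric write pointer with wrap-around.
        self.ring[self.host_write_ptr] = descriptor
        self.host_write_ptr = (self.host_write_ptr + 1) % self.size
        # Step 1114: write the updated pointer to the doorbell address.
        self.device_write_ptr = self.host_write_ptr
```

Every operation in the sketch is an ordinary memory write, which reflects the point that no kernel call is needed on this path.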
Note that all the steps of FIG. 11 take place entirely within the virtual address space of the current user level process. There is no need to copy data into the kernel address space, nor is there any need to perform a context switch to a kernel process either to perform the protocol processing, to enqueue the new UDP packet, or to notify the NIC 116 of the availability of a new packet in the transmit queue. Additionally, as with the user level socket and bind routines, a user level transport library routine which does not perform the steps faithfully as set forth in FIG. 11 will not cause the transmission of malformed or illegal packets to the extent the NIC 116 is designed to reject them during the validation process performed on the NIC 116.
FIG. 12 is a functional block diagram of certain structures on the NIC 116 which can be used to perform packet validation for transmission of data packets. The structures include a DMA controller 1210 which controls the retrieval of transmit data packets from the host memory subsystem 122. Data packets, as they arrive, are enqueued into a transmit FIFO 1214. From the head of the transmit FIFO 1214, data packets are presented to a physical interface 1216 which performs any further protocol processing (such as appending an Ethernet header) and writes them out onto the network 112. The transmit FIFO 1214 also has associated therewith FIFO control logic 1218, which includes read and write pointers into the transmit FIFO 1214 as well as other functions described hereinafter. The NIC 116 also includes header validation logic 1220 which observes the contents of data packets as they arrive from the bus 118 and are placed into the transmit FIFO 1214. The header validation logic 1220 makes reference to the authorizations database 1222 described previously.
In operation, as a data packet is received from the bus 118, the first section received is the IP header. This section contains the source and destination IP addresses, as well as an identification of the transport layer protocol. Next comes the transport layer header, which contains the source and destination port numbers. Based on this information, the header validation logic 1220 compares the data packet characteristics to those in the authorizations database 1222 to determine whether the data packet arriving from the bus 118 is authorized. If it is not, then the header validation logic 1220 can cause the DMA controller 1210 to abort the current transfer, and can also cause the FIFO control logic 1218 to unwind its write pointer back to the end of the previous packet in the transmit FIFO 1214.
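The abort-and-unwind behaviour might be modelled as in the following sketch, which checks only the protocol field of an IPv4 header (byte 9) as the packet arrives in chunks. The chunk size, the class `TransmitFifo` and the function `transfer_packet` are assumptions for illustration; a fuller check would also consult the addresses and port numbers.

```python
class TransmitFifo:
    """Hypothetical transmit FIFO with a rewindable write pointer."""

    def __init__(self):
        self.data = bytearray()
        self.committed = 0     # write pointer position at the end of the previous packet

    def unwind(self):
        # Discard everything written since the end of the previous packet.
        del self.data[self.committed:]

    def commit(self):
        self.committed = len(self.data)

def transfer_packet(fifo, packet, allowed_protocols, chunk_size=16):
    """Copy a packet into the FIFO in chunks, aborting if its header is rejected."""
    checked = False
    for start in range(0, len(packet), chunk_size):
        fifo.data += packet[start:start + chunk_size]
        arrived = len(fifo.data) - fifo.committed
        if not checked and arrived >= 10:      # protocol field has now arrived
            checked = True
            if fifo.data[fifo.committed + 9] not in allowed_protocols:
                fifo.unwind()                  # transfer aborted, pointer rolled back
                return False
    if not checked:                            # too short to contain the protocol field
        fifo.unwind()
        return False
    fifo.commit()
    return True
```

Because validation happens while the packet is still arriving, a rejected packet never occupies FIFO space beyond the moment of rejection.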
FIG. 13 is a flowchart of typical steps taken on the NIC 116 for transmission of data packets arriving from multiple queues. Initially, the NIC implements an algorithm for choosing among the multiple transmit queues for the next queue to service. This algorithm is unimportant for an understanding of the invention, and therefore is not described. In step 1310, the algorithm selects one particular transmit queue to examine.
In step 1312, the NIC 116 determines whether the device centric write pointer for the current transmit queue modulo-exceeds the device centric read pointer for the current transmit queue. These values are available to the NIC in the transmit queue descriptor table 540 entry for the current transmit queue, and the test will be positive if one of the transport libraries 224 or 225 has updated the device centric write pointer to notify the NIC of the availability of a data packet for transmission. The term “modulo-exceeds” is used herein to accommodate wrap-around (circular) queues. That is, the device centric write pointer “modulo-exceeds” the device centric read pointer for a queue if the write pointer exceeds the read pointer, modulo the queue length.
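One possible reading of the “modulo-exceeds” test is sketched below. It assumes that both pointers are slot indices in the range 0 to queue length minus one, and that the queue is never allowed to hold a full queue length of outstanding entries (otherwise a full queue would be indistinguishable from an empty one). The function names are hypothetical.

```python
def pending_entries(write_ptr: int, read_ptr: int, queue_len: int) -> int:
    """Number of entries between the read and write pointers, allowing for wrap-around."""
    return (write_ptr - read_ptr) % queue_len


def modulo_exceeds(write_ptr: int, read_ptr: int, queue_len: int) -> bool:
    """True if the write pointer is ahead of the read pointer, modulo the queue length."""
    return pending_entries(write_ptr, read_ptr, queue_len) != 0
```

For example, in an 8-entry queue a write pointer of 2 modulo-exceeds a read pointer of 5, because the write pointer has wrapped around and five entries are pending.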
If the test of step 1312 is negative, then in step 1314, the NIC 116 proceeds to examine the next transmit queue according to its algorithm.
If the test of step 1312 is positive, then in step 1316, the NIC 116 reads one or more transmit descriptors from the current transmit queue, beginning at the entry pointed to by the device centric read pointer. In step 1318, the NIC 116 programs the DMA controller 1210 to retrieve the packet from host memory 122 into transmit FIFO 1214. In step 1320, during the retrieval process, the NIC examines the header information on the packet as it is being retrieved, and tests the current queue's authority to send packets having the characteristics of that being retrieved. If the NIC 116 determines that the packet is authorized (step 1322), then in step 1324, after packet retrieval, the NIC 116 will update its device centric transmit queue read pointer. The NIC 116 then writes a transmit completion event into the event queue associated with the current transmit queue, for eventual retrieval by the user level process. In some embodiments, the NIC 116 might wait to complete retrieval of a number of transmit data packets before writing a “batched” transmit completion event covering all of them. The process then returns to step 1310 for the queue selection algorithm to select the same or another transmit queue. Eventually, in step 1328 the NIC 116 transmits the packet from the head of the transmit FIFO 1214 out onto the network 112.
If in step 1322 it is determined that the current packet is not authorized to be sent from the current transmit queue, then in step 1326, the header validation logic 1220 (FIG. 12) causes the DMA controller 1210 to abort the current transfer, thereby freeing up the bus 118. It also notifies the FIFO control logic 1218 to unwind the transmit FIFO queue write pointer as previously described. The NIC 116 may also report an error back to the application program.
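The unwinding of the transmit FIFO write pointer can be illustrated with a small software model. This is only a sketch under stated assumptions: the class `TransmitFifo` and its methods are hypothetical, the pointers are modeled as ever-increasing byte counters reduced modulo the capacity, and packet boundaries within the FIFO are not tracked.

```python
class TransmitFifo:
    """Byte FIFO whose write pointer can be rolled back to the end of the last good packet."""

    def __init__(self, capacity: int) -> None:
        self._buf = bytearray(capacity)
        self._capacity = capacity
        self._read = 0       # next byte to hand to the physical interface
        self._write = 0      # next free byte; may cover a packet still being retrieved
        self._committed = 0  # end of the last packet that passed validation

    def free_space(self) -> int:
        return self._capacity - (self._write - self._read)

    def append(self, chunk: bytes) -> None:
        """Store bytes of the packet currently arriving from the bus."""
        if len(chunk) > self.free_space():
            raise BufferError("transmit FIFO full")
        for byte in chunk:
            self._buf[self._write % self._capacity] = byte
            self._write += 1

    def commit(self) -> None:
        """Packet was authorized: make its bytes visible to the reader."""
        self._committed = self._write

    def unwind(self) -> None:
        """Packet was rejected: discard its bytes by restoring the write pointer."""
        self._write = self._committed

    def pop(self, max_bytes: int) -> bytes:
        """Remove up to max_bytes of committed data from the head of the FIFO."""
        n = min(max_bytes, self._committed - self._read)
        out = bytes(self._buf[(self._read + i) % self._capacity] for i in range(n))
        self._read += n
        return out
```

Because the reader in this model only ever sees committed bytes, a partially retrieved packet that fails validation never reaches the output side, and the space it occupied is simply overwritten by the next packet.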
It can be seen that the NIC 116 transmits packets onto network 112 only if the sending transmit queue is authorized to transmit packets having the characteristics for which header validation logic 1220 checks. In some embodiments still other requirements might be necessary before the NIC will allow the packet to go out.
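The overall flow of FIG. 13 for a single transmit queue can be summarized in a simplified software model. Several points are assumptions made only for this sketch and are not stated in the description: each ring slot holds a whole packet rather than a descriptor pointing to host memory, the authorization check is supplied as a callback, a rejected packet is reported as an event, and the read pointer is advanced past a rejected packet so that the queue does not stall. The names `TransmitQueue` and `service_queue` are hypothetical.

```python
from dataclasses import dataclass, field
from typing import Callable, List


@dataclass
class TransmitQueue:
    queue_id: int
    ring: List[bytes]   # simplified ring: each slot holds one packet
    read_ptr: int = 0   # device centric read pointer
    write_ptr: int = 0  # device centric write pointer, advanced by the sender
    events: List[str] = field(default_factory=list)  # stands in for the event queue


def service_queue(queue: TransmitQueue,
                  is_authorized: Callable[[int, bytes], bool],
                  tx_fifo: List[bytes]) -> bool:
    """Service at most one packet from the queue; return True if a packet was examined."""
    n = len(queue.ring)
    # Step 1312: does the write pointer modulo-exceed the read pointer?
    if (queue.write_ptr - queue.read_ptr) % n == 0:
        return False  # step 1314: nothing pending, move on to the next queue
    # Steps 1316-1318: read the entry at the read pointer and retrieve the packet.
    packet = queue.ring[queue.read_ptr]
    # Steps 1320-1322: validate the packet against the queue's authority.
    if is_authorized(queue.queue_id, packet):
        tx_fifo.append(packet)              # packet stays in the transmit FIFO
        queue.events.append("tx_complete")  # transmit completion event
    else:
        # Step 1326: the transfer is aborted, so nothing is left in the FIFO.
        queue.events.append("tx_error")     # assumed form of error reporting
    # Step 1324 for accepted packets; advancing on rejection is an assumption of this sketch.
    queue.read_ptr = (queue.read_ptr + 1) % n
    return True
```

A scheduler corresponding to step 1310 would call `service_queue` on whichever queue its selection algorithm picks next; that algorithm is left out here, as it is in the description.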
As used herein, “identification” of an item of information does not necessarily require the direct specification of that item of information. Information can be “identified” in a field simply by referring to the actual information through one or more layers of indirection, or by identifying one or more items of different information which are together sufficient to determine the actual item of information. In addition, the term “indicate” is used herein to mean the same as “identify”.
The foregoing description of preferred embodiments of the present invention has been provided for the purposes of illustration and description. It is not intended to be exhaustive or to limit the invention to the precise forms disclosed. Obviously, many modifications and variations will be apparent to practitioners skilled in this art. In particular, and without limitation, any and all variations described, suggested or incorporated by reference in the Background section of this patent application are specifically incorporated by reference into the description herein of embodiments of the invention. The embodiments described herein were chosen and described in order to best explain the principles of the invention and its practical application, thereby enabling others skilled in the art to understand the invention for various embodiments and with various modifications as are suited to the particular use contemplated. It is intended that the scope of the invention be defined by the following claims and their equivalents.

Claims (13)

1. A method for interfacing a computing device with a network interface device, for use with a network, comprising the steps of:
a first sending process of the computing device initiating establishment of a first transmit queue;
a privileged mode process, in response to the step of the first sending process initiating establishment of a first transmit queue, establishing the first transmit queue in a virtual address space of the first sending process,
the first sending process enqueueing a first data packet onto the first transmit queue for transmission onto the network, without involvement of any privileged mode routines, the first data packet having a first characteristic;
the network interface device receiving at least part of the first data packet from the first transmit queue for transmission onto the network;
the network interface device making a first determination of whether the first sending process has authority to transmit data packets having the first characteristic onto the network, in dependence upon whether the first transmit queue has such authority according to authorization rights maintained on the network interface device on a per-transmit queue basis; and
the network interface device transmitting the first data packet onto the network only if the first determination is positive.
2. A method according to claim 1, wherein the first characteristic comprises a particular network transport protocol,
and wherein the step of the network interface device making a first determination comprises the step of the network interface device determining whether the first sending process is authorized to transmit data packets using the particular network transport protocol.
3. A method according to claim 1, wherein the first characteristic comprises a particular source IP port number,
and wherein the step of the network interface device making a first determination comprises the step of the network interface device determining whether the first sending process is authorized to transmit data packets having the particular source IP port number.
4. A method according to claim 1, wherein the first characteristic comprises a particular destination IP port number,
and wherein the step of the network interface device making a first determination comprises the step of the network interface device determining whether the first sending process is authorized to transmit data packets having the particular destination IP port number.
5. A method according to claim 1, wherein the first characteristic comprises a particular source IP address,
and wherein the step of the network interface device making a first determination comprises the step of the network interface device determining whether the first sending process is authorized to transmit data packets having the particular source IP address.
6. A method according to claim 1, wherein the first characteristic comprises a particular destination IP address,
and wherein the step of the network interface device making a first determination comprises the step of the network interface device determining whether the first sending process is authorized to transmit data packets having the particular destination IP address.
7. A method according to claim 1, wherein the step of the network interface device receiving at least part of the first data packet comprises the step of the network interface device retrieving at least part of the first data packet from the first transmit queue.
8. A method according to claim 1, further comprising the step of the first sending process notifying the network interface device, without invoking any privileged mode routines, of the availability of the first data packet in the first transmit queue.
9. A method according to claim 1, wherein the first sending process is a user level process,
further comprising the step of a privileged mode process, in response to the step of the first sending process initiating establishment of a first transmit queue, programming authorization rights for the first transmit queue into a database accessible to the network interface device,
and wherein the step of the network interface device making a first determination comprises the step of the network interface device examining the authorization rights for the first transmit queue in the database.
10. A method according to claim 1, further comprising the steps of:
a second sending process initiating establishment of a second transmit queue;
a privileged mode process, in response to the step of the second sending process initiating establishment of a second transmit queue, establishing the second transmit queue in a virtual address space of the second sending process;
the second sending process enqueueing a second data packet onto the second transmit queue for transmission onto the network, the second data packet having a second characteristic;
the network interface device receiving at least part of the second data packet from the second transmit queue;
the network interface device making a second determination of whether the second sending process has authority to transmit data packets having the second characteristic onto the network; and
the network interface device transmitting the second data packet onto the network only if the second determination is positive.
11. A method according to claim 10, wherein the second sending process is a user level process, further comprising the step of a privileged mode process, in response to the step of the second sending process initiating establishment of a second transmit queue, programming authorization rights for the second transmit queue into the database accessible to the network interface device,
and wherein the step of the network interface device making a second determination comprises the step of the network interface device examining the authorization rights for the second transmit queue in the database.
12. A method according to claim 1, wherein the step of the network interface device receiving at least part of the first data packet comprises the step of the network interface device retrieving at least part of the first data packet from the first transmit queue,
further comprising the step of aborting retrieval of the first data packet if the first determination is negative.
13. A system comprising:
a computing device; and
network interface device in communication with the computing device via a physical bus, wherein the computing device is configured such that:
in response to a first sending process of the computing device initiating establishment of a first transmit queue, a privileged mode process of the computing device establishes the first transmit queue in a virtual address space of the first sending process,
and in response to the first sending process enqueueing a first data packet onto the first transmit queue for transmission onto a network, the first data packet having a first characteristic, the network interface device receives at least part of the first data packet without involvement of any privileged mode routines of the computing device;
and wherein the network interface device is configured to make a first determination as to whether the first sending process has authority to transmit data packets having the first characteristic onto the network, in dependence upon whether the first transmit queue has such authority according to authorization rights maintained on the network interface device on a per-transmit queue basis, and to transmit the first data packet onto the network only if the first determination is positive.
US12/612,078 2005-04-27 2009-11-04 Packet validation in virtual network interface architecture Active 2025-11-18 US8380882B2 (en)

Priority Applications (3)

Application Number Priority Date Filing Date Title
US12/612,078 US8380882B2 (en) 2005-04-27 2009-11-04 Packet validation in virtual network interface architecture
US13/765,579 US9912665B2 (en) 2005-04-27 2013-02-12 Packet validation in virtual network interface architecture
US15/888,498 US10924483B2 (en) 2005-04-27 2018-02-05 Packet validation in virtual network interface architecture

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
US11/116,018 US7634584B2 (en) 2005-04-27 2005-04-27 Packet validation in virtual network interface architecture
US12/612,078 US8380882B2 (en) 2005-04-27 2009-11-04 Packet validation in virtual network interface architecture

Related Parent Applications (1)

Application Number Title Priority Date Filing Date
US11/116,018 Continuation US7634584B2 (en) 2005-04-27 2005-04-27 Packet validation in virtual network interface architecture

Related Child Applications (1)

Application Number Title Priority Date Filing Date
US13/765,579 Continuation US9912665B2 (en) 2005-04-27 2013-02-12 Packet validation in virtual network interface architecture

Publications (2)

Publication Number Publication Date
US20100049876A1 US20100049876A1 (en) 2010-02-25
US8380882B2 true US8380882B2 (en) 2013-02-19

Family

ID=36716645

Family Applications (4)

Application Number Title Priority Date Filing Date
US11/116,018 Active 2026-11-18 US7634584B2 (en) 2005-04-27 2005-04-27 Packet validation in virtual network interface architecture
US12/612,078 Active 2025-11-18 US8380882B2 (en) 2005-04-27 2009-11-04 Packet validation in virtual network interface architecture
US13/765,579 Active 2028-10-06 US9912665B2 (en) 2005-04-27 2013-02-12 Packet validation in virtual network interface architecture
US15/888,498 Active 2025-12-11 US10924483B2 (en) 2005-04-27 2018-02-05 Packet validation in virtual network interface architecture

Country Status (3)

Country Link
US (4) US7634584B2 (en)
EP (1) EP1884085B1 (en)
WO (1) WO2006116594A1 (en)

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US9912665B2 (en) 2005-04-27 2018-03-06 Solarflare Communications, Inc. Packet validation in virtual network interface architecture
US10514937B2 (en) * 2012-01-05 2019-12-24 Vmware, Inc. Auto-discovery service and method of discovering applications within a virtual network

Families Citing this family (51)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US7971001B2 (en) 2004-12-28 2011-06-28 Sap Ag Least recently used eviction implementation
US7694065B2 (en) 2004-12-28 2010-04-06 Sap Ag Distributed cache architecture
US20060143256A1 (en) 2004-12-28 2006-06-29 Galin Galchev Cache region concept
US8204931B2 (en) 2004-12-28 2012-06-19 Sap Ag Session management within a multi-tiered enterprise network
US7539821B2 (en) 2004-12-28 2009-05-26 Sap Ag First in first out eviction implementation
US8589562B2 (en) 2005-04-29 2013-11-19 Sap Ag Flexible failover configuration
US7647436B1 (en) * 2005-04-29 2010-01-12 Sun Microsystems, Inc. Method and apparatus to interface an offload engine network interface with a host machine
US7966412B2 (en) * 2005-07-19 2011-06-21 Sap Ag System and method for a pluggable protocol handler
US8707323B2 (en) 2005-12-30 2014-04-22 Sap Ag Load balancing algorithm for servicing client requests
US7650471B2 (en) * 2006-01-06 2010-01-19 Hewlett-Packard Development Company, L.P. Head of queue cache for communication interfaces
US20070220293A1 (en) * 2006-03-16 2007-09-20 Toshiba America Electronic Components Systems and methods for managing power consumption in data processors using execution mode selection
US7594093B1 (en) * 2006-10-12 2009-09-22 Foundry Networks, Inc. Virtual memory mapping for efficient memory usage
US8819311B2 (en) * 2007-05-23 2014-08-26 Rpx Corporation Universal user input/output application layers
US7937449B1 (en) * 2007-10-08 2011-05-03 Empirix, Inc. System and method for enabling network stack processes in user mode and kernel mode to access kernel memory locations
US7979507B2 (en) * 2007-12-27 2011-07-12 Intel Corporation Transmit scaling using multiple queues
US8396953B2 (en) * 2010-03-12 2013-03-12 Hewlett-Packard Development Company, L.P. Processing packets using a virtualized descriptor queue
US8935457B2 (en) 2011-07-29 2015-01-13 International Business Machines Corporation Network filtering in a virtualized environment
US8732371B2 (en) * 2011-08-25 2014-05-20 Facebook, Inc. Managing overhead associated with service requests via software generated interrupts
US8868710B2 (en) 2011-11-18 2014-10-21 Amazon Technologies, Inc. Virtual network interface objects
US9916545B1 (en) 2012-02-29 2018-03-13 Amazon Technologies, Inc. Portable network interfaces for authentication and license enforcement
US8813225B1 (en) 2012-06-15 2014-08-19 Amazon Technologies, Inc. Provider-arbitrated mandatory access control policies in cloud computing environments
US9059868B2 (en) * 2012-06-28 2015-06-16 Dell Products, Lp System and method for associating VLANs with virtual switch ports
CN102970345A (en) * 2012-11-01 2013-03-13 上海文广互动电视有限公司 Content collection system for multi-transmission protocol media content collecting
JP6122135B2 (en) * 2012-11-21 2017-04-26 コーヒレント・ロジックス・インコーポレーテッド Processing system with distributed processor
US9288163B2 (en) * 2013-03-15 2016-03-15 Avago Technologies General Ip (Singapore) Pte. Ltd. Low-latency packet receive method for networking devices
US9660933B2 (en) 2014-04-17 2017-05-23 Go Daddy Operating Company, LLC Allocating and accessing hosting server resources via continuous resource availability updates
US9501211B2 (en) 2014-04-17 2016-11-22 GoDaddy Operating Company, LLC User input processing for allocation of hosting server resources
US9787499B2 (en) 2014-09-19 2017-10-10 Amazon Technologies, Inc. Private alias endpoints for isolated virtual networks
US10021196B1 (en) 2015-06-22 2018-07-10 Amazon Technologies, Inc. Private service endpoints in isolated virtual networks
CN105471859B (en) * 2015-11-20 2019-02-26 中铁工程装备集团有限公司 A kind of access control method based on stream granularity
US20170249162A1 (en) * 2016-02-25 2017-08-31 Red Hat Israel, Ltd. Safe transmit packet processing for network function virtualization applications
US10599494B2 (en) * 2016-05-27 2020-03-24 Huawei Technologies Co., Ltd. Method and system of performing inter-process communication between OS-level containers in user space
US10237171B2 (en) * 2016-09-20 2019-03-19 Intel Corporation Efficient QoS support for software packet processing on general purpose servers
US10554554B2 (en) 2016-12-06 2020-02-04 Microsoft Technology Licensing, Llc Hybrid network processing load distribution in computing systems
US10715424B2 (en) 2016-12-06 2020-07-14 Microsoft Technology Licensing, Llc Network traffic management with queues affinitized to one or more cores
US10826841B2 (en) * 2016-12-06 2020-11-03 Microsoft Technology Licensing, Llc Modification of queue affinity to cores based on utilization
US20190068466A1 (en) * 2017-08-30 2019-02-28 Intel Corporation Technologies for auto-discovery of fault domains
US11792307B2 (en) 2018-03-28 2023-10-17 Apple Inc. Methods and apparatus for single entity buffer pool management
US10664420B2 (en) * 2018-04-19 2020-05-26 Avago Technologies International Sales Pte. Limited System and method for port-to-port communications using direct memory access
US11681625B2 (en) * 2018-12-20 2023-06-20 Intel Corporation Receive buffer management
CN109617833B (en) * 2018-12-25 2021-12-31 深圳市任子行科技开发有限公司 NAT data auditing method and system of multi-thread user mode network protocol stack system
US10853263B1 (en) * 2019-07-23 2020-12-01 Ati Technologies Ulc Unified kernel virtual address space for heterogeneous computing
US11477123B2 (en) 2019-09-26 2022-10-18 Apple Inc. Methods and apparatus for low latency operation in user space networking
US11558348B2 (en) 2019-09-26 2023-01-17 Apple Inc. Methods and apparatus for emerging use case support in user space networking
US11829303B2 (en) 2019-09-26 2023-11-28 Apple Inc. Methods and apparatus for device driver operation in non-kernel space
US20200319812A1 (en) * 2020-06-03 2020-10-08 Intel Corporation Intermediary for storage command transfers
US11606302B2 (en) 2020-06-12 2023-03-14 Apple Inc. Methods and apparatus for flow-based batching and processing
US11775359B2 (en) 2020-09-11 2023-10-03 Apple Inc. Methods and apparatuses for cross-layer processing
US11799986B2 (en) 2020-09-22 2023-10-24 Apple Inc. Methods and apparatus for thread level execution in non-kernel space
US11882051B2 (en) 2021-07-26 2024-01-23 Apple Inc. Systems and methods for managing transmission control protocol (TCP) acknowledgements
US11876719B2 (en) 2021-07-26 2024-01-16 Apple Inc. Systems and methods for managing transmission control protocol (TCP) acknowledgements

Citations (153)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US4805134A (en) 1986-01-09 1989-02-14 International Business Machines Corporation Electronic system for accessing graphical and textual information
US5272599A (en) 1993-03-19 1993-12-21 Compaq Computer Corporation Microprocessor heat dissipation apparatus for a printed circuit board
US5325532A (en) 1992-09-25 1994-06-28 Compaq Computer Corporation Automatic development of operating system boot image
US5636371A (en) 1995-06-07 1997-06-03 Bull Hn Information Systems Inc. Virtual network mechanism to access well known port application programs running on a single host system
US5677910A (en) 1992-08-07 1997-10-14 Plaintree Systems Inc. High performance two-port transport LAN bridge
US5740467A (en) 1992-01-09 1998-04-14 Digital Equipment Corporation Apparatus and method for controlling interrupts to a host during data transfer between the host and an adapter
US5790804A (en) 1994-04-12 1998-08-04 Mitsubishi Electric Information Technology Center America, Inc. Computer network interface and network protocol with direct deposit messaging
US5946189A (en) 1997-04-23 1999-08-31 Compaq Computer Corporation Pivotable support and heat sink apparatus removably connectable without tools to a computer processor
US6044415A (en) 1998-02-27 2000-03-28 Intel Corporation System for transferring I/O data between an I/O device and an application program's memory in accordance with a request directly over a virtual connection
US6070219A (en) * 1996-10-09 2000-05-30 Intel Corporation Hierarchical interrupt structure for event notification on multi-virtual circuit network interface controller
US6098112A (en) 1995-10-19 2000-08-01 Hewlett-Packard Company Streams function registering
US6122670A (en) 1997-10-30 2000-09-19 Tsi Telsys, Inc. Apparatus and method for constructing data for transmission within a reliable communication protocol by performing portions of the protocol suite concurrently
US6160554A (en) 1998-03-19 2000-12-12 Hewlett Packard Company Computer file content preview window
WO2001048972A1 (en) 1999-12-28 2001-07-05 Mellanox Technologies Ltd. Adaptive sampling
US6304945B1 (en) 1999-05-13 2001-10-16 Compaq Computer Corporation Method and apparatus for maintaining cache coherency in a computer system having multiple processor buses
US6332195B1 (en) 1996-02-09 2001-12-18 Secure Computing Corporation Secure server utilizing separate protocol stacks
US6349035B1 (en) 2000-09-29 2002-02-19 Compaq Information Technologies Group, L.P. Method and apparatus for tooless mating of liquid cooled cold plate with tapered interposer heat sink
WO2002035838A1 (en) 2000-10-28 2002-05-02 Voltaire Limited Modular set top box assembly
US20020059052A1 (en) 2000-09-12 2002-05-16 Mellanox Technologies Ltd. Co-simulation of network components
US20020112139A1 (en) 2001-02-13 2002-08-15 Krause Michael R. Highly available, monotonic increasing sequence number generation
US6438130B1 (en) 2001-06-28 2002-08-20 Mellanox Technologies Ltd. Forwarding database cache
US20020129293A1 (en) 2001-03-07 2002-09-12 Hutton John F. Scan based multiple ring oscillator structure for on-chip speed measurement
US20020140985A1 (en) 2001-04-02 2002-10-03 Hudson Kevin R. Color calibration for clustered printing
US20020156784A1 (en) 1999-04-08 2002-10-24 Hanes David H. Method for transferring and indexing data from old media to new media
US6496935B1 (en) 2000-03-02 2002-12-17 Check Point Software Technologies Ltd System, device and method for rapid packet filtering and processing
US6502203B2 (en) 1999-04-16 2002-12-31 Compaq Information Technologies Group, L.P. Method and apparatus for cluster system operation
US20030007165A1 (en) 2001-07-05 2003-01-09 Hudson Kevin R. Direction-dependent color conversion in bidirectional printing
US6530007B2 (en) 1998-07-13 2003-03-04 Compaq Information Technologies Group, L.P. Method and apparatus for supporting heterogeneous memory in computer systems
US20030058459A1 (en) 2001-09-26 2003-03-27 Yifeng Wu Generalized color calibration architecture and method
US20030063299A1 (en) 2001-10-02 2003-04-03 Cowan Philip B. Color calibration method and apparatus
US20030065856A1 (en) 2001-10-03 2003-04-03 Mellanox Technologies Ltd. Network adapter with multiple event queues
US20030081060A1 (en) 2001-10-29 2003-05-01 Huanzhao Zeng Method and system for management of color through conversion between color spaces
US20030172330A1 (en) 2002-03-11 2003-09-11 Barron Dwight L. Data redundancy in a hot pluggable, large symmetric multi-processor system
US20030191786A1 (en) 1999-02-01 2003-10-09 Matson Mark D. Computer method and apparatus for division and square root operations using signed digit
US20030202043A1 (en) 2002-04-24 2003-10-30 Huanzhao Zeng Determination of control points for construction of first color space-to-second color space look-up table
US20030214677A1 (en) 1998-03-26 2003-11-20 Ranjit Bhaskar Tone dependent variable halftoning wih adjustable algorithm selection
US6667918B2 (en) 2002-05-01 2003-12-23 Mellanox Technologies Ltd. Self-repair of embedded memory arrays
US6718392B1 (en) 2000-10-24 2004-04-06 Hewlett-Packard Development Company, L.P. Queue pair partitioning in distributed computer system
US20040071250A1 (en) 2000-06-20 2004-04-15 Bunton William P. High-speed interconnection adapter having automated lane de-skew
US6728743B2 (en) 2001-01-04 2004-04-27 Mellanox Technologies Ltd. Modulo remainder generator
US6735642B2 (en) 2000-06-02 2004-05-11 Mellanox Technologies Ltd. DMA doorbell
US20040141642A1 (en) 2003-01-22 2004-07-22 Huanzhao Zeng Color space conversion using interpolation
US6768996B1 (en) 1998-10-08 2004-07-27 Hewlett-Packard Development Company, L.P. System and method for retrieving an abstracted portion of a file without regard to the operating system of the current host computer
US6799220B1 (en) 2000-04-13 2004-09-28 Intel Corporation Tunneling management messages over a channel architecture network
US20040190538A1 (en) 2000-09-08 2004-09-30 Hewlett-Packard Development Company, L.P. Speculative transmit for system area network latency reduction
US20040190557A1 (en) 2003-03-27 2004-09-30 Barron Dwight L. Signaling packet
US20040193825A1 (en) 2003-03-27 2004-09-30 Garcia David J. Binding a memory window to a queue pair
US20040190533A1 (en) 2003-03-27 2004-09-30 Prashant Modi Method and apparatus for performing connection management with multiple stacks
US20040193734A1 (en) 2003-03-27 2004-09-30 Barron Dwight L. Atomic operations
US20040210754A1 (en) 2003-04-16 2004-10-21 Barron Dwight L. Shared security transform device, system and methods
US20040252685A1 (en) 2003-06-13 2004-12-16 Mellanox Technologies Ltd. Channel adapter with integrated switch
US20050008223A1 (en) 2003-07-09 2005-01-13 Huanzhao Zeng Representing extended color gamut information
US20050018221A1 (en) 2003-07-21 2005-01-27 Huanzhao Zeng Conforming output intensities of pens
US20050038941A1 (en) 2003-08-14 2005-02-17 Mallikarjun Chadalapaka Method and apparatus for accessing a memory
US20050039171A1 (en) 2003-08-12 2005-02-17 Avakian Arra E. Using interceptors and out-of-band data to monitor the performance of Java 2 enterprise edition (J2EE) applications
US20050039187A1 (en) 2003-08-12 2005-02-17 Avakian Arra E. Instrumenting java code by modifying bytecodes
US20050038918A1 (en) 2003-08-14 2005-02-17 Hilland Jeffrey R. Method and apparatus for implementing work request lists
US20050039172A1 (en) 2003-08-12 2005-02-17 Jeffrey Rees Synthesizing application response measurement (ARM) instrumentation
US6865672B1 (en) 1998-05-18 2005-03-08 Spearhead Technologies, Ltd. System and method for securing a computer communication network
US6868450B1 (en) 2000-05-17 2005-03-15 Hewlett-Packard Development Company, L.P. System and method for a process attribute based computer network filter
US20050066333A1 (en) 2003-09-18 2005-03-24 Krause Michael R. Method and apparatus for providing notification
US6904534B2 (en) 2001-09-29 2005-06-07 Hewlett-Packard Development Company, L.P. Progressive CPU sleep state duty cycle to limit peak power of multiple computers on shared power distribution unit
US20050172181A1 (en) 2004-01-16 2005-08-04 Mellanox Technologies Ltd. System and method for production testing of high speed communications receivers
US6931530B2 (en) 2002-07-22 2005-08-16 Vormetric, Inc. Secure network file access controller implementing access control and auditing
US6934758B2 (en) 1997-12-11 2005-08-23 Sun Microsystems, Inc. Stack-based access control using code and executor identifiers
US20050219314A1 (en) 2004-03-30 2005-10-06 David Donovan Formation of images
US20050219278A1 (en) 2004-03-30 2005-10-06 Hudson Kevin R Image formation with a flexible number of passes
US20050231751A1 (en) 2004-04-15 2005-10-20 Yifeng Wu Image processing system and method
US6978331B1 (en) 1999-09-08 2005-12-20 Mellanox Technologies Ltd. Synchronization of interrupts with data packets
US20060026443A1 (en) 2004-07-29 2006-02-02 Mcmahan Larry N Communication among partitioned devices
US20060045098A1 (en) 2004-08-31 2006-03-02 Krause Michael R System for port mapping in a network
US20060126619A1 (en) 2004-12-14 2006-06-15 Teisberg Robert R Aggregation over multiple processing nodes of network resources each providing offloaded connections between applications over a network
US20060165074A1 (en) 2004-12-14 2006-07-27 Prashant Modi Aggregation of network resources providing offloaded connections between applications over a network
US20060185011A1 (en) 2005-02-14 2006-08-17 International Business Machines Corporation Packet filtering in a NIC to control antidote loading
US7099275B2 (en) 2001-09-21 2006-08-29 Slt Logic Llc Programmable multi-service queue scheduler
US20060193318A1 (en) 2005-02-28 2006-08-31 Sriram Narasimhan Method and apparatus for processing inbound and outbound quanta of data
US7103626B1 (en) 2000-05-24 2006-09-05 Hewlett-Packard Development, L.P. Partitioning in distributed computer system
US7111303B2 (en) 2002-07-16 2006-09-19 International Business Machines Corporation Virtual machine operating system LAN
US20060228637A1 (en) 2005-04-11 2006-10-12 Jackson Bruce J Color filter
US20060248191A1 (en) 2005-04-27 2006-11-02 Hudson Charles L Aggregation of hybrid network resources operable to support both offloaded and non-offloaded connections
US7136397B2 (en) 2001-08-20 2006-11-14 Slt Logic Llc Network architecture and system for delivering bi-directional xDSL based services
US7143412B2 (en) 2002-07-25 2006-11-28 Hewlett-Packard Development Company, L.P. Method and apparatus for optimizing performance in a multi-processing system
US7149227B2 (en) 2002-05-31 2006-12-12 Mellanox Technologies Ltd. Round-robin arbiter with low jitter
US7151744B2 (en) 2001-09-21 2006-12-19 Slt Logic Llc Multi-service queuing method and apparatus that provides exhaustive arbitration, load balancing, and support for rapid port failover
US7216225B2 (en) 2000-05-24 2007-05-08 Voltaire Ltd. Filtered application-to-application communication
US7240350B1 (en) 2002-01-07 2007-07-03 Slt Logic, Llc System and method for providing communications to processes
US7245627B2 (en) 2002-04-23 2007-07-17 Mellanox Technologies Ltd. Sharing a network interface card among multiple hosts
US7254237B1 (en) 2001-01-12 2007-08-07 Slt Logic, Llc System and method for establishing a secure connection
US20070188351A1 (en) 2005-12-16 2007-08-16 Andrew Brown Hardware enablement using an interface
US20070220183A1 (en) 2002-07-23 2007-09-20 Mellanox Technologies Ltd. Receive Queue Descriptor Pool
US7285996B2 (en) 2005-09-30 2007-10-23 Slt Logic, Llc Delay-locked loop
US7316017B1 (en) 2003-01-06 2008-01-01 Slt Logic, Llc System and method for allocatiing communications to processors and rescheduling processes in a multiprocessor system
US20080024586A1 (en) 2006-07-31 2008-01-31 Rodolfo Jodra Barron System and method for calibrating a beam array of a printer
US7346702B2 (en) 2000-08-24 2008-03-18 Voltaire Ltd. System and method for highly scalable high-speed content-based filtering and load balancing in interconnected fabrics
US20080109526A1 (en) 2006-11-06 2008-05-08 Viswanath Subramanian Rdma data to responder node coherency domain
US20080115217A1 (en) 2006-10-31 2008-05-15 Hewlett-Packard Development Company, L.P. Method and apparatus for protection of a computer system from malicious code attacks
US20080115216A1 (en) 2006-10-31 2008-05-15 Hewlett-Packard Development Company, L.P. Method and apparatus for removing homogeneity from execution environment of computing system
US20080126509A1 (en) 2006-11-06 2008-05-29 Viswanath Subramanian Rdma qp simplex switchless connection
US7386619B1 (en) 2003-01-06 2008-06-10 Slt Logic, Llc System and method for allocating communications to processors in a multiprocessor system
US20080135774A1 (en) 2006-12-08 2008-06-12 Asml Netherlands B.V. Scatterometer, a lithographic apparatus and a focus analysis method
US20080148400A1 (en) 2006-10-31 2008-06-19 Hewlett-Packard Development Company, L.P. Method and apparatus for enforcement of software licence protection
US20080147828A1 (en) 2006-12-18 2008-06-19 Enstone Mark R Distributed Configuration Of Network Interface Cards
US20080177890A1 (en) 1999-05-24 2008-07-24 Krause Michael R Reliable datagram via independent source destination resources
US20080244060A1 (en) 2007-03-30 2008-10-02 Cripe Daniel N Electronic device profile migration
WO2008127672A2 (en) 2007-04-11 2008-10-23 Slt Logic Llc Modular blade for providing scalable mechanical, electrical and environmental functionality in the enterprise using advanced tca boards
US20080304519A1 (en) 2007-06-06 2008-12-11 Hewlett-Packard Development Company, L.P. Method for ethernet power savings on link aggregated groups
US7518164B2 (en) 2006-03-29 2009-04-14 Mellanox Technologies Ltd. Current-triggered low turn-on voltage SCR
US20090165003A1 (en) 2007-12-21 2009-06-25 Van Jacobson System and method for allocating communications to processors and rescheduling processes in a multiprocessor system
US7573967B2 (en) 2005-07-01 2009-08-11 Slt Logic Llc Input threshold adjustment in a synchronous data sampling circuit
US20090201926A1 (en) 2006-08-30 2009-08-13 Mellanox Technologies Ltd Fibre channel processing by a host channel adapter
US7580495B2 (en) 2005-06-30 2009-08-25 Slt Logic Llc Mixer-based phase control
US20090213856A1 (en) 2001-05-04 2009-08-27 Slt Logic Llc System and Method for Providing Transformation of Multi-Protocol Packets in a Data Stream
US20090268612A1 (en) 2008-04-29 2009-10-29 Google Inc. Method and apparatus for a network queuing engine and congestion management gateway
WO2009134219A1 (en) 2008-04-28 2009-11-05 Hewlett-Packard Development Company, L.P. Adjustable server-transmission rates over fixed-speed backplane connections within a multi-server enclosure
WO2009136933A1 (en) 2008-05-08 2009-11-12 Hewlett-Packard Development Company, L.P. A method for interfacing a fibre channel network with an ethernet based network
US7631106B2 (en) 2002-08-15 2009-12-08 Mellanox Technologies Ltd. Prefetching of receive queue descriptors
US20090302923A1 (en) 2006-05-08 2009-12-10 Mellanox Technologies Ltd. Terminated input buffer with offset cancellation circuit
US7653754B2 (en) 2004-01-05 2010-01-26 Mellanox Technologies Ltd. Method, system and protocol that enable unrestricted user-level access to a network interface adapter
WO2010020907A2 (en) 2008-08-21 2010-02-25 Voltaire Ltd. Device, system, and method of distributing messages
US7688853B2 (en) 2001-05-04 2010-03-30 Slt Logic, Llc System and method for hierarchical policing of flows and subflows of a data stream
US20100088437A1 (en) 2008-10-06 2010-04-08 Mellanox Technologies Ltd Infiniband adaptive congestion control adaptive marking rate
US20100138840A1 (en) 2005-08-23 2010-06-03 Mellanox Technologies Ltd. System and method for accelerating input/output access operation on a virtual machine
US20100169880A1 (en) 2008-12-25 2010-07-01 Voltaire Ltd. Virtual input-output connections for machine virtualization
US20100188140A1 (en) 2006-04-07 2010-07-29 Mellanox Technologies Ltd. Accurate Global Reference Voltage Distribution System With Local Reference Voltages Referred To Local Ground And Locally Supplied Voltage
US20100189206A1 (en) 2009-01-28 2010-07-29 Mellanox Technologies Ltd Precise Clock Synchronization
WO2010087826A1 (en) 2009-01-29 2010-08-05 Hewlett-Packard Development Company, L.P. Connecting ports of one or more electronic devices to different subsets of networks based on different operating modes
US7801027B2 (en) 2006-08-30 2010-09-21 Mellanox Technologies Ltd. Auto-negotiation by nodes on an infiniband fabric
US7802071B2 (en) 2007-07-16 2010-09-21 Voltaire Ltd. Device, system, and method of publishing information to multiple subscribers
US7813460B2 (en) 2005-09-30 2010-10-12 Slt Logic, Llc High-speed data sampler with input threshold adjustment
US20100265849A1 (en) 2009-04-21 2010-10-21 Voltaire Ltd Spanning tree root selection in a hierarchical network
US20100274876A1 (en) 2009-04-28 2010-10-28 Mellanox Technologies Ltd Network interface device with memory management capabilities
US7827442B2 (en) 2006-01-23 2010-11-02 Slt Logic Llc Shelf management controller with hardware/software implemented dual redundant configuration
US7835375B2 (en) 2001-05-04 2010-11-16 Slt Logic, Llc Method and apparatus for providing multi-protocol, multi-stage, real-time frame classification
US7848322B2 (en) 2007-03-26 2010-12-07 Voltaire Ltd. Device, system and method of UDP communication
US7864787B2 (en) 2007-03-26 2011-01-04 Voltaire Ltd. Device, system and method of multicast communication
US20110004457A1 (en) 2006-08-07 2011-01-06 Voltaire Ltd. Service-oriented infrastructure management
US20110010557A1 (en) 2009-07-07 2011-01-13 Mellanox Technologies Ltd Control message signature for device control
US20110029847A1 (en) 2009-07-30 2011-02-03 Mellanox Technologies Ltd Processing of data integrity field
US20110058571A1 (en) 2009-09-09 2011-03-10 Mellanox Technologies Ltd. Data switch with shared port buffers
US7921178B2 (en) 2008-12-04 2011-04-05 Voltaire Ltd. Device, system, and method of accessing storage
US20110083064A1 (en) 2009-10-04 2011-04-07 Mellanox Technologies Ltd Processing of block and transaction signatures
WO2011043769A1 (en) 2009-10-07 2011-04-14 Hewlett-Packard Development Company, L.P. Notification protocol based endpoint caching of host memory
US7929539B2 (en) 2001-04-11 2011-04-19 Mellanox Technologies Ltd. Multiple queue pair access with a single doorbell
US7930437B2 (en) 2001-04-11 2011-04-19 Mellanox Technologies Ltd. Network adapter with shared database for message context information
US20110096668A1 (en) 2009-10-26 2011-04-28 Mellanox Technologies Ltd. High-performance adaptive routing
US7934959B2 (en) 2009-10-06 2011-05-03 Mellanox Technologies Ltd. Adapter for pluggable module
WO2011053305A1 (en) 2009-10-30 2011-05-05 Hewlett-Packard Development Co Thermal bus bar for a blade enclosure
WO2011053330A1 (en) 2009-11-02 2011-05-05 Hewlett-Packard Development Company, L.P. Multiprocessing computing with distributed embedded switching
US20110113083A1 (en) 2009-11-11 2011-05-12 Voltaire Ltd Topology-Aware Fabric-Based Offloading of Collective Functions
US20110119673A1 (en) 2009-11-15 2011-05-19 Mellanox Technologies Ltd. Cross-channel network operation offloading for collective operations
US20110116512A1 (en) 2009-11-19 2011-05-19 Mellanox Technologies Ltd Dynamically-Connected Transport Service
US7978606B2 (en) 2001-05-04 2011-07-12 Slt Logic, Llc System and method for policing multiple data flows and multi-protocol data flows
US20110173352A1 (en) 2010-01-13 2011-07-14 Mellanox Technologies Ltd Power Reduction on Idle Communication Lanes

Family Cites Families (71)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US685672A (en) * 1899-11-16 1901-10-29 Henry M Williams Rail-joint.
JPH07110749A (en) 1993-04-16 1995-04-25 Hewlett Packard Co <Hp> Dynamic construction method of dialog box
JP3473975B2 (en) * 1993-09-08 2003-12-08 Hitachi, Ltd. Network system and communication method in network
US20030033588A1 (en) 2001-01-29 2003-02-13 John Alexander System, method and article of manufacture for using a library map to create and maintain IP cores effectively
US20030086300A1 (en) 2001-04-06 2003-05-08 Gareth Noyes FPGA coprocessing system
US7480796B2 (en) 2001-06-07 2009-01-20 Kwan Sofware Engineering, Inc. System and method for authenticating data using incompatible digest functions
US8006280B1 (en) 2001-12-12 2011-08-23 Hildebrand Hal S Security system for generating keys from access rules in a decentralized manner and methods therefor
US7301906B2 (en) 2001-12-17 2007-11-27 Lsi Corporation Methods and structures for improved buffer management and dynamic adaption of flow control status in high-speed communication networks
US7441262B2 (en) 2002-07-11 2008-10-21 Seaway Networks Inc. Integrated VPN/firewall system
JP4619788B2 (en) 2002-10-11 2011-01-26 Panasonic Corporation Method for protecting identification information in WLAN interconnection
EP2511787B1 (en) 2003-05-23 2017-09-20 IP Reservoir, LLC Data decompression and search using FPGA devices
US8417852B2 (en) 2003-06-05 2013-04-09 Nvidia Corporation Uploading TCP frame data to user buffers and buffers in system memory
US7685254B2 (en) 2003-06-10 2010-03-23 Pandya Ashish A Runtime adaptable search processor
US20050114700A1 (en) 2003-08-13 2005-05-26 Sensory Networks, Inc. Integrated circuit apparatus and method for high throughput signature based network applications
US7685436B2 (en) 2003-10-02 2010-03-23 Itt Manufacturing Enterprises, Inc. System and method for a secure I/O interface
WO2005086681A2 (en) 2004-03-04 2005-09-22 Quova, Inc. Geo-location and geo-compliance utilizing a client agent
EP1730917A1 (en) 2004-03-30 2006-12-13 Telecom Italia S.p.A. Method and system for network intrusion detection, related network and computer program product
US7308438B2 (en) 2004-04-23 2007-12-11 International Business Machines Corporation Adaptive management method with authorization control
US7505463B2 (en) 2004-06-15 2009-03-17 Sun Microsystems, Inc. Rule set conflict resolution
US20060026681A1 (en) 2004-07-29 2006-02-02 Zakas Phillip H System and method of characterizing and managing electronic traffic
WO2006049072A1 (en) 2004-11-04 2006-05-11 Nec Corporation Firewall examination system and firewall information extraction system
US7873959B2 (en) 2005-02-01 2011-01-18 Microsoft Corporation Publishing the status of and updating firmware components
US7634584B2 (en) 2005-04-27 2009-12-15 Solarflare Communications, Inc. Packet validation in virtual network interface architecture
US7702629B2 (en) 2005-12-02 2010-04-20 Exegy Incorporated Method and device for high performance regular expression pattern matching
KR100738537B1 (en) 2005-12-27 2007-07-11 삼성전자주식회사 method and apparatus for network intrusion detection
US7954114B2 (en) 2006-01-26 2011-05-31 Exegy Incorporated Firmware socket module for FPGA-based pipeline processing
US7970899B2 (en) 2006-03-03 2011-06-28 Barracuda Networks Inc Integrated data flow packet admission and traffic management apparatus
US8379841B2 (en) 2006-03-23 2013-02-19 Exegy Incorporated Method and system for high throughput blockwise independent encryption/decryption
US7636703B2 (en) 2006-05-02 2009-12-22 Exegy Incorporated Method and apparatus for approximate pattern matching
US7840482B2 (en) 2006-06-19 2010-11-23 Exegy Incorporated Method and system for high speed options pricing
US7921046B2 (en) 2006-06-19 2011-04-05 Exegy Incorporated High speed processing of financial information using FPGA devices
US20080021716A1 (en) 2006-07-19 2008-01-24 Novell, Inc. Administrator-defined mandatory compliance expression
WO2008014328A2 (en) 2006-07-25 2008-01-31 Pivx Solutions, Inc. Systems and methods for digitally-signed updates
US7519602B2 (en) 2006-10-31 2009-04-14 Sap Ag Systems and methods for information exchange using object warehousing
US7660793B2 (en) 2006-11-13 2010-02-09 Exegy Incorporated Method and system for high performance integration, processing and searching of structured and unstructured data using coprocessors
US7996348B2 (en) 2006-12-08 2011-08-09 Pandya Ashish A 100GBPS security and search architecture using programmable intelligent search memory (PRISM) that comprises one or more bit interval counters
US7890692B2 (en) 2007-08-17 2011-02-15 Pandya Ashish A FSA context switch architecture for programmable intelligent search memory
US8326816B2 (en) 2010-05-06 2012-12-04 Oracle International Corporation Fine grain synchronization for database replay
US8423789B1 (en) 2007-05-22 2013-04-16 Marvell International Ltd. Key generation techniques
US7852756B2 (en) 2007-06-13 2010-12-14 02Micro International Limited Network interface system with filtering function
WO2009029842A1 (en) 2007-08-31 2009-03-05 Exegy Incorporated Method and apparatus for hardware-accelerated encryption/decryption
US8171483B2 (en) 2007-10-20 2012-05-01 Citrix Systems, Inc. Method and system for communicating between isolation environments
US10229453B2 (en) 2008-01-11 2019-03-12 Ip Reservoir, Llc Method and system for low latency basket calculation
EP2256661A4 (en) 2008-03-25 2012-08-15 Panasonic Corp Electronic terminal, control method, computer program, and integrated circuit
US8374986B2 (en) 2008-05-15 2013-02-12 Exegy Incorporated Method and system for accelerated stream processing
US8627328B2 (en) 2008-11-14 2014-01-07 Oracle International Corporation Operation control for deploying and managing software service in a virtual environment
CA2744746C (en) 2008-12-15 2019-12-24 Exegy Incorporated Method and apparatus for high-speed processing of financial market depth data
US8706836B2 (en) 2008-12-15 2014-04-22 Shara Susznnah Vincent Live streaming media and data communication hub
US8812424B2 (en) 2009-07-21 2014-08-19 International Business Machines Corporation Semantic web technologies in system automation
US9210140B2 (en) 2009-08-19 2015-12-08 Solarflare Communications, Inc. Remote functionality selection
US8291258B2 (en) 2010-01-08 2012-10-16 Juniper Networks, Inc. High availability for network security devices
EP3663999A1 (en) 2010-02-05 2020-06-10 Medversant Technologies, LLC System and method for peer referencing in an online computer system
US8473316B1 (en) 2010-06-04 2013-06-25 Amazon Technologies, Inc. System and method for order processing state management
DE102010025638B4 (en) 2010-06-30 2012-09-06 Siemens Aktiengesellschaft Method for processing data in a packet-switched communication network
US20120016833A1 (en) 2010-07-14 2012-01-19 Sap Ag Systems and methods for dynamic process model reconfiguration based on process execution context
US10193873B2 (en) 2010-09-30 2019-01-29 Comcast Cable Communications, Llc Key derivation for secure communications
US8601265B2 (en) 2010-11-22 2013-12-03 Netapp, Inc. Method and system for improving storage security in a cloud computing environment
US10037568B2 (en) 2010-12-09 2018-07-31 Ip Reservoir, Llc Method and apparatus for managing orders in financial markets
KR101173401B1 (en) 2011-01-24 2012-08-10 엘지이노텍 주식회사 Solar cell and manufacturing method of the same
US9401807B2 (en) 2011-02-03 2016-07-26 Hewlett Packard Enterprise Development Lp Processing non-editable fields in web pages
JP5731223B2 (en) 2011-02-14 2015-06-10 International Business Machines Corporation Abnormality detection device, monitoring control system, abnormality detection method, program, and recording medium
JP5691723B2 (en) 2011-03-25 2015-04-01 Fujitsu Limited Monitoring method, information processing apparatus, and monitoring program
US9158561B2 (en) 2011-08-18 2015-10-13 Vmware, Inc. Systems and methods for modifying an operating system for a virtual machine
US20130054857A1 (en) 2011-08-25 2013-02-28 Neil R.T Horman Reducing latency at a network interface card
US8789135B1 (en) 2012-06-15 2014-07-22 Google Inc. Scalable stateful firewall design in openflow based networks
US9104461B2 (en) 2012-06-15 2015-08-11 Oracle International Corporation Hypervisor-based management and migration of services executing within virtual environments based on service dependencies and hardware requirements
US9167050B2 (en) 2012-08-16 2015-10-20 Futurewei Technologies, Inc. Control pool based enterprise policy enabler for controlled cloud access
US8855126B2 (en) 2012-09-06 2014-10-07 Unisys Corporation Fast data packet transfer operations
US9571507B2 (en) 2012-10-21 2017-02-14 Mcafee, Inc. Providing a virtual security appliance architecture to a virtual cloud infrastructure
US10742604B2 (en) 2013-04-08 2020-08-11 Xilinx, Inc. Locked down network interface
CN106464674B (en) 2014-05-12 2021-11-16 Google LLC Managing NIC encryption streams for migrating customers or tasks

Patent Citations (172)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US4805134A (en) 1986-01-09 1989-02-14 International Business Machines Corporation Electronic system for accessing graphical and textual information
US5740467A (en) 1992-01-09 1998-04-14 Digital Equipment Corporation Apparatus and method for controlling interrupts to a host during data transfer between the host and an adapter
US5677910A (en) 1992-08-07 1997-10-14 Plaintree Systems Inc. High performance two-port transport LAN bridge
US5325532A (en) 1992-09-25 1994-06-28 Compaq Computer Corporation Automatic development of operating system boot image
US5272599A (en) 1993-03-19 1993-12-21 Compaq Computer Corporation Microprocessor heat dissipation apparatus for a printed circuit board
US5790804A (en) 1994-04-12 1998-08-04 Mitsubishi Electric Information Technology Center America, Inc. Computer network interface and network protocol with direct deposit messaging
US5636371A (en) 1995-06-07 1997-06-03 Bull Hn Information Systems Inc. Virtual network mechanism to access well known port application programs running on a single host system
US6098112A (en) 1995-10-19 2000-08-01 Hewlett-Packard Company Streams function registering
US6332195B1 (en) 1996-02-09 2001-12-18 Secure Computing Corporation Secure server utilizing separate protocol stacks
US6070219A (en) * 1996-10-09 2000-05-30 Intel Corporation Hierarchical interrupt structure for event notification on multi-virtual circuit network interface controller
US5946189A (en) 1997-04-23 1999-08-31 Compaq Computer Corporation Pivotable support and heat sink apparatus removably connectable without tools to a computer processor
US6122670A (en) 1997-10-30 2000-09-19 Tsi Telsys, Inc. Apparatus and method for constructing data for transmission within a reliable communication protocol by performing portions of the protocol suite concurrently
US6934758B2 (en) 1997-12-11 2005-08-23 Sun Microsystems, Inc. Stack-based access control using code and executor identifiers
US6044415A (en) 1998-02-27 2000-03-28 Intel Corporation System for transferring I/O data between an I/O device and an application program's memory in accordance with a request directly over a virtual connection
US6160554A (en) 1998-03-19 2000-12-12 Hewlett Packard Company Computer file content preview window
US20030214677A1 (en) 1998-03-26 2003-11-20 Ranjit Bhaskar Tone dependent variable halftoning wih adjustable algorithm selection
US6865672B1 (en) 1998-05-18 2005-03-08 Spearhead Technologies, Ltd. System and method for securing a computer communication network
US6530007B2 (en) 1998-07-13 2003-03-04 Compaq Information Technologies Group, L.P. Method and apparatus for supporting heterogeneous memory in computer systems
US6768996B1 (en) 1998-10-08 2004-07-27 Hewlett-Packard Development Company, L.P. System and method for retrieving an abstracted portion of a file without regard to the operating system of the current host computer
US20030191786A1 (en) 1999-02-01 2003-10-09 Matson Mark D. Computer method and apparatus for division and square root operations using signed digit
US7509355B2 (en) 1999-04-08 2009-03-24 Hewlett-Packard Development Company, L.P. Method for transferring and indexing data from old media to new media
US20020156784A1 (en) 1999-04-08 2002-10-24 Hanes David H. Method for transferring and indexing data from old media to new media
US6502203B2 (en) 1999-04-16 2002-12-31 Compaq Information Technologies Group, L.P. Method and apparatus for cluster system operation
US6304945B1 (en) 1999-05-13 2001-10-16 Compaq Computer Corporation Method and apparatus for maintaining cache coherency in a computer system having multiple processor buses
US20080177890A1 (en) 1999-05-24 2008-07-24 Krause Michael R Reliable datagram via independent source destination resources
US7904576B2 (en) 1999-05-24 2011-03-08 Hewlett-Packard Development Company, L.P. Reliable datagram via independent source destination resources
US6978331B1 (en) 1999-09-08 2005-12-20 Mellanox Technologies Ltd. Synchronization of interrupts with data packets
WO2001048972A1 (en) 1999-12-28 2001-07-05 Mellanox Technologies Ltd. Adaptive sampling
US6496935B1 (en) 2000-03-02 2002-12-17 Check Point Software Technologies Ltd System, device and method for rapid packet filtering and processing
US6799220B1 (en) 2000-04-13 2004-09-28 Intel Corporation Tunneling management messages over a channel architecture network
US6868450B1 (en) 2000-05-17 2005-03-15 Hewlett-Packard Development Company, L.P. System and method for a process attribute based computer network filter
US7103626B1 (en) 2000-05-24 2006-09-05 Hewlett-Packard Development, L.P. Partitioning in distributed computer system
US7216225B2 (en) 2000-05-24 2007-05-08 Voltaire Ltd. Filtered application-to-application communication
US6735642B2 (en) 2000-06-02 2004-05-11 Mellanox Technologies Ltd. DMA doorbell
US20040071250A1 (en) 2000-06-20 2004-04-15 Bunton William P. High-speed interconnection adapter having automated lane de-skew
US7346702B2 (en) 2000-08-24 2008-03-18 Voltaire Ltd. System and method for highly scalable high-speed content-based filtering and load balancing in interconnected fabrics
US20040190538A1 (en) 2000-09-08 2004-09-30 Hewlett-Packard Development Company, L.P. Speculative transmit for system area network latency reduction
US20020059052A1 (en) 2000-09-12 2002-05-16 Mellanox Technologies Ltd. Co-simulation of network components
US6349035B1 (en) 2000-09-29 2002-02-19 Compaq Information Technologies Group, L.P. Method and apparatus for tooless mating of liquid cooled cold plate with tapered interposer heat sink
US6718392B1 (en) 2000-10-24 2004-04-06 Hewlett-Packard Development Company, L.P. Queue pair partitioning in distributed computer system
WO2002035838A1 (en) 2000-10-28 2002-05-02 Voltaire Limited Modular set top box assembly
US6728743B2 (en) 2001-01-04 2004-04-27 Mellanox Technologies Ltd. Modulo remainder generator
US7254237B1 (en) 2001-01-12 2007-08-07 Slt Logic, Llc System and method for establishing a secure connection
US6950961B2 (en) 2001-02-13 2005-09-27 Hewlett-Packard Development Company, L.P. Highly available, monotonic increasing sequence number generation
US20020112139A1 (en) 2001-02-13 2002-08-15 Krause Michael R. Highly available, monotonic increasing sequence number generation
US20020129293A1 (en) 2001-03-07 2002-09-12 Hutton John F. Scan based multiple ring oscillator structure for on-chip speed measurement
US20020140985A1 (en) 2001-04-02 2002-10-03 Hudson Kevin R. Color calibration for clustered printing
US7929539B2 (en) 2001-04-11 2011-04-19 Mellanox Technologies Ltd. Multiple queue pair access with a single doorbell
US7930437B2 (en) 2001-04-11 2011-04-19 Mellanox Technologies Ltd. Network adapter with shared database for message context information
US7835375B2 (en) 2001-05-04 2010-11-16 Slt Logic, Llc Method and apparatus for providing multi-protocol, multi-stage, real-time frame classification
US7978606B2 (en) 2001-05-04 2011-07-12 Slt Logic, Llc System and method for policing multiple data flows and multi-protocol data flows
US7688853B2 (en) 2001-05-04 2010-03-30 Slt Logic, Llc System and method for hierarchical policing of flows and subflows of a data stream
US20090213856A1 (en) 2001-05-04 2009-08-27 Slt Logic Llc System and Method for Providing Transformation of Multi-Protocol Packets in a Data Stream
US6438130B1 (en) 2001-06-28 2002-08-20 Mellanox Technologies Ltd. Forwarding database cache
US20030007165A1 (en) 2001-07-05 2003-01-09 Hudson Kevin R. Direction-dependent color conversion in bidirectional printing
US7136397B2 (en) 2001-08-20 2006-11-14 Slt Logic Llc Network architecture and system for delivering bi-directional xDSL based services
US7151744B2 (en) 2001-09-21 2006-12-19 Slt Logic Llc Multi-service queuing method and apparatus that provides exhaustive arbitration, load balancing, and support for rapid port failover
US7099275B2 (en) 2001-09-21 2006-08-29 Slt Logic Llc Programmable multi-service queue scheduler
US20030058459A1 (en) 2001-09-26 2003-03-27 Yifeng Wu Generalized color calibration architecture and method
US6904534B2 (en) 2001-09-29 2005-06-07 Hewlett-Packard Development Company, L.P. Progressive CPU sleep state duty cycle to limit peak power of multiple computers on shared power distribution unit
US20030063299A1 (en) 2001-10-02 2003-04-03 Cowan Philip B. Color calibration method and apparatus
US20030065856A1 (en) 2001-10-03 2003-04-03 Mellanox Technologies Ltd. Network adapter with multiple event queues
US20030081060A1 (en) 2001-10-29 2003-05-01 Huanzhao Zeng Method and system for management of color through conversion between color spaces
US7240350B1 (en) 2002-01-07 2007-07-03 Slt Logic, Llc System and method for providing communications to processes
US20030172330A1 (en) 2002-03-11 2003-09-11 Barron Dwight L. Data redundancy in a hot pluggable, large symmetric multi-processor system
US7093158B2 (en) 2002-03-11 2006-08-15 Hewlett-Packard Development Company, L.P. Data redundancy in a hot pluggable, large symmetric multi-processor system
US7245627B2 (en) 2002-04-23 2007-07-17 Mellanox Technologies Ltd. Sharing a network interface card among multiple hosts
US20030202043A1 (en) 2002-04-24 2003-10-30 Huanzhao Zeng Determination of control points for construction of first color space-to-second color space look-up table
US6667918B2 (en) 2002-05-01 2003-12-23 Mellanox Technologies Ltd. Self-repair of embedded memory arrays
US7149227B2 (en) 2002-05-31 2006-12-12 Mellanox Technologies Ltd. Round-robin arbiter with low jitter
US7111303B2 (en) 2002-07-16 2006-09-19 International Business Machines Corporation Virtual machine operating system LAN
US6931530B2 (en) 2002-07-22 2005-08-16 Vormetric, Inc. Secure network file access controller implementing access control and auditing
US20070220183A1 (en) 2002-07-23 2007-09-20 Mellanox Technologies Ltd. Receive Queue Descriptor Pool
US7143412B2 (en) 2002-07-25 2006-11-28 Hewlett-Packard Development Company, L.P. Method and apparatus for optimizing performance in a multi-processing system
US7631106B2 (en) 2002-08-15 2009-12-08 Mellanox Technologies Ltd. Prefetching of receive queue descriptors
US20080301406A1 (en) 2003-01-06 2008-12-04 Van Jacobson System and method for allocating communications to processors in a multiprocessor system
US7386619B1 (en) 2003-01-06 2008-06-10 Slt Logic, Llc System and method for allocating communications to processors in a multiprocessor system
US7316017B1 (en) 2003-01-06 2008-01-01 Slt Logic, Llc System and method for allocatiing communications to processors and rescheduling processes in a multiprocessor system
US20040141642A1 (en) 2003-01-22 2004-07-22 Huanzhao Zeng Color space conversion using interpolation
US7502826B2 (en) 2003-03-27 2009-03-10 Hewlett-Packard Development Company, L.P. Atomic operations
US20040190533A1 (en) 2003-03-27 2004-09-30 Prashant Modi Method and apparatus for performing connection management with multiple stacks
US7103744B2 (en) 2003-03-27 2006-09-05 Hewlett-Packard Development Company, L.P. Binding a memory window to a queue pair
US20040190557A1 (en) 2003-03-27 2004-09-30 Barron Dwight L. Signaling packet
US20040193734A1 (en) 2003-03-27 2004-09-30 Barron Dwight L. Atomic operations
US20040193825A1 (en) 2003-03-27 2004-09-30 Garcia David J. Binding a memory window to a queue pair
US7554993B2 (en) 2003-03-27 2009-06-30 Hewlett-Packard Development Company, L.P. Method and apparatus for performing connection management with multiple stacks
US20040210754A1 (en) 2003-04-16 2004-10-21 Barron Dwight L. Shared security transform device, system and methods
US20040252685A1 (en) 2003-06-13 2004-12-16 Mellanox Technologies Ltd. Channel adapter with integrated switch
US20050008223A1 (en) 2003-07-09 2005-01-13 Huanzhao Zeng Representing extended color gamut information
US20050018221A1 (en) 2003-07-21 2005-01-27 Huanzhao Zeng Conforming output intensities of pens
US20050039172A1 (en) 2003-08-12 2005-02-17 Jeffrey Rees Synthesizing application response measurement (ARM) instrumentation
US20050039187A1 (en) 2003-08-12 2005-02-17 Avakian Arra E. Instrumenting java code by modifying bytecodes
US20050039171A1 (en) 2003-08-12 2005-02-17 Avakian Arra E. Using interceptors and out-of-band data to monitor the performance of Java 2 enterprise edition (J2EE) applications
US20050038941A1 (en) 2003-08-14 2005-02-17 Mallikarjun Chadalapaka Method and apparatus for accessing a memory
US7757232B2 (en) 2003-08-14 2010-07-13 Hewlett-Packard Development Company, L.P. Method and apparatus for implementing work request lists
US20050038918A1 (en) 2003-08-14 2005-02-17 Hilland Jeffrey R. Method and apparatus for implementing work request lists
US7617376B2 (en) 2003-08-14 2009-11-10 Hewlett-Packard Development Company, L.P. Method and apparatus for accessing a memory
US20050066333A1 (en) 2003-09-18 2005-03-24 Krause Michael R. Method and apparatus for providing notification
US7404190B2 (en) 2003-09-18 2008-07-22 Hewlett-Packard Development Company, L.P. Method and apparatus for providing notification via multiple completion queue handlers
US7653754B2 (en) 2004-01-05 2010-01-26 Mellanox Technologies Ltd. Method, system and protocol that enable unrestricted user-level access to a network interface adapter
US20050172181A1 (en) 2004-01-16 2005-08-04 Mellanox Technologies Ltd. System and method for production testing of high speed communications receivers
US20050219314A1 (en) 2004-03-30 2005-10-06 David Donovan Formation of images
US20050219278A1 (en) 2004-03-30 2005-10-06 Hudson Kevin R Image formation with a flexible number of passes
US20050231751A1 (en) 2004-04-15 2005-10-20 Yifeng Wu Image processing system and method
US7650386B2 (en) 2004-07-29 2010-01-19 Hewlett-Packard Development Company, L.P. Communication among partitioned devices
US20060026443A1 (en) 2004-07-29 2006-02-02 Mcmahan Larry N Communication among partitioned devices
US20060045098A1 (en) 2004-08-31 2006-03-02 Krause Michael R System for port mapping in a network
US20060126619A1 (en) 2004-12-14 2006-06-15 Teisberg Robert R Aggregation over multiple processing nodes of network resources each providing offloaded connections between applications over a network
US7403535B2 (en) 2004-12-14 2008-07-22 Hewlett-Packard Development Company, L.P. Aggregation of network resources providing offloaded connections between applications over a network
US7551614B2 (en) 2004-12-14 2009-06-23 Hewlett-Packard Development Company, L.P. Aggregation over multiple processing nodes of network resources each providing offloaded connections between applications over a network
US20060165074A1 (en) 2004-12-14 2006-07-27 Prashant Modi Aggregation of network resources providing offloaded connections between applications over a network
US20060185011A1 (en) 2005-02-14 2006-08-17 International Business Machines Corporation Packet filtering in a NIC to control antidote loading
US20060193318A1 (en) 2005-02-28 2006-08-31 Sriram Narasimhan Method and apparatus for processing inbound and outbound quanta of data
US20060228637A1 (en) 2005-04-11 2006-10-12 Jackson Bruce J Color filter
US20060248191A1 (en) 2005-04-27 2006-11-02 Hudson Charles L Aggregation of hybrid network resources operable to support both offloaded and non-offloaded connections
US7580415B2 (en) 2005-04-27 2009-08-25 Hewlett-Packard Development Company, L.P. Aggregation of hybrid network resources operable to support both offloaded and non-offloaded connections
US7580495B2 (en) 2005-06-30 2009-08-25 Slt Logic Llc Mixer-based phase control
US7573967B2 (en) 2005-07-01 2009-08-11 Slt Logic Llc Input threshold adjustment in a synchronous data sampling circuit
US20100138840A1 (en) 2005-08-23 2010-06-03 Mellanox Technologies Ltd. System and method for accelerating input/output access operation on a virtual machine
US7813460B2 (en) 2005-09-30 2010-10-12 Slt Logic, Llc High-speed data sampler with input threshold adjustment
US7285996B2 (en) 2005-09-30 2007-10-23 Slt Logic, Llc Delay-locked loop
US20070188351A1 (en) 2005-12-16 2007-08-16 Andrew Brown Hardware enablement using an interface
US7827442B2 (en) 2006-01-23 2010-11-02 Slt Logic Llc Shelf management controller with hardware/software implemented dual redundant configuration
US7518164B2 (en) 2006-03-29 2009-04-14 Mellanox Technologies Ltd. Current-triggered low turn-on voltage SCR
US20100188140A1 (en) 2006-04-07 2010-07-29 Mellanox Technologies Ltd. Accurate Global Reference Voltage Distribution System With Local Reference Voltages Referred To Local Ground And Locally Supplied Voltage
US20090302923A1 (en) 2006-05-08 2009-12-10 Mellanox Technologies Ltd. Terminated input buffer with offset cancellation circuit
US20080024586A1 (en) 2006-07-31 2008-01-31 Rodolfo Jodra Barron System and method for calibrating a beam array of a printer
US20110004457A1 (en) 2006-08-07 2011-01-06 Voltaire Ltd. Service-oriented infrastructure management
US20090201926A1 (en) 2006-08-30 2009-08-13 Mellanox Technologies Ltd Fibre channel processing by a host channel adapter
US7801027B2 (en) 2006-08-30 2010-09-21 Mellanox Technologies Ltd. Auto-negotiation by nodes on an infiniband fabric
US20080115217A1 (en) 2006-10-31 2008-05-15 Hewlett-Packard Development Company, L.P. Method and apparatus for protection of a computer system from malicious code attacks
US20080115216A1 (en) 2006-10-31 2008-05-15 Hewlett-Packard Development Company, L.P. Method and apparatus for removing homogeneity from execution environment of computing system
US20080148400A1 (en) 2006-10-31 2008-06-19 Hewlett-Packard Development Company, L.P. Method and apparatus for enforcement of software licence protection
US20080109526A1 (en) 2006-11-06 2008-05-08 Viswanath Subramanian Rdma data to responder node coherency domain
US20080126509A1 (en) 2006-11-06 2008-05-29 Viswanath Subramanian Rdma qp simplex switchless connection
US20080135774A1 (en) 2006-12-08 2008-06-12 Asml Netherlands B.V. Scatterometer, a lithographic apparatus and a focus analysis method
US20080147828A1 (en) 2006-12-18 2008-06-19 Enstone Mark R Distributed Configuration Of Network Interface Cards
US7864787B2 (en) 2007-03-26 2011-01-04 Voltaire Ltd. Device, system and method of multicast communication
US7848322B2 (en) 2007-03-26 2010-12-07 Voltaire Ltd. Device, system and method of UDP communication
US7856488B2 (en) 2007-03-30 2010-12-21 Hewlett-Packard Development Company, L.P. Electronic device profile migration
US20080244060A1 (en) 2007-03-30 2008-10-02 Cripe Daniel N Electronic device profile migration
WO2008127672A2 (en) 2007-04-11 2008-10-23 Slt Logic Llc Modular blade for providing scalable mechanical, electrical and environmental functionality in the enterprise using advanced tca boards
US20080304519A1 (en) 2007-06-06 2008-12-11 Hewlett-Packard Development Company, L.P. Method for ethernet power savings on link aggregated groups
US7802071B2 (en) 2007-07-16 2010-09-21 Voltaire Ltd. Device, system, and method of publishing information to multiple subscribers
US20090165003A1 (en) 2007-12-21 2009-06-25 Van Jacobson System and method for allocating communications to processors and rescheduling processes in a multiprocessor system
US20110029669A1 (en) 2008-04-28 2011-02-03 Mike Chuang Adjustable Server-Transmission Rates Over Fixed-Speed Backplane Connections Within A Multi-Server Enclosure
WO2009134219A1 (en) 2008-04-28 2009-11-05 Hewlett-Packard Development Company, L.P. Adjustable server-transmission rates over fixed-speed backplane connections within a multi-server enclosure
US20090268612A1 (en) 2008-04-29 2009-10-29 Google Inc. Method and apparatus for a network queuing engine and congestion management gateway
US20110044344A1 (en) 2008-05-08 2011-02-24 Hudson Charles L A method for interfacing a fibre channel network with an ethernet based network
WO2009136933A1 (en) 2008-05-08 2009-11-12 Hewlett-Packard Development Company, L.P. A method for interfacing a fibre channel network with an ethernet based network
WO2010020907A2 (en) 2008-08-21 2010-02-25 Voltaire Ltd. Device, system, and method of distributing messages
US20100088437A1 (en) 2008-10-06 2010-04-08 Mellanox Technologies Ltd Infiniband adaptive congestion control adaptive marking rate
US7921178B2 (en) 2008-12-04 2011-04-05 Voltaire Ltd. Device, system, and method of accessing storage
US20100169880A1 (en) 2008-12-25 2010-07-01 Voltaire Ltd. Virtual input-output connections for machine virtualization
US20100189206A1 (en) 2009-01-28 2010-07-29 Mellanox Technologies Ltd Precise Clock Synchronization
WO2010087826A1 (en) 2009-01-29 2010-08-05 Hewlett-Packard Development Company, L.P. Connecting ports of one or more electronic devices to different subsets of networks based on different operating modes
US20100265849A1 (en) 2009-04-21 2010-10-21 Voltaire Ltd Spanning tree root selection in a hierarchical network
US8000336B2 (en) 2009-04-21 2011-08-16 Voltaire Ltd. Spanning tree root selection in a hierarchical network
US20100274876A1 (en) 2009-04-28 2010-10-28 Mellanox Technologies Ltd Network interface device with memory management capabilities
US20110010557A1 (en) 2009-07-07 2011-01-13 Mellanox Technologies Ltd Control message signature for device control
US20110029847A1 (en) 2009-07-30 2011-02-03 Mellanox Technologies Ltd Processing of data integrity field
US20110058571A1 (en) 2009-09-09 2011-03-10 Mellanox Technologies Ltd. Data switch with shared port buffers
US20110083064A1 (en) 2009-10-04 2011-04-07 Mellanox Technologies Ltd Processing of block and transaction signatures
US7934959B2 (en) 2009-10-06 2011-05-03 Mellanox Technologies Ltd. Adapter for pluggable module
WO2011043769A1 (en) 2009-10-07 2011-04-14 Hewlett-Packard Development Company, L.P. Notification protocol based endpoint caching of host memory
US20110096668A1 (en) 2009-10-26 2011-04-28 Mellanox Technologies Ltd. High-performance adaptive routing
WO2011053305A1 (en) 2009-10-30 2011-05-05 Hewlett-Packard Development Co Thermal bus bar for a blade enclosure
WO2011053330A1 (en) 2009-11-02 2011-05-05 Hewlett-Packard Development Company, L.P. Multiprocessing computing with distributed embedded switching
US20110113083A1 (en) 2009-11-11 2011-05-12 Voltaire Ltd Topology-Aware Fabric-Based Offloading of Collective Functions
US20110119673A1 (en) 2009-11-15 2011-05-19 Mellanox Technologies Ltd. Cross-channel network operation offloading for collective operations
US20110116512A1 (en) 2009-11-19 2011-05-19 Mellanox Technologies Ltd Dynamically-Connected Transport Service
US20110173352A1 (en) 2010-01-13 2011-07-14 Mellanox Technologies Ltd Power Reduction on Idle Communication Lanes

Non-Patent Citations (154)

* Cited by examiner, † Cited by third party
Title
A. Edwards, et al.; ACM Computer Communication Review, vol. 24, No. 4, pp. 14-23, Oct. 1994.
A. Edwards, S. Muir; ACM Computer Communication Review, vol. 25, No. 4, Oct. 1995.
A. Romanow and S. Floyd; ACM Computer Communication Review, vol. 24, No. 4, p. 79-88, Oct. 1994.
Andrew D. Birrell, et al.; Communications of the ACM, vol. 25, Issue 4, pp. 260-274, Apr. 1982.
Andy Currid; ACM Queue, vol. 2, No. 3, 2004, May 1, 2004.
B. Leslie, et al.; J. Comput. Sci. & Technol., vol. 20, Sep. 2005.
Babak Falsafi, et al.; Proceedings of the 1994 conference on Supercomputing, pp. 380-389, Washington D.C., Nov. 14, 1994.
Bilic Hrvoye, et al.; article in Proceedings of the 9th Symposium on High Performance Interconnects, "Deferred Segmentation for Wire-Speed Transmission of Large TCP Frames over Standard GbE Networks," Aug. 22, 2001, 5pp.
Bilic Hrvoye, et al.; presentation slides from 9th Symposium on High Performance Interconnects, "Deferred Segmentation for Wire-Speed Transmission of Large TCP Frames over Standard GbE Networks," Aug. 22, 2001, 9pp.
Boon S. Ang, et al.; Proceedings of the 1998 ACM/IEEE conference on Supercomputing, Orlando, Florida, Nov. 7, 1998.
Bruce Lowekamp, et al.; ACM Computer Communication Review, vol. 31, No. 4, Oct. 2001.
Bruce S. Davie; ACM Computer Communication Review, vol. 21, No. 4, Sep. 1991.
C. A. Thekkath, et al.; ACM Computer Communication Review, vol. 23, No. 4, Oct. 1993.
C. Brendan S. Traw, et al.; ACM Computer Communication Review, vol. 21, No. 4, p. 317-325, Sep. 1991.
C. Kline; ACM Computer Communication Review, vol. 17, No. 5, Aug. 1987.
C. Partridge, J. Hughes, J. Stone; ACM Computer Communication Review, vol. 25, No. 4, p. 68-76, Oct. 1995.
C. Traw and J. Smith; IEEE Journal on Selected Areas in Communications, pp. 240-253, Feb. 1993.
Charles Kalmanek; ACM Computer Communication Review, vol. 32, No. 5, pp. 13-19, Nov. 2002.
Charles P. Thacker and Lawrence C. Stewart; ACM Operating Systems Review, vol. 21, Issue 4, p. 164-172, 1987, Oct. 1997.
Cheng Jin, et al.; Proceedings of IEEE Infocom 2004, pp. 1246-1259, Mar. 7, 2004.
Chi-Chao Chang, et al.; Proceedings of the 1996 ACM/IEEE conference on Supercomputing, Pittsburgh, Nov. 17, 1996.
Chris Maeda, Brian Bershad; ACM Operating Systems Review, vol. 27, Issue 5, p. 244-255, Dec. 1993.
Christopher A. Kent, Jeffrey C. Mogul; ACM Computer Communication Review, vol. 17, No. 5, pp. 390-401, Oct. 1987.
Craig Partridge; ACM Computer Communication Review, vol. 20, No. 1, p. 44-53, Jan. 1990.
D. D. Clark and D. L. Tennenhouse; ACM Computer Communication Review, vol. 20, No. 4, pp. 200-208, Sep. 1990.
D. L. Tennenhouse, D. J. Wetherall; ACM Computer Communication Review, vol. 26, No. 2, pp. 15-20, Apr. 1996.
Danny Cohen, et al.; ACM Computer Communication Review, vol. 23, No. 4, p. 32-44, Jul. 1993.
Danny Cohen, Gregory Finn, Robert Felderman, Annette DeSchon; Journal of High Speed Networks, Jan. 3, 1994.
Danny Cohen, Gregory Finn, Robert Felderman, Annette DeSchon; Made available by authors, Jan. 10, 1992.
David A. Borman; ACM Computer Communication Review, vol. 19, No. 2, p. 11-15, Apr. 1989.
David D. Clark, et al.; IEEE Communications Magazine, vol. 27, No. 6, pp. 23-29, Jun. 1989.
David D. Clark; ACM Computer Communication Review, vol. 18, No. 4, pp. 106-114, Aug. 1988.
David R. Boggs, et al.; ACM Computer Communication Review, vol. 18, No. 4, p. 222-234, Aug. 1988.
David R. Cheriton; ACM Computer Communication Review, vol. 19, No. 4, p. 158-169, Sep. 1989.
David Wetherall; ACM Computer Communication Review, vol. 36, No. 3, pp. 77-78, Jul. 2006.
Derek McAuley, Rolf Neugebauer; Proceedings of the ACM SIGCOMM 2003 Workshops, Aug. 2003.
Derek Robert McAuley; PhD Thesis, University of Cambridge, Sep. 1989.
Dickman, L., "Protocol OffLoading vs OnLoading in High Performance Networks," 14th Symposium on High Performance Interconnects, Aug. 23, 2006, 8pp.
E. Blanton and M. Allman; ACM Computer Communication Review, vol. 32, No. 1, Jan. 2002.
E. Ruetsche; ACM Computer Communication Review, vol. 23, No. 3, Jul. 1993.
Ed Anderson, et al.; Proceedings of the 1997 ACM/IEEE conference on Supercomputing, p. 1-17, San Jose, California, Nov. 16, 1997.
Edward D. Lazowska, David A. Patterson; ACM Computer Communication Review, vol. 35, No. 2, Jul. 2005.
Eric C. Cooper, et al.; ACM Computer Communication Review, vol. 20, No. 4, p. 135-144, Sep. 1990.
Erich Ruetsche and Matthias Kaiserswerth; Proceedings of the IFIP TC6/WG6.4 Fourth International Conference on High Performance Networking IV, Dec. 14, 1992.
F.F. Kuo; ACM Computer Communication Review, vol. 4, No. 1, Jan. 1974.
Gary S. Delp, et al.; ACM Computer Communication Review, vol. 18, No. 4, p. 165-174, Aug. 1988.
Gene Tsudik; ACM Computer Communication Review, vol. 22, No. 5, pp. 29-38, Oct. 1992.
Geoffray P., "Protocol off-loading vs on-loading in high-performance networks," 14th Symposium on High Performance Interconnects, Aug. 23, 2006, 5pp.
Gordon E. Moore; Electronics, vol. 38, No. 8, pp. 114-117, Apr. 19, 1965.
Greg Chesson; Proceedings of the Third International Conference on High Speed Networking, Nov. 1991.
Greg Minshall, et al.; ACM Computer Communication Review, vol. 36, No. 3, pp. 79-92, Jul. 2006.
Greg Regnier, et al.; Computer, IEEE Computer Society, vol. 37, No. 11, pp. 48-58, Nov. 2004.
Greg Regnier, et al.; IEEE Micro, vol. 24, No. 1, p. 24-31, Jan. 1994.
Gregory G. Finn and Paul Mockapetris; Proceedings of InterOp '94, Las Vegas, Nevada, May 1994.
Gregory G. Finn; ACM Computer Communication Review, vol. 21, No. 5, p. 18-29, Oct. 1991.
Gregory L. Chesson; United States District Court, Northern District California, San Francisco Division, Feb. 4, 2005.
H. K. Jerry Chu; Proceedings of the USENIX Annual Technical Conference 1996, Jan. 1996.
H. Kanakia and D. Cheriton; ACM Computer Communication Review, vol. 18, No. 4, p. 175-187, Aug. 1988.
Harvey J. Wassermann, et al.; Proceedings of the 1997 ACM/IEEE conference on Supercomputing, p. 1-11, San Jose, California, Nov. 16, 1997.
Humaira Kamal, et al.; Proceedings of the 2005 ACM/IEEE conference on Supercomputing, p. 30, Seattle, Washington, Nov. 12, 2005.
Ian Leslie and Derek R. McAuley; ACM Computer Communication Review, vol. 21, No. 4, p. 327, Sep. 1991.
Ian M. Leslie, et al.; ACM Computer Communication Review, vol. 14, No. 2, pp. 2-9, Jun. 1984.
Ian Pratt and Keir Fraser; Proceedings of IEEE Infocom 2001, pp. 67-76, Apr. 22, 2001.
J. C. Mogul; ACM Computer Communication Review, vol. 25, No. 4, Oct. 1995.
J. Carver Hill; Communications of the ACM, vol. 16, No. 6, p. 350-351, Jun. 1973.
J. Evans and T. Buller; IEEE TCGN Gigabit Networking Workshop, 2001, Apr. 22, 2001.
J. Vis; ACM Computer Communication Review, vol. 24, No. 1, pp. 7-11, Jan. 1994.
Jack B. Dennis and Earl C. Van Horn; Communications of the ACM, vol. 9, No. 3, pp. 143-155, Mar. 1966.
Jeffrey C. Mogul; Proceedings of HotOS IX: The 9th Workshop on Hot Topics in Operating Systems, pp. 25-30, May 18, 2003.
Jeffrey R. Michel; MSci Thesis, University of Virginia, 1993, Aug. 1993.
Jenwei Hsieh, et al.; Proceedings of the 2000 ACM/IEEE conference on Supercomputing, Dallas, Texas, Nov. 4, 2000.
Jiuxing Liu, et al.; Proceedings of the 2003 ACM/IEEE conference on Supercomputing, Phoenix, Arizona, Nov. 15, 2003.
Joe Touch, et al.; "Atomic-2" slides, Gigabit Networking Workshop '97 Meeting, Kobe, Japan, Apr. 1997, 10pp.
Joe Touch, et al.; "Host-based Routing Using Peer DMA," Gigabit Networking Workshop '97 Meeting, Kobe, Japan, Apr. 1997, 2pp.
John M. McQuillan, et al.; Proceedings of the 6th Data Communications Symposium, p. 63, Nov. 1979.
John Nagle; ACM Computer Communication Review, vol. 14, No. 4, p. 11-17, Oct. 1984.
John Salmon, et al.; Proceedings of the 1998 ACM/IEEE conference on Supercomputing, Orlando, Florida, Nov. 7, 1998.
Jon Crowcroft, Derek McAuley; ACM Computer Communication Review, vol. 32, No. 5, Nov. 2002.
Jon Crowcroft; ACM Computer Communication Review, vol. 36, No. 2, pp. 51-52, Apr. 2006.
Jonathan Kay and Joseph Pasquale; ACM Computer Communication Review, vol. 23, No. 4, pp. 259-268, Oct. 1993.
Jonathan M. Smith and C. Brendan S. Traw; IEEE Network, vol. 7, Issue 4, pp. 44-52, Jul. 1993.
Jonathan Smith; ACM Computer Communication Review, vol. 32, No. 5, pp. 29-37, Nov. 2002.
Jonathan Stone, Craig Partridge; ACM Computer Communication Review, vol. 30, No. 4, pp. 309-319, Oct. 2000.
Jose Carlos Sancho, et al.; Proceedings of the 2006 ACM/IEEE conference on Supercomputing, Tampa, Florida, Nov. 11, 2006.
Justin Hurwitz, Wu-chun Feng; Proceedings of the 11th Symposium on High Performance Interconnects, Aug. 20, 2003.
K. Kleinpaste, P. Steenkiste, B. Zill; ACM Computer Communication Review, vol. 25, No. 4, p. 87-98, Oct. 1995.
Ken Calvert; ACM Computer Communication Review, vol. 36, No. 2, pp. 27-30, Apr. 2006.
Kieran Mansley, et al.; Euro-Par Conference 2007, pp. 224-233, Rennes, France, Aug. 28, 2007.
L. S. Brakmo, et al.; ACM Computer Communication Review, vol. 24, No. 4, p. 24-35, Oct. 1994.
M. Allman; ACM Computer Communication Review, vol. 29, No. 3, Jul. 1999.
M. de Vivo, et al.; ACM Computer Communication Review, vol. 29, No. 1, pp. 81-85, Jan. 1999.
M. Kaiserswerth; IEEE/ACM Transactions on Networking, vol. 1, Issue 6, pp. 650-663, Dec. 1993.
M.V. Wilkes and R.M. Needham; ACM SIGOPS Operating Systems Review, vol. 14, Issue 1, pp. 21-29, Jan. 1980.
Mansley K., "Engineering a User-Level TCP for the CLAN Network," Laboratory for Communication Engineering, Univ. Cambridge, England, ACM SIGCOMM Aug. 2003 Workshops, pp. 228-236.
Margaret L. Simmons and Harvey J. Wasserman; Proceedings of the 1988 ACM/IEEE conference on Supercomputing, p. 288-295, Orlando, Florida, Nov. 12, 1988.
Mark David Hayter; PhD Thesis, University of Cambridge, Sep. 1993.
Mark Hayter, Derek McAuley; ACM Operating Systems Review, vol. 25, Issue 4, p. 14-21, Oct. 1991.
Marvin Zelkowitz; Communications of the ACM, vol. 14, No. 6, p. 417-418, Jun. 1971.
Matthias Kaiserswerth; IEEE/ACM Transactions on Networking, vol. 1, No. 6, p. 650-663, Dec. 1993.
Mengjou Lin, et al.; Proceedings of the 1994 conference on Supercomputing, Washington D.C., Nov. 14, 1994.
Michael J. Dixon; University of Cambridge Computer Laboratory Technical Report No. 245, Jan. 1992.
Michael S. Warren, et al.; Proceedings of the 1998 ACM/IEEE conference on Supercomputing, Orlando, Florida, Nov. 7, 1998.
Mogul J., "TCP offload is a dumb idea whose time has come," USENIX Assoc., Proceedings of HotOS IX: The 9th Workshop on Hot Topics in Operating Systems, May 2003, pp. 24-30.
Montry G., OpenFabrics Alliance presentation slides, 14th Symposium on High Performance Interconnects, Aug. 23, 2006, 8pp.
Murali Rangarajan, et al.; Technical Report DCR-TR-481, Computer Science Department, Rutgers University, Mar. 2002.
Nanette J. Boden, et al.; Draft of paper published in IEEE Micro, vol. 15, No. 1, pp. 29-36, 1995, Nov. 16, 1994.
NR Adiga, et al.; Proceedings of the 2002 ACM/IEEE conference on Supercomputing, pp. 1-22, Baltimore, Nov. 16, 2002.
O. Angin, et al.; ACM Computer Communication Review, vol. 27, No. 3, pp. 100-117, Jul. 1997.
P. Balaji, et al.; Proceedings of the IEEE International Conference on Cluster Computing, Sep. 2005.
P. Druschel, et al.; ACM Computer Communication Review, vol. 24, No. 4, Oct. 1994.
P. Kermani and L. Kleinrock; Computer Networks, vol. 3, No. 4, pp. 267-286, Sep. 1979.
Parry Husbands and James C. Hoe; Proceedings of the 1998 ACM/IEEE conference on Supercomputing, p. 1-15, Orlando, Florida, Nov. 7, 1998.
Pasi Sarolahti, et al.; ACM Computer Communication Review, vol. 33, No. 2, Apr. 2003.
Patrick Crowley, et al.; Proceedings of the 14th international conference on Supercomputing, pp. 54-65, Santa Fe, New Mexico, May 8, 2000.
Patrick Geoffray; HPCWire article: http://www.hpcwire.com/features/17886984.html, Aug. 18, 2006.
Paul E. McKenney and Ken F. Dove; ACM Computer Communication Review, vol. 22, No. 4, Oct. 1992.
Paul Ronald Barham; PhD Thesis, University of Cambridge, Jul. 1996.
Paul V. Mockapetris, Kevin J. Dunlap; ACM Computer Communication Review, vol. 18, No. 4, pp. 123-133, Aug. 1988.
Peter Druschel and Larry L. Peterson; ACM Operating Systems Review, vol. 27, Issue 5, p. 189-202, Dec. 1993.
Peter Steenkiste; ACM Computer Communication Review, vol. 22, No. 4, Oct. 1992.
Petrini F., "Protocol Off-loading vs On-loading in High-Performance Networks," 14th Symposium on High Performance Interconnects, Aug. 23, 2006, 4pp.
Philip Buonadonna, et al.; Proceedings of the 1998 ACM/IEEE conference on Supercomputing, p. 1-15, Orlando, Florida, Nov. 7, 1998.
Piyush Shivam, et al.; Proceedings of the 2001 ACM/IEEE conference on Supercomputing, pp. 57, Denver, Nov. 10, 2001.
R. Braden, et al.; ACM Computer Communication Review, vol. 19, No. 2, p. 86-94, Apr. 1989.
R. Bush, D. Meyer; IETF Network Working Group, Request for Comments: 3439, Dec. 2002.
R. J. Black, I. Leslie, and D. McAuley; ACM Computer Communication Review, vol. 24, No. 4, p. 158-167, Oct. 1994.
Raj K. Singh, et al.; ACM Computer Communication Review, vol. 24, No. 3, p. 8-17, Jul. 1994.
Raj K. Singh, et al.; Proceedings of the 1993 ACM/IEEE conference on Supercomputing, p. 452-461, Portland, Oregon, Nov. 15, 1993.
Regnier G., "Protocol Onload vs. Offload," 14th Symposium on High Performance Interconnects, Aug. 23, 2006, 1pp.
Riddoch D. et al, "Distributed Computing with the CLAN Network," Laboratory for Communications Engineering, Cambridge, England, SIGCOMM 2002, 13 pp.
Robert M. Brandriff, et al.; ACM Computer Communication Review, vol. 15, No. 4, Sep. 1985.
Robert M. Metcalfe and David R. Boggs; Communications of the ACM, vol. 19, Issue 7, pp. 395-404, Jul. 1976.
Robert Ross, et al.; Proceedings of the 2001 ACM/IEEE conference on Supercomputing, pp. 11, Denver, Nov. 10, 2001.
S. L. Pope, et al.; Parallel and Distributed Computing and Networks, Brisbane, Australia, Dec. 1998.
Sally Floyd; ACM Computer Communication Review, vol. 24, No. 5, p. 8-23, Oct. 1994.
Sayantan Sur, et al.; Proceedings of the 2006 ACM/IEEE conference on Supercomputing, Tampa, Florida, Nov. 11, 2006.
Srihari Makineni and Ravi Iyer; Proceedings of the 10th International Symposium on High Performance Computer Architecture, pp. 152, Feb. 14, 2004.
Steve Muir and Jonathan Smith; Technical Report MS-CIS-00-04, University of Pennsylvania, Jan. 2000.
Steven J. Sistare, Christopher J. Jackson; Proceedings of the 2002 ACM/IEEE conference on Supercomputing, p. 1-15, Baltimore, Nov. 16, 2002.
Steven Pope, David Riddoch; ACM Computer Communication Review, vol. 37, No. 2, pp. 89-92, 2007, Mar. 19, 2007.
Stuart Wray, et al.; Proceedings of the International Conference on Multimedia Computing and Systems, p. 265-273, Boston, May 1994.
Sumitha Bhandarkar, et al.; ACM Computer Communication Review, vol. 36, No. 1, pp. 41-50, Jan. 2006.
Thomas Sterling, et al.; Proceedings of the 24th International Conference on Parallel Processing, pp. 11-14, Aug. 1995.
Thorsten von Eicken, et al.; ACM Operating Systems Review, vol. 29, Issue 5, p. 109-126, Dec. 1995.
Tom Kelly; ACM Computer Communication Review, vol. 33, No. 2, pp. 83-91, Apr. 2003.
V. Cerf, et al.; ACM Computer Communication Review, vol. 6, No. 1, p. 1-18, Jan. 1976.
V. Jacobson; ACM Computer Communication Review, vol. 18, No. 4, p. 314-329, Aug. 1988.
Various forum members; Message-Passing Interface Forum, University of Tennessee, Knoxville, May 5, 1994.
Vinay Aggarwal, et al.; ACM Computer Communication Review, vol. 33, No. 5, Oct. 2003.
Vinton Cerf, Robert Kahn; IEEE Transactions on Communications, vol. COM-22, No. 5, pp. 637-648, May 1974.
W. E. Leland, et al.; ACM Computer Communication Review, vol. 23, No. 4, p. 85-95, Oct. 1993.
W. Feng and P. Tinnakornsrisuphap; Proceedings of the 2000 ACM/IEEE conference on Supercomputing, Dallas, Texas, Nov. 4, 2000.
W. Feng, et al.; Proceedings of the 13th Symposium on High Performance Interconnects, Aug. 17, 2005.
Wu-chun Feng, et al.; Proceedings of the 2003 ACM/IEEE conference on Supercomputing, Phoenix, Arizona, Nov. 15, 2003.

Cited By (3)

Publication number Priority date Publication date Assignee Title
US9912665B2 (en) 2005-04-27 2018-03-06 Solarflare Communications, Inc. Packet validation in virtual network interface architecture
US10924483B2 (en) 2005-04-27 2021-02-16 Xilinx, Inc. Packet validation in virtual network interface architecture
US10514937B2 (en) * 2012-01-05 2019-12-24 Vmware, Inc. Auto-discovery service and method of discovering applications within a virtual network

Also Published As

Publication number Publication date
EP1884085B1 (en) 2012-12-26
US9912665B2 (en) 2018-03-06
WO2006116594A1 (en) 2006-11-02
US20060248234A1 (en) 2006-11-02
US7634584B2 (en) 2009-12-15
US20140059221A1 (en) 2014-02-27
US10924483B2 (en) 2021-02-16
EP1884085A1 (en) 2008-02-06
US20180159860A1 (en) 2018-06-07
US20100049876A1 (en) 2010-02-25

Similar Documents

Publication Publication Date Title
US10924483B2 (en) Packet validation in virtual network interface architecture
US11210148B2 (en) Reception according to a data transfer protocol of data directed to any of a plurality of destination entities
US9594842B2 (en) Hashing algorithm for network receive filtering
US7496699B2 (en) DMA descriptor queue read and cache write pointer arrangement
US7543290B2 (en) Multiple queue pair access with single doorbell
EP2632109B1 (en) Data processing system and method therefor
EP1861778B1 (en) Data processing system
US7003586B1 (en) Arrangement for implementing kernel bypass for access by user mode consumer processes to a channel adapter based on virtual address mapping
EP1543658B1 (en) One shot rdma having a 2-bit state

Legal Events

Date Code Title Description
AS Assignment

Owner name: COMERICA BANK, AS AGENT, A TEXAS BANKING ASSOCIATION

Free format text: SECURITY AGREEMENT;ASSIGNOR:SOLARFLARE COMMUNICATIONS, INC., A DELAWARE CORPORATION;REEL/FRAME:027487/0097

Effective date: 20111229

AS Assignment

Owner name: SOLARFLARE COMMUNICATIONS INC., CALIFORNIA

Free format text: CHANGE OF ADDRESS OF THE ASSIGNEE;ASSIGNOR:SOLARFLARE COMMUNICATIONS INC.;REEL/FRAME:029636/0295

Effective date: 20130104

STCF Information on status: patent grant

Free format text: PATENTED CASE

AS Assignment

Owner name: LEVEL 5 NETWORKS, INC., CALIFORNIA

Free format text: ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNORS:POPE, STEVE;RIDDOCH, DAVID;YU, CHING;AND OTHERS;REEL/FRAME:030261/0800

Effective date: 20050422

Owner name: SOLARFLARE COMMUNICATIONS, INC., CALIFORNIA

Free format text: ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNOR:LEVEL 5 NETWORKS, INC.;REEL/FRAME:030261/0284

Effective date: 20080926

AS Assignment

Owner name: SOLARFLARE COMMUNICATIONS, INC., CALIFORNIA

Free format text: RELEASE OF SECURITY INTEREST;ASSIGNOR:COMERICA BANK;REEL/FRAME:033716/0073

Effective date: 20140910

AS Assignment

Owner name: SOLARFLARE COMMUNICATIONS, INC., CALIFORNIA

Free format text: RELEASE BY SECURED PARTY;ASSIGNOR:COMERICA BANK;REEL/FRAME:034255/0140

Effective date: 20141105

AS Assignment

Owner name: PACIFIC WESTERN BANK, NORTH CAROLINA

Free format text: SECURITY INTEREST;ASSIGNOR:SOLARFLARE COMMUNICATIONS, INC.;REEL/FRAME:038363/0589

Effective date: 20160331

FPAY Fee payment

Year of fee payment: 4

AS Assignment

Owner name: ALLY BANK, NEW YORK

Free format text: SECURITY INTEREST;ASSIGNOR:SOLARFLARE COMMUNICATIONS, INC.;REEL/FRAME:041902/0527

Effective date: 20170223

AS Assignment

Owner name: SOLARFLARE COMMUNICATIONS, INC., CALIFORNIA

Free format text: RELEASE BY SECURED PARTY;ASSIGNOR:PACIFIC WESTERN BANK;REEL/FRAME:042820/0890

Effective date: 20170615

AS Assignment

Owner name: SOLARFLARE COMMUNICATIONS, INC., CALIFORNIA

Free format text: RELEASE OF INTELLECTUAL PROPERTY SECURITY AGREEMENT;ASSIGNOR:ALLY BANK;REEL/FRAME:049918/0440

Effective date: 20190730

AS Assignment

Owner name: XILINX, INC., CALIFORNIA

Free format text: ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNOR:SOLARFLARE COMMUNICATIONS, INC.;REEL/FRAME:051108/0209

Effective date: 20191113

FEPP Fee payment procedure

Free format text: ENTITY STATUS SET TO UNDISCOUNTED (ORIGINAL EVENT CODE: BIG.); ENTITY STATUS OF PATENT OWNER: LARGE ENTITY

MAFP Maintenance fee payment

Free format text: PAYMENT OF MAINTENANCE FEE, 8TH YEAR, LARGE ENTITY (ORIGINAL EVENT CODE: M1552); ENTITY STATUS OF PATENT OWNER: LARGE ENTITY

Year of fee payment: 8