blackbird-op-linux - Blackbird™ Linux sources for OpenPOWER

	Commit message (Collapse)	Author	Age	Files	Lines
*	[TCP/DCCP]: Introduce net_xmit_eval	Gerrit Renker	2006-12-02	9	-30/+14
\| \| \| \| \| \| \| \| \| \| \| \|	Throughout the TCP/DCCP (and tunnelling) code, it often happens that the return code of a transmit function needs to be tested against NET_XMIT_CN which is a value that does not indicate a strict error condition. This patch uses a macro for these recurring situations which is consistent with the already existing macro net_xmit_errno, saving on duplicated code. Signed-off-by: Gerrit Renker <gerrit@erg.abdn.ac.uk> Signed-off-by: Arnaldo Carvalho de Melo <acme@mandriva.com>
*	[SCTP]: Cleanup of the sctp state table code.	Peter Zijlstra	2006-12-02	1	-361/+335
\| \| \| \| \| \| \| \| \| \| \| \|	I noticed an insane high density of repeated characters fixable by a simple regular expression: % s/{.fn = \([^,]*\),[[:space:]]\+\(\\\n[[:space:]]\+\)\?.name = "\1"}/TYPE_SCTP_FUNC(\1)/g (NOTE: the .name for .fn = sctp_sf_do_9_2_start_shutdown didn't match) Signed-off-by: Peter Zijlstra <a.p.zijlstra@chello.nl> Signed-off-by: David S. Miller <davem@davemloft.net>
*	[ATM] ambassador,firestream: "-1 >>" is implementation defined	Alexey Dobriyan	2006-12-02	2	-2/+2
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	6.5.7(5): The result of E1 >> E2 is E1 right-shifted E2 bit positions. ... If E1 has a signed type and a negative value, the resulting value is implementation defined. So, cast -1 to unsigned type to make result well-defined. [ Modified to use ~0U based upon recommendation from Al Viro. -DaveM ] Signed-off-by: Alexey Dobriyan <adobriyan@gmail.com> Signed-off-by: David S. Miller <davem@davemloft.net>
*	[SELinux]: Add support for DCCP	James Morris	2006-12-02	6	-8/+103
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	This patch implements SELinux kernel support for DCCP (http://linux-net.osdl.org/index.php/DCCP), which is similar in operation to TCP in terms of connected state between peers. The SELinux support for DCCP is thus modeled on existing handling of TCP. A new DCCP socket class is introduced, to allow protocol differentation. The permissions for this class inherit all of the socket permissions, as well as the current TCP permissions (node_bind, name_bind etc). IPv4 and IPv6 are supported, although labeled networking is not, at this stage. Patches for SELinux userspace are at: http://people.redhat.com/jmorris/selinux/dccp/user/ I've performed some basic testing, and it seems to be working as expected. Adding policy support is similar to TCP, the only real difference being that it's a different protocol. Acked-by: Stephen Smalley <sds@tycho.nsa.gov> Signed-off-by: James Morris <jmorris@namei.org> Signed-off-by: David S. Miller <davem@davemloft.net>
*	[NET]: The scheduled removal of the frame diverter.	Adrian Bunk	2006-12-02	12	-767/+6
\| \| \| \| \| \| \|	This patch contains the scheduled removal of the frame diverter. Signed-off-by: Adrian Bunk <bunk@stusta.de> Signed-off-by: David S. Miller <davem@davemloft.net>
*	[DCCPv6]: Choose a genuine initial sequence number	Gerrit Renker	2006-12-02	3	-20/+23
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	This * resolves a FIXME - DCCPv6 connections started all with an initial sequence number of 1; * provides a redirection `secure_dccpv6_sequence_number' in case the init_sequence_v6 code should be updated later; * concentrates the update of S.GAR into dccp_connect_init(); * removes a duplicate dccp_update_gss() in ipv4.c; * uses inet->dport instead of usin->sin_port, due to the following assignment in dccp_v4_connect(): inet->dport = usin->sin_port; Signed-off-by: Gerrit Renker <gerrit@erg.abdn.ac.uk> Signed-off-by: Arnaldo Carvalho de Melo <acme@mandriva.com>
*	[DCCP]: Remove redundant statements in init_sequence (ISS)	Gerrit Renker	2006-12-02	2	-17/+8
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	This patch removes the following redundancies: 1) The test skb->protocol == htons(ETH_P_IPV6) in dccp_v6_init_sequence is always true since * dccp_v6_conn_request() is the only calling function * dccp_v6_conn_request() redirects all skb's with ETH_P_IP to dccp_v4_conn_request() 2) The first argument, `struct sock *sk', of dccp_v{4,6}_init_sequence() is never used. (This is similar for tcp_v{4,6}_init_sequence, an analogous patch has been submitted to netdev and merged.) By the way - are the `sport' / `dport' arguments in the right order? I have made them consistent among calls but they seem to be in the reverse order. Signed-off-by: Gerrit Renker <gerrit@erg.abdn.ac.uk> Signed-off-by: Arnaldo Carvalho de Melo <acme@mandriva.com>
*	[DCCP]: Remove forward declarations in timer.c	Gerrit Renker	2006-12-02	1	-57/+53
\| \| \| \| \| \| \| \| \|	This removes 3 forward declarations by reordering 2 functions. No code change at all. Signed-off-by: Gerrit Renker <gerrit@erg.abdn.ac.uk> Signed-off-by: Arnaldo Carvalho de Melo <acme@mandriva.com>
*	[DCCP]: Introduce a consistent naming scheme for sysctls	Gerrit Renker	2006-12-02	5	-33/+32
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	In order to make their function clearer and obtain a consistent naming scheme to identify sysctls, all existing DCCP sysctls have been prefixed with `sysctl_dccp', following the same convention as used by TCP. Feature-specific sysctls retain the `feat' in the middle, although the `default' has been dropped, since it is obvious from use. Also removed a duplicate `dccp_feat_default_sequence_window' in ipv4.c. Signed-off-by: Gerrit Renker <gerrit@erg.abdn.ac.uk> Signed-off-by: Arnaldo Carvalho de Melo <acme@mandriva.com>
*	[DCCP]: Add sysctls to control retransmission behaviour	Gerrit Renker	2006-12-02	7	-13/+91
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	This adds 3 sysctls which govern the retransmission behaviour of DCCP control packets (3way handshake, feature negotiation). It removes 4 FIXMEs from the code. The close resemblance of sysctl variables to their TCP analogues is emphasised not only by their name, but also by giving them the same initial values. This is useful since there is not much practical experience with DCCP yet. Furthermore, with regard to the previous patch, it is now possible to limit the number of keepalive-Responses by setting net.dccp.default.request_retries (also a bit like in TCP). Lastly, added documentation of all existing DCCP sysctls. Signed-off-by: Gerrit Renker <gerrit@erg.abdn.ac.uk> Signed-off-by: Arnaldo Carvalho de Melo <acme@mandriva.com>
*	[DCCP]: Increment sequence numbers on retransmitted Response packets	Gerrit Renker	2006-12-02	2	-7/+13
\| \| \| \|	Problem:
*	[DCCP]: Update comments on precisely which packets can be retransmitted	Gerrit Renker	2006-12-02	1	-4/+6
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	This updates program documentation: spell out precise conditions about which packets are eligible for retransmission (which is actually quite hard to extract from RFC 4340). It is based on the following table derived from RFC 4340: +-----------+---------------------------------+---------------------+ \| Type \| Retransmit? \| Remark \| +-----------+---------------------------------+---------------------+ \| Request \| in client-REQUEST state \| sec. 8.1.1 \| \| Response \| NEVER \| SHOULD NOT, 8.1.3 \| \| Data \| NEVER \| unreliable protocol \| \| Ack \| possible in client-PARTOPEN \| sec. 8.1.5 \| \| DataAck \| NEVER \| unreliable protocol \| \| CloseReq \| only in server-CLOSEREQ state \| MUST, sec. 8.3 \| \| Close \| in node-CLOSING state \| MUST, sec. 8.3 \| +-----------+-------------------------------------------------------+ \| Reset \| only in response to other packets \| \| Sync \| only in response to sequence-invalid packets (7.5.4) \| \| SyncAck \| only in response to Sync packets \| +-----------+-------------------------------------------------------+ Hence the only packets eligible for retransmission are: * Requests in client-REQUEST state (sec. 8.1.1) * Acks in client-PARTOPEN state (sec. 8.1.5) * CloseReq in server-CLOSEREQ state (sec. 8.3) * Close in node-CLOSING state (sec. 8.3) I had meant to put in a check for these types too, but have left that for later. Signed-off-by: Gerrit Renker <gerrit@erg.abdn.ac.uk> Signed-off-by: Arnaldo Carvalho de Melo <acme@mandriva.com>
*	[DECNET]: Fix build regressions.	David S. Miller	2006-12-02	2	-8/+2
\| \| \| \| \| \|	Spotted by Arnaldo. Signed-off-by: David S. Miller <davem@davemloft.net>
*	[TCP] htcp: Better packing of struct htcp.	David S. Miller	2006-12-02	1	-2/+2
\| \| \| \| \| \|	Based upon a patch by Joe Perches. Signed-off-by: David S. Miller <davem@davemloft.net>
*	[IPv6]: Only modify checksum for UDP	Brian Haley	2006-12-02	2	-3/+1
\| \| \| \| \| \| \| \|	Only change upper-layer checksum from 0 to 0xFFFF for UDP (as RFC 768 states), not for others as RFC 4443 doesn't require it. Signed-off-by: Brian Haley <brian.haley@hp.com> Signed-off-by: David S. Miller <davem@davemloft.net>
*	[IPv6] rules: Remove bogus tos validation check	Thomas Graf	2006-12-02	1	-2/+1
\| \| \| \| \| \| \| \| \| \|	Noticed by Al Viro: (frh->tos & ~IPV6_FLOWINFO_MASK)) where IPV6_FLOWINFO_MASK is htonl(0xfffffff) and frh->tos is u8, which makes no sense here... Signed-off-by: Thomas Graf <tgraf@suug.ch> Signed-off-by: David S. Miller <davem@davemloft.net>
*	[NETLINK]: Do precise netlink message allocations where possible	Thomas Graf	2006-12-02	21	-107/+233
\| \| \| \| \| \| \| \| \| \| \| \| \|	Account for the netlink message header size directly in nlmsg_new() instead of relying on the caller calculate it correctly. Replaces error handling of message construction functions when constructing notifications with bug traps since a failure implies a bug in calculating the size of the skb. Signed-off-by: Thomas Graf <tgraf@suug.ch> Acked-by: Paul Moore <paul.moore@hp.com> Signed-off-by: David S. Miller <davem@davemloft.net>
*	[TCP]: Remove dead code in init_sequence	Gerrit Renker	2006-12-02	2	-15/+8
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	This removes two redundancies: 1) The test (skb->protocol == htons(ETH_P_IPV6) in tcp_v6_init_sequence() is always true, due to * tcp_v6_conn_request() is the only function calling this one * tcp_v6_conn_request() redirects all skb's with ETH_P_IP protocol to tcp_v4_conn_request() [ cf. top of tcp_v6_conn_request()] 2) The first argument, `struct sock *sk' of tcp_v{4,6}_init_sequence() is never used. Signed-off-by: Gerrit Renker <gerrit@erg.abdn.ac.uk> Signed-off-by: David S. Miller <davem@davemloft.net>
*	[DCCP]: Support for partial checksums (RFC 4340, sec. 9.2)	Gerrit Renker	2006-12-02	7	-97/+173
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	This patch does the following: a) introduces variable-length checksums as specified in [RFC 4340, sec. 9.2] b) provides necessary socket options and documentation as to how to use them c) basic support and infrastructure for the Minimum Checksum Coverage feature [RFC 4340, sec. 9.2.1]: acceptability tests, user notification and user interface In addition, it (1) fixes two bugs in the DCCPv4 checksum computation: * pseudo-header used checksum_len instead of skb->len * incorrect checksum coverage calculation based on dccph_x (2) removes dccp_v4_verify_checksum() since it reduplicates code of the checksum computation; code calling this function is updated accordingly. (3) now uses skb_checksum(), which is safer than checksum_partial() if the sk_buff has is a non-linear buffer (has pages attached to it). (4) fixes an outstanding TODO item: * If P.CsCov is too large for the packet size, drop packet and return. The code has been tested with applications, the latest version of tcpdump now comes with support for partial DCCP checksums. Signed-off-by: Gerrit Renker <gerrit@erg.abdn.ac.uk> Signed-off-by: Arnaldo Carvalho de Melo <acme@mandriva.com>
*	[IPV6]: Per-interface statistics support.	YOSHIFUJI Hideaki	2006-12-02	13	-108/+195
\| \| \| \| \| \|	For IP MIB (RFC4293). Signed-off-by: YOSHIFUJI Hideaki <yoshfuji@linux-ipv6.org>
*	[IPV6]: Introduce ip6_dst_idev() to get inet6_dev{} stored in dst_entry{}.	YOSHIFUJI Hideaki	2006-12-02	3	-2/+7
\| \| \| \| \| \|	Otherwise, we will see a lot of casts... Signed-off-by: YOSHIFUJI Hideaki <yoshfuji@linux-ipv6.org>
*	[IPV6] ROUTE: Use &rt->u.dst instead of cast.	YOSHIFUJI Hideaki	2006-12-02	2	-4/+4
\| \| \| \|	Signed-off-by: YOSHIFUJI Hideaki <yoshfuji@linux-ipv6.org>
*	[IPV6] ROUTE: Use macros to format /proc/net/ipv6_route.	YOSHIFUJI Hideaki	2006-12-02	1	-24/+13
\| \| \| \|	Signed-off-by: YOSHIFUJI Hideaki <yoshfuji@linux-ipv6.org>
*	[DCCP]: Update code comments for Step 2/3	Gerrit Renker	2006-12-02	4	-51/+68
\| \| \| \| \| \| \| \| \|	Sorts out the comments for processing steps 2,3 in section 8.5 of RFC 4340. All comments have been updated against this document, and the reference to step 2 has been made consistent throughout the files. Signed-off-by: Gerrit Renker <gerrit@erg.abdn.ac.uk> Signed-off-by: Arnaldo Carvalho de Melo <acme@mandriva.com>
*	[DCCP]: tidy up dccp_v{4,6}_conn_request	Gerrit Renker	2006-12-02	4	-22/+16
\| \| \| \| \| \| \|	This is a code simplification to remove reduplicated code by concentrating and abstracting shared code. Detailed Changes:
*	[DCCP]: Fix logfile overflow	Ian McDonald	2006-12-02	2	-3/+7
\| \| \| \| \| \| \| \| \| \| \| \|	This patch fixes data being spewed into the logs continually. As the code stood if there was a large queue and long delays timeo would go down to zero and never get reset. This fixes it by resetting timeo. Put constant into header as well. Signed-off-by: Ian McDonald <ian.mcdonald@jandi.co.nz> Signed-off-by: Gerrit Renker <gerrit@erg.abdn.ac.uk> Signed-off-by: Arnaldo Carvalho de Melo <acme@mandriva.com>
*	[DCCP]: Fix DCCP Probe Typo	Ian McDonald	2006-12-02	1	-1/+1
\| \| \| \| \| \| \| \| \|	Fixes a typo in Kconfig, patch is by Ian McDonald and is re-sent from http://www.mail-archive.com/dccp@vger.kernel.org/msg00579.html Signed-off-by: Ian McDonald <ian.mcdonald@jandi.co.nz> Signed-off-by: Gerrit Renker <gerrit@erg.abdn.ac.uk> Signed-off-by: Arnaldo Carvalho de Melo <acme@mandriva.com>
*	[DCCPv6]: remove forward declarations in ipv6.c	Gerrit Renker	2006-12-02	1	-195/+190
\| \| \| \| \| \| \| \| \| \| \|	This does the same for ipv6.c as the preceding one does for ipv4.c: Only the inet_connection_sock_af_ops forward declarations remain, since at least dccp_ipv6_mapped has a circular dependency to dccp_v6_request_recv_sock. No code change, merely re-ordering. Signed-off-by: Gerrit Renker <gerrit@erg.abdn.ac.uk> Signed-off-by: Arnaldo Carvalho de Melo <acme@mandriva.com>
*	[DCCPv4]: remove forward declarations in ipv4.c	Gerrit Renker	2006-12-02	1	-134/+132
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	This relates to Arnaldo's announcement in http://www.mail-archive.com/dccp@vger.kernel.org/msg00604.html Originally this had been part of the Oops fix and is a revised variant of http://www.mail-archive.com/dccp@vger.kernel.org/msg00598.html No code change, merely reshuffling, with the particular objective of having all request_sock_ops close(r) together for more clarity. Signed-off-by: Gerrit Renker <gerrit@erg.abdn.ac.uk> Signed-off-by: Arnaldo Carvalho de Melo <acme@mandriva.com>
*	[DCCP]: calling dccp_v{4,6}_reqsk_send_ack is a BUG	Gerrit Renker	2006-12-02	4	-103/+14
\| \| \| \| \| \| \| \| \| \|	This patch removes two functions, the send_ack functions of request_sock, which are not called/used by the DCCP code. It is correct that these functions are not called, below is a justification why calling these functions (on a passive socket in the LISTEN/RESPOND state) would mean a DCCP protocol violation. A) Background: using request_sock in TCP:
*	[DCCP] timewait: Remove leftover extern declarations	Arnaldo Carvalho de Melo	2006-12-02	1	-2/+0
\| \| \| \| \| \| \| \| \|	Gerrit Renker noticed dccp_tw_deschedule and submitted a patch with a FIXME, but as he suggests in the same patch the best thing is to just ditch this declaration, while doing that also noticed that tcp_tw_count is as well not defined anywhere, so ditch it too. Signed-off-by: Arnaldo Carvalho de Melo <acme@mandriva.com>
*	[DCCP]: Simplify jump labels in dccp_v{4,6}_rcv	Gerrit Renker	2006-12-02	2	-23/+14
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	This is a code simplification and was singled out from the DCCPv6 Oops patch on http://www.mail-archive.com/dccp@vger.kernel.org/msg00600.html It mainly makes the code consistent between ipv{4,6}.c for the functions dccp_v4_rcv dccp_v6_rcv and removes the do_time_wait label to simplify code somewhat. Commiter note: fixed up a compile problem, trivial. Signed-off-by: Gerrit Renker <gerrit@erg.abdn.ac.uk> Signed-off-by: Arnaldo Carvalho de Melo <acme@mandriva.com>
*	[DCCP]: Combine allocating & zeroing header space on skb	Gerrit Renker	2006-12-02	4	-24/+14
\| \| \| \| \| \| \| \| \| \| \| \|	This is a code simplification: it combines three often recurring operations into one inline function, * allocate `len' bytes header space in skb * fill these `len' bytes with zeroes * cast the start of this header space as dccp_hdr Signed-off-by: Gerrit Renker <gerrit@erg.abdn.ac.uk> Signed-off-by: Arnaldo Carvalho de Melo <acme@mandriva.com>
*	[DCCPv6]: Add a FIXME for missing IPV6_PKTOPTIONS	Gerrit Renker	2006-12-02	1	-2/+6
\| \| \| \| \| \| \| \| \| \| \| \|	This refers to the possible memory leak pointed out in http://www.mail-archive.com/dccp@vger.kernel.org/msg00574.html, fixed by David Miller in http://www.mail-archive.com/netdev@vger.kernel.org/msg24881.html and adds a FIXME to point out where code is missing. Signed-off-by: Gerrit Renker <gerrit@erg.abdn.ac.uk> Signed-off-by: Arnaldo Carvalho de Melo <acme@mandriva.com>
*	[DCCP]: set safe upper bound for option length	Gerrit Renker	2006-12-02	1	-6/+12
\| \| \| \| \| \| \| \| \| \| \|	This is a re-send from http://www.mail-archive.com/dccp@vger.kernel.org/msg00553.html It is the same patch as before, but I have built in Arnaldo's suggestions pointed out in that posting. Signed-off-by: Gerrit Renker <gerrit@erg.abdn.ac.uk> Signed-off-by: Arnaldo Carvalho de Melo <acme@mandriva.com>
*	[TCP]: Don't set SKB owner in tcp_transmit_skb().	David S. Miller	2006-12-02	8	-12/+10
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	The data itself is already charged to the SKB, doing the skb_set_owner_w() just generates a lot of noise and extra atomics we don't really need. Lmbench improvements on lat_tcp are minimal: before: TCP latency using localhost: 23.2701 microseconds TCP latency using localhost: 23.1994 microseconds TCP latency using localhost: 23.2257 microseconds after: TCP latency using localhost: 22.8380 microseconds TCP latency using localhost: 22.9465 microseconds TCP latency using localhost: 22.8462 microseconds Signed-off-by: David S. Miller <davem@davemloft.net>
*	[NET] ip-sysctl.txt: Alphabetize.	Stephen Hemminger	2006-12-02	1	-178/+177
\| \| \| \| \| \| \|	Rearrange TCP entries in alpha order. Signed-off-by: Stephen Hemminger <shemminger@osdl.org> Signed-off-by: David S. Miller <davem@davemloft.net>
*	[TCP]: Allow autoloading of congestion control via setsockopt.	Stephen Hemminger	2006-12-02	1	-1/+11
\| \| \| \| \| \| \| \|	If user has permision to load modules, then autoload then attempt autoload of TCP congestion module. Signed-off-by: Stephen Hemminger <shemminger@osdl.org> Signed-off-by: David S. Miller <davem@davemloft.net>
*	[TCP]: Restrict congestion control choices.	Stephen Hemminger	2006-12-02	5	-0/+125
\| \| \| \| \| \| \| \| \| \| \| \| \|	Allow normal users to only choose among a restricted set of congestion control choices. The default is reno and what ever has been configured as default. But the policy can be changed by administrator at any time. For example, to allow any choice: cp /proc/sys/net/ipv4/tcp_available_congestion_control \ /proc/sys/net/ipv4/tcp_allowed_congestion_control Signed-off-by: Stephen Hemminger <shemminger@osdl.org> Signed-off-by: David S. Miller <davem@davemloft.net>
*	[TCP]: Add tcp_available_congestion_control sysctl.	Stephen Hemminger	2006-12-02	5	-0/+51
\| \| \| \| \| \| \| \|	Create /proc/sys/net/ipv4/tcp_available_congestion_control that reflects currently available TCP choices. Signed-off-by: Stephen Hemminger <shemminger@osdl.org> Signed-off-by: David S. Miller <davem@davemloft.net>
*	[SCTP]: Fix warning	Vlad Yasevich	2006-12-02	2	-1/+8
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	An alternate solution would be to make the digest a pointer, allocate it in sctp_endpoint_init() and free it in sctp_endpoint_destroy(). I guess I should have originally done it this way... CC [M] net/sctp/sm_make_chunk.o net/sctp/sm_make_chunk.c: In function 'sctp_unpack_cookie': net/sctp/sm_make_chunk.c:1358: warning: initialization discards qualifiers from pointer target type The reason is that sctp_unpack_cookie() takes a const struct sctp_endpoint and modifies the digest in it (digest being embedded in the struct, not a pointer). Make digest a pointer to fix this warning. Signed-off-by: Vlad Yasevich <vladislav.yasevich@hp.com> Acked-by: Sridhar Samudrala <sri@us.ibm.com> Signed-off-by: David S. Miller <davem@davemloft.net>
*	[IPV6] tcp: Fix typo _read_mostly --> __read_mostly.	David S. Miller	2006-12-02	1	-1/+1
\| \| \| \|	Signed-off-by: David S. Miller <davem@davemloft.net>
*	[DCCP]: Fix typo _read_mostly --> __read_mostly.	David S. Miller	2006-12-02	1	-1/+1
\| \| \| \|	Signed-off-by: David S. Miller <davem@davemloft.net>
*	[NET]: Size listen hash tables using backlog hint	Eric Dumazet	2006-12-02	9	-25/+39
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	We currently allocate a fixed size (TCP_SYNQ_HSIZE=512) slots hash table for each LISTEN socket, regardless of various parameters (listen backlog for example) On x86_64, this means order-1 allocations (might fail), even for 'small' sockets, expecting few connections. On the contrary, a huge server wanting a backlog of 50000 is slowed down a bit because of this fixed limit. This patch makes the sizing of listen hash table a dynamic parameter, depending of : - net.core.somaxconn tunable (default is 128) - net.ipv4.tcp_max_syn_backlog tunable (default : 256, 1024 or 128) - backlog value given by user application (2nd parameter of listen()) For large allocations (bigger than PAGE_SIZE), we use vmalloc() instead of kmalloc(). We still limit memory allocation with the two existing tunables (somaxconn & tcp_max_syn_backlog). So for standard setups, this patch actually reduce RAM usage. Signed-off-by: Eric Dumazet <dada1@cosmosbay.com> Signed-off-by: David S. Miller <davem@davemloft.net>
*	[PKT_SCHED]: Make sch_fifo.o available when CONFIG_NET_SCHED is not set.	David Kimdon	2006-12-02	2	-1/+6
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Based on patch by Patrick McHardy. Add a new option, NET_SCH_FIFO, which provides a simple fifo qdisc without requiring CONFIG_NET_SCHED. The d80211 stack needs a generic fifo qdisc for WME. At present it uses net/d80211/fifo_qdisc.c which is functionally equivalent to sch_fifo.c. This patch will allow the d80211 stack to remove net/d80211/fifo_qdisc.c and use sch_fifo.c instead. Signed-off-by: David Kimdon <david.kimdon@devicescape.com> Signed-off-by: David S. Miller <davem@davemloft.net>
*	[NET] rules: Add support to invert selectors	Thomas Graf	2006-12-02	2	-7/+18
\| \| \| \| \| \| \| \|	Introduces a new flag FIB_RULE_INVERT causing rules to apply if the specified selector doesn't match. Signed-off-by: Thomas Graf <tgraf@suug.ch> Signed-off-by: David S. Miller <davem@davemloft.net>
*	[NET] rules: Share common attribute validation policy	Thomas Graf	2006-12-02	4	-15/+10
\| \| \| \| \| \| \| \|	Move the attribute policy for the non-specific attributes into net/fib_rules.h and include it in the respective protocols. Signed-off-by: Thomas Graf <tgraf@suug.ch> Signed-off-by: David S. Miller <davem@davemloft.net>
*	[NET] rules: Protocol independant mark selector	Thomas Graf	2006-12-02	6	-89/+32
\| \| \| \| \| \| \| \|	Move mark selector currently implemented per protocol into the protocol independant part. Signed-off-by: Thomas Graf <tgraf@suug.ch> Signed-off-by: David S. Miller <davem@davemloft.net>
*	[IPV4] nl_fib_lookup: Rename fl_fwmark to fl_mark	Thomas Graf	2006-12-02	2	-2/+2
\| \| \| \| \| \| \|	For the sake of consistency. Signed-off-by: Thomas Graf <tgraf@suug.ch> Signed-off-by: David S. Miller <davem@davemloft.net>
*	[NET]: Rethink mark field in struct flowi	Thomas Graf	2006-12-02	14	-121/+26
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Now that all protocols have been made aware of the mark field it can be moved out of the union thus simplyfing its usage. The config options in the IPv4/IPv6/DECnet subsystems to enable respectively disable mark based routing only obfuscate the code with ifdefs, the cost for the additional comparison in the flow key is insignificant, and most distributions have all these options enabled by default anyway. Therefore it makes sense to remove the config options and enable mark based routing by default. Signed-off-by: Thomas Graf <tgraf@suug.ch> Signed-off-by: David S. Miller <davem@davemloft.net>