Difference between revisions of "Portal:DeveloperDocs/set internals"

From nftables wiki
Jump to navigation Jump to search
(→‎Available nft_set_types: Added notes about algorithm for choosing nft_set_type.)
(→‎Hash implementations: Added jhash, rhashtable links.)
 
(10 intermediate revisions by the same user not shown)
Line 1: Line 1:
The nftables generalized set infrastructure includes multiple set implementations. ''nft_select_set_ops()'' in [https://git.kernel.org/pub/scm/linux/kernel/git/pablo/nf-next.git/tree/net/netfilter/nf_tables_api.c nf_tables_api.c] chooses the implementation depending on required set features and operations, and on estimated lookup time and memory requirements in combination with user-specified set policy (''performance'' or ''memory'').
'''Disclaimer! This page is incomplete and likely to contain mistakes. Please make corrections directly, and/or send them to netfilter list. Thanks!'''
 
The nftables generalized set infrastructure includes multiple set implementations. The implementation chosen for a given set depends on required set features and operations, and on estimated element lookup time and set memory requirements.


== Available nft_set_types ==
== Available nft_set_types ==
Line 8: Line 10:
! rowspan="2" | ''nft_set_types[]'' order
! rowspan="2" | ''nft_set_types[]'' order
! colspan="2" | ''nft_set_estimate NFT_SET_CLASS_[order]''
! colspan="2" | ''nft_set_estimate NFT_SET_CLASS_[order]''
! rowspan="2" | # Concatenated fields
! rowspan="2" | # ''klen'' restrictions
! rowspan="2" | Must specify size
! rowspan="2" | ''NFT_SET_INTERVAL''
! rowspan="2" | ''NFT_SET_INTERVAL''
! rowspan="2" | ''NFT_SET_MAP''
! rowspan="2" | ''NFT_SET_MAP''
Line 13: Line 18:
! rowspan="2" | ''NFT_SET_OBJECT''
! rowspan="2" | ''NFT_SET_OBJECT''
! rowspan="2" | ''NFT_SET_EVAL''
! rowspan="2" | ''NFT_SET_EVAL''
! rowspan="2" | Expression support
! rowspan="2" | Notes
! rowspan="2" | Notes


Line 24: Line 30:
| ''O_1''
| ''O_1''
| ''O_N''
| ''O_N''
|
| {{partial|!= 4}}
| {{no|Yes}}
| {{no}}
| {{no}}
| {{yes}}
| {{yes}}
Line 29: Line 38:
| {{yes}}
| {{yes}}
| {{no}}
| {{no}}
| {{yes}}
|  
|  


Line 36: Line 46:
| ''O_1''
| ''O_1''
| ''O_N''
| ''O_N''
|
| {{partial|!= 4}}
| {{no|Yes}}
| {{no}}
| {{no}}
| {{yes}}
| {{yes}}
Line 41: Line 54:
| {{yes}}
| {{yes}}
| {{no}}
| {{no}}
| {{yes}}
|  
|  


Line 48: Line 62:
| ''O_1''
| ''O_1''
| ''O_N''
| ''O_N''
|
| {{yes|<&#61;&nbsp;255}}
| {{partial|If eval path updates}}
| {{no}}
| {{no}}
| {{yes}}
| {{yes}}
| {{yes}}
| {{yes}}
| {{yes}}
Line 60: Line 78:
| ''O_1''
| ''O_1''
| ''O_1''
| ''O_1''
|
| {{no|<&#61;&nbsp;2}}
| {{yes|No}}
| {{no}}
| {{no}}
| {{no}}
| {{no}}
| {{no}}
Line 72: Line 94:
| ''O_LOG_N''
| ''O_LOG_N''
| ''O_N''
| ''O_N''
| {{partial|<&#61;&nbsp;1}}
| {{yes|<&#61;&nbsp;255}}
| {{yes|No}}
| {{yes}}
| {{yes}}
| {{yes}}
| {{yes}}
Line 77: Line 102:
| {{yes}}
| {{yes}}
| {{no}}
| {{no}}
|  
| {{yes}}
|


|-
|-
Line 84: Line 110:
| ''O_LOG_N''
| ''O_LOG_N''
| ''O_N''
| ''O_N''
| {{yes|>&#61;&nbsp;2}}
| {{yes|<&#61;&nbsp;255}}
| {{yes|No}}
| {{partial|Mandatory}}
| {{yes}}
| {{yes}}
| {{yes}}
| {{yes}}
| {{yes}}
| {{yes}}
| {{no}}
| {{yes}}
| {{yes}}
| {{no}}
|  
|  


Line 96: Line 126:
| ''O_LOG_N''
| ''O_LOG_N''
| ''O_N''
| ''O_N''
| {{yes|>&#61;&nbsp;2}}
| {{yes|<&#61;&nbsp;255}}
| {{yes|No}}
| {{partial|Mandatory}}
| {{yes}}
| {{yes}}
| {{yes}}
| {{yes}}
| {{yes}}
| {{yes}}
| {{no}}
| {{yes}}
| {{yes}}
| {{no}}
|  
|  


|}
|}


* ''klen'' is key length in bytes.
* ''nft_set_estimate'' ''.lookup'' and ''.space'' are in terms of enum ''nft_set_class'', defined in [https://git.kernel.org/pub/scm/linux/kernel/git/pablo/nf-next.git/tree/include/net/netfilter/nf_tables.h nf_tables.h]:
* ''nft_set_estimate'' ''.lookup'' and ''.space'' are in terms of enum ''nft_set_class'', defined in [https://git.kernel.org/pub/scm/linux/kernel/git/pablo/nf-next.git/tree/include/net/netfilter/nf_tables.h nf_tables.h]:
<source>
<source>
Line 117: Line 152:


== Hash implementations ==
== Hash implementations ==
[https://git.kernel.org/pub/scm/linux/kernel/git/pablo/nf-next.git/tree/net/netfilter/nft_set_hash.c nft_set_hash.c]
* [https://git.kernel.org/pub/scm/linux/kernel/git/pablo/nf-next.git/tree/net/netfilter/nft_set_hash.c nft_set_hash.c]
* [https://git.kernel.org/pub/scm/linux/kernel/git/pablo/nf-next.git/tree/include/linux/jhash.h jhash.h]
* [https://git.kernel.org/pub/scm/linux/kernel/git/pablo/nf-next.git/tree/include/linux/rhashtable-types.h rhashtable-types.h]
* [https://git.kernel.org/pub/scm/linux/kernel/git/pablo/nf-next.git/tree/include/linux/rhashtable.h rhashtable.h]


== Bitmap implementation ==
== Bitmap implementation ==

Latest revision as of 04:30, 8 March 2021

Disclaimer! This page is incomplete and likely to contain mistakes. Please make corrections directly, and/or send them to netfilter list. Thanks!

The nftables generalized set infrastructure includes multiple set implementations. The implementation chosen for a given set depends on required set features and operations, and on estimated element lookup time and set memory requirements.

Available nft_set_types

nft_set_type nft_set_types[] order nft_set_estimate NFT_SET_CLASS_[order] # Concatenated fields # klen restrictions Must specify size NFT_SET_INTERVAL NFT_SET_MAP NFT_SET_TIMEOUT NFT_SET_OBJECT NFT_SET_EVAL Expression support Notes
.lookup .space
nft_set_hash_fast_type 0 O_1 O_N != 4 Yes No Yes No Yes No Yes
nft_set_hash_type 1 O_1 O_N != 4 Yes No Yes No Yes No Yes
nft_set_rhash_type 2 O_1 O_N <= 255 If eval path updates No Yes Yes Yes Yes Yes
nft_set_bitmap_type 3 O_1 O_1 <= 2 No No No No No No No
nft_set_rbtree_type 4 O_LOG_N O_N <= 1 <= 255 No Yes Yes Yes Yes No Yes
nft_set_pipapo_avx2_type 5 O_LOG_N O_N >= 2 <= 255 No Mandatory Yes Yes Yes No Yes
nft_set_pipapo_type 6 O_LOG_N O_N >= 2 <= 255 No Mandatory Yes Yes Yes No Yes
  • klen is key length in bytes.
  • nft_set_estimate .lookup and .space are in terms of enum nft_set_class, defined in nf_tables.h:
enum nft_set_class {
	NFT_SET_CLASS_O_1,
	NFT_SET_CLASS_O_LOG_N,
	NFT_SET_CLASS_O_N,
};
  • nft_select_set_ops() in nf_tables_api.c: chooses which nft_set_type to use. For sets with default performance policy it chooses lower .lookup; for sets with memory policy it chooses lower .space.
  • When choosing between two nft_set_types with the same .lookup and .space, nft_select_set_ops() chooses the type that appears first in nft_set_types[].

Hash implementations

Bitmap implementation

nft_set_bitmap.c - contains good documentation

Red-black tree implementation

nft_set_rbtree.c

PIPAPO implementations

PIPAPO is loosely inspired by the Grouper network packet classification algorithm.