first draft of rate limit API #2028
base: master
Conversation
Signed-off-by: Kuat Yessenov <[email protected]>
Signed-off-by: Kuat Yessenov <[email protected]>
@@ -24,7 +24,8 @@ buf generate \
     --path analysis \
     --path authentication \
     --path meta \
-    --path telemetry
+    --path telemetry \
+    --path policy
Maybe a bit of a nitpick, but do we want a new group? "policy" feels very abstract - everything is policy. Maybe we should just put it in the networking group?
+1 - we already had security use the broad 'policy'; let's keep it in networking (it's not telemetry).
Everything is networking in Istio. I don't like policy either, but please not networking. Open to suggestions.
I'm not aware of any good reason to not be in networking. It's not about like or not like here - it's not security nor telemetry, and I don't think we're planning to create a 'rate limiting' WG or API group.
I stated one good reason - this is not a universal rate limiting API. This is addressing a specific need for Istio users who do not want to use EnvoyFilters. It is not a core networking API.
If we're rate limiting the use of features on the network, I'd be inclined to keep it in networking.
// The behaviour in case the rate limiting service does not respond back.
// When it is set to true, the proxy will not allow traffic in case of
// communication failure between rate limiting service and the proxy.
bool failure_mode_deny = 4;
I think this may be more natural as an enum. This would match k8s webhooks, as prior art. Maybe we can change this to `failurePolicy: {Fail, Ignore}` to exactly match their API. `failureModeDeny: false` feels awkward.
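For comparison, a sketch of the two shapes under discussion (both field names are suggestions from this thread, not the draft's final API):

```yaml
# k8s-webhook-style enum, as suggested above:
failurePolicy: Fail     # or Ignore

# boolean, as in the current draft:
failureModeDeny: false  # i.e. fail open
```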
Maybe `fail_open`?
Matching webhooks is good. Avoid booleans - they are not nice when merging, nor extensible.
Enums should have a default UNKNOWN, forcing users to think about which way to go. We want false to be used in most cases (fail open), so a boolean matches that better. Using an enum is inconsistent with the `fail_open` field in the authorization provider, which has the inverted semantics.
We have used `fail_open` in extauthz, and I don't think there is normally a third option. When there are only two options possible, bool is recommended per https://cloud.google.com/apis/design/design_patterns#bool_vs_enum_vs_string
I don't think 'forcing users' is a good strategy. We should have reasonable defaults; it's part of the job of defining an API. The problem is we are not designing a gRPC or imperative API - but a CR that will be used with YAML and may need merging semantics. The API design patterns are great for gRPC/imperative APIs - but not a perfect match for YAML.

But as I mentioned in a few other comments - and I'll keep asking for this - let's look at patterns used by other APIs doing rate limiting, and try to keep it similar.
I agree `failureModeDeny: false` seems odd. How about `denyOnFailed: false` or `denyWhenFailed: false`? It would be good to document that false is the default (assuming it is).
// Optional. Specifies a set of descriptors that are evaluated against the
// rate limit policy. The rate limit applies if any of the generated
// descriptors trigger a limit.
repeated RateLimitDescriptor descriptors = 3;
The word "descriptor" feels very odd to me. I know it's what Envoy uses, but we are not tied to Envoy at all. Is there a more fitting description if we forget about Envoy-isms? I noticed nginx and haproxy don't have any "descriptor" concept.
Is there an RFC that lists the APIs used by nginx/haproxy/etc.? We should look for commonality.
We'd be aiming to integrate Envoy rate limit service implementations (derivations of https://github.com/envoyproxy/ratelimit) which will use this terminology. It would be confusing to call it something else when performing the integration.
Isn't that just one of the implementations that will be supported? Maybe the only one for now, but I thought I had seen mentions of other backends? If it is just Envoy then I agree.
This version of the global rate limit is just Envoy. We can add other policies later. Maybe it's better to be more specific, but nothing else uses "envoy" as a qualifier in providers.
HAProxy has a rate limiting DSL; nginx is simpler, but all of them have a notion of a domain, which is a set of related counters.

Re: Envoy-specific, it is an API that anyone can implement. However, we don't want too many client-side specifics if we can avoid it.
It would be great to have a list of services (including managed ones) and APIs - and what notions they have. If the notion of a domain is common - I'm all for it.

The model in K8S Gateway is to identify 'core' features that can be adopted by multiple vendors (both server and client) - and keep the rest optional. I would like to focus first on finding the 'core' - the simplest viable API that can have more than one implementation. We can follow up with extensions, but the API as specified in this PR is far too large, and claiming that anyone can implement Envoy XDS and API is not realistic.
This API is not trying to define a universal cross-vendor rate limiting API. I don't think it's realistic to expect that without involvement of other vendors (haproxy/nginx). This is the reason why I don't want to combine this with the "networking" group: it's not really a core API. It is more similar to the Wasm API, where implementations may choose to implement it or not without losing core functions of a mesh. Right now, there is a clear demand for a first-class, large API for rate limiting, not an extension, because doing the same with EnvoyFilter is extremely error prone, given the complexity of the concepts. Trying to make this into some core API will either not address users' needs or leave too many details opaque, making it no better than EnvoyFilter.
But it doesn't appear this API is better than EnvoyFilter. At least with EnvoyFilter you have the Envoy API and you can directly apply and map to Envoy. We can make it easier to apply the EnvoyFilter - but having a mangled EnvoyFilter provided as an Istio API serves neither people who use Istio APIs nor those who use Envoy APIs. Same complexity and even more 'error prone' - since you also need to guess how this API maps to the EnvoyFilter.

If it was implemented as WASM - I suspect it wouldn't be a spaghetti module with all concepts mixed in, but probably one WASM module focused on local rate limits with a smaller API, etc.

By vendor I also mean Redis, Google, AWS, and others who may have quota servers. And we don't really need 'involvement' from haproxy/nginx - just to understand their capabilities and APIs, and be able to answer the question 'why does our core API need to be more complicated'.
@costinm There is a difference between this API and the runtime API that proxies use. For runtime APIs, my understanding is that Google would like to use RLS; gRPC as well. I can't speak for AWS or Redis, and we can only speculate about whether RLS is a good fit without them at the table.

Now, if we agree RLS is an appropriate runtime API, we need a way to enable its usage in Istio. Because RLS uses "descriptors" as the core concept, it's only appropriate to use the same nomenclature in the Istio API. The choices we have are the binding and the methods of computing descriptors. I'd like to focus on those two concepts instead of questioning why we need descriptors.
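For readers unfamiliar with RLS, a minimal sketch of the server-side configuration used by the reference implementation (https://github.com/envoyproxy/ratelimit); the domain name and key/value pairs here are illustrative:

```yaml
# Counters are grouped under a domain and keyed by the descriptor
# entries the proxy sends with each ShouldRateLimit call.
domain: ratings
descriptors:
  - key: PATH
    value: /api/ratings
    rate_limit:
      unit: minute
      requests_per_unit: 100
```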
// Optional. The workload selector decides where to apply the server-side rate limit in the same namespace.
istio.type.v1beta1.WorkloadSelector workload = 1;
// Optional. The route selector decides which client route rules enforce the client-side rate limit.
RouteSelector route = 2;
I think this may be required to be called `targetRef` to align with https://docs.google.com/document/d/13fyptUtO9NV_ZAgkoJlfukcBf2PVGhsKWG37yLkppJo/edit?resourcekey=0-Urhtj9gBkGBkSL1gHgbWKw
Also, it would be good to make sure that doc is approved before we approve this; otherwise we will have some problems.
We had a lot of confusion with server side vs client side - I think it would be best to not make it explicit. The rate limit defines the user intent - we may implement it in different ways (including in middle boxes), and we may need to implement it for non-Envoy cases.
Isn't client vs server side not an implementation detail but an extremely important aspect? Both are valid for different use cases, but it's not a transparent decision.
IIUC, this cannot rate-limit the route below, as it has two route destinations:

apiVersion: networking.istio.io/v1beta1
kind: VirtualService
metadata:
  name: reviews-route-two-domains
spec:
  hosts:
  - reviews.com
  http:
  - route:
    - destination:
        host: dev.reviews.com
      weight: 25
    - destination:
        host: reviews.com
      weight: 75
I think we can't do that, given how we translate to Envoy xDS. You would need a separate virtual service for dev.reviews.com.
Regarding server/client side: my point was that we normally decide where to apply by selector (i.e. if the CR is applied to a workload, it is typically a server-side API, etc.).

For 'client side' APIs - we do have a lot of issues, including 'who decides' (producer or consumer), etc.

I would focus first on defining a clean and simple API that solves one specific use case - for example, a global rate limit applied on a gateway or server (a very typical use case and IMO the most useful).

Like Telemetry, we can have a number of different, smaller APIs that are easier to understand and implement with multiple vendors - and for advanced cases either use EnvoyFilter or have extension APIs that are really Envoy APIs in disguise. But we do need the 'common APIs' clearly separated.
I am fine leaving client-side specifically for gateways. The Istio APIs do not differentiate between virtual services for gateways vs virtual services for sidecars, so it's not possible to define it as such.
K8S Gateway API will apply to sidecars as well, and the goal is not to differentiate, but to simplify the API, focus on the most important and common use cases - and make it as simple as possible.

Enforcing a rate limit on each client side is not really possible - and we already have DestinationRule `maxConnections`, etc. We can extend DestinationRule with more un-enforceable rate limiting. Having an ingress gateway - or the backend's sidecar - protect the application with an enforced rate limit is far more common and important.

@mandarjog - for a rate limit applied on a backend, the outcome is not only 'fail'/'allow' - but can also be 'spill over to other backends' (we don't support this yet, but it can be quite valuable and can be implemented).

Rate limits on an ingress using a quota/RL service are likely the most common case (for RL applied to the backend, the main purpose is to protect the app behind the sidecar, which can't take more than x qps - so an RL service doesn't make a lot of sense for the common use case).
@kyessenov does this API allow users to configure rate limits from a namespace (as the clients)? It seems to be only at the client level, which could be too detailed, as users may have v1/v2/canary and other services in that namespace which are clients.
// +genclient
// +k8s:deepcopy-gen=true
// -->
message RateLimit {
This could use some more YAML examples - both e2e examples at the top level and for each section.
Ack. I'd like to wait for the initial bike-shedding to settle, since it is a lot of work to rewrite samples.
Could we add some initial examples? It is easier for our users to envision how the API works with examples.
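As a strawman, an end-to-end RateLimit resource might look roughly like this; the group/version and the exact nesting of fields are guesses assembled from the proto fragments quoted in this review, not the final API:

```yaml
apiVersion: policy.istio.io/v1alpha1   # assumed from the proto package
kind: RateLimit
metadata:
  name: ratings-rate-limit
  namespace: default
spec:
  # binding: where the limit is enforced (a server-side workload here)
  workload:
    matchLabels:
      app: ratings
  # local limit enforced by the proxy itself (field nesting is a guess)
  local:
    tokenBucket:
      maxTokens: 100
      tokensPerFill: 100
      fillInterval: 1s
```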
// processed by this filter. Each request processed by the filter consumes a
// single token. If the token is available, the request will be allowed. If
// no tokens are available, the proxy responds with 429.
TokenBucket token_bucket = 1;
This seems a bit complex and tied to a single implementation. Does this get us any benefits that could not be expressed by something simpler like `limit: 10 QPS` or something? Looking around at other proxies, they seem to offer a similar simple API.
Pure QPS cannot express the token bucket algorithm.
Token buckets have subtle behavioral differences. I think this is mostly to be explicit rather than let the user second-guess the underlying behavior.
Sounds like a perfect candidate to not include in the core API - not until we have clear evidence it can have more than one implementation and clear semantics.
Token buckets are much easier to understand and re-implement than arbitrary QPS semantics. I do not understand the point about clear evidence.
k8s or golang do not deal with the same performance envelope as Envoy. Envoy can go up to 1 million QPS, where even microseconds of CPU time matter. We don't have enough evidence to know what a good fill interval is for Istio, so instead of trying to hard-code it now, it's better to leave it open.
If we can't figure it out, how can a user be expected to?
+1 - and even if there was a case for 'expert' users to fine-tune, that doesn't need to be in the same API and confuse the 99.9% of users that just understand "rps: 10". Advanced users that understand Envoy enough to configure this stuff can use EnvoyFilter, and if needed we can provide a second 'advanced' API or a 'blessed' filter.

Can we start again, with the common case first - 'requests per second: 10' attached just to a backend and hostname (per-route can be added later as well)? There is a huge difference between a real API intended for users, and calling a particular implementation with all the internals 'the API'.
Token buckets are a standard concept in rate limiting. See https://cloud.google.com/architecture/rate-limiting-strategies-techniques#techniques-enforcing-rate-limits, for example. Would wrapping this into an "algorithm" proto with a sole field address the concern?
QPS as a single rate limit spec is going to be confusing. None of the techniques achieves perfect QPS, and we can go over the limit with bursty traffic. Do I need to remind you about the rate limit Mixer test with a sliding window not hitting the rate limit from time to time?
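As a rough illustration of the difference: '10 rps' alone does not say how bursts are handled, while a token bucket makes it explicit. A sketch using the draft's TokenBucket fields (the values are illustrative):

```yaml
tokenBucket:
  maxTokens: 20       # burst capacity: up to 20 requests back to back
  tokensPerFill: 10   # tokens added on each fill
  fillInterval: 1s    # so the sustained rate is ~10 rps
```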
// no tokens are available, the proxy responds with 429.
TokenBucket token_bucket = 1;

// Per-descriptor local rate limit override.
I don't understand what this means. Can you clarify a bit in the comments how this is supposed to be used?
Unfortunately, the RLS documentation has to be read fully to understand this, in the absence of detailed examples.
Another 'smell' that it doesn't belong in the core API. Can we have a simple API focused on 'local rate limits' - and another simple one focused only on 'remote'?
The descriptors are shared between the two. Quota-based rate limiting is coming soon, which will mix the two, so it's not good to separate them.
- Is the rate applied on the server side, or the client side?
- MeshConfig is intended for istiod configurations that are global. Pretty much everything in it needs to move - and a feature would need a very strong reason why it must be set for all the workloads in the mesh.

I would suggest adding it to ProxyConfig - so it can be used for specific workloads, with different settings. It can also be added to DestinationRule, if it's client-side. Or it can be a separate proto, that we can start in ProxyConfig and move to a standalone CRD. But please not in MeshConfig.
// +k8s:deepcopy-gen=true
// -->
message RateLimit {
  oneof binding {
How could I limit one specific workload's access to the route? I think they need to be combined.
Are you asking for a client-side rate limit restricted to a particular workload? That can be added.
Right
> But please not in MeshConfig

Rate limit feels like a similar category as telemetry, tracing, and the ext authz backend. So why not follow the same pattern? If we don't like that, we should change everything IMO. It may be better to be consistently "wrong" than inconsistent?

And we are changing - I hope. Telemetry and tracing are getting promoted to CRDs; hopefully authz will graduate too. The hope is that after all features graduate, MeshConfig will be an almost empty shell.
Also, for specifying a particular backend - we have pretty good patterns (BackendRef, hostname, etc.). Configurations related to a service/feature should be in the actual CRD - not in MeshConfig.
Isn't the Telemetry API still referring to mesh config providers? Almost all of the providers in mesh config are tracing providers added for the Telemetry API.
// kind: VirtualService
// route: blah
// ```
message RouteSelector {
TBH, it looks very hard to understand
It is less direct. However, it allows us to deprecate the networking API and keep this API around. The decoupling is important for future compatibility.
// Resource kind.
string kind = 3;
// Optional. Individual route rule name within the resource.
string route = 4;
What's this?
For a VirtualService, is it the HTTPRoute.Name? This field can be optional for now.
VirtualService has many routes inside. This is referring to a route within a virtual service. It could be /special_path, for example.
Is the common use case for the user to want a rate limit for a URL, or is it really to have a rate limit for requests going to a particular backend, with the route just a convoluted way to identify the cluster?

And is this common across other rate limit services and APIs? Once again, a doc with examples for all existing common rate limit services and APIs would make this much easier to review.
+1, and the doc should document and agree on common use cases. Is tying RateLimit to a route the right method to configure rate limits for the client side?
uint32 port = 2;

// The rate limit domain to use when calling the service.
string domain = 3;
Does it mean the whole mesh's rate limiting is under one domain?
It's per provider. You can have multiple rate limit providers with different domains.
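For example, two providers with separate domains (and therefore separate counter namespaces) might be declared like this; the list shape follows Istio's extension-provider pattern and is only a sketch, with the field names taken from this draft:

```yaml
rateLimitProviders:
- name: prod-rls
  service: rls.prod.svc.cluster.local
  port: 8081
  domain: prod
- name: staging-rls
  service: rls.staging.svc.cluster.local
  port: 8081
  domain: staging
```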
Is there a typical use for the field, e.g. prod vs test? The Envoy docs don't suggest much, which calls into question the value of asking the user to configure it vs having a default.
Signed-off-by: Kuat Yessenov <[email protected]>
Signed-off-by: Kuat Yessenov <[email protected]>
Signed-off-by: Kuat Yessenov <[email protected]>
// Constant entry key and value.
ConstantEntry constant = 3;
// CEL expression entry.
Expression expression = 4;
Identity from a JWT token and SPIFFE are going to be very commonly used; maybe we should make them first-class instead of using an expression?
We could. I think some reviewers want this API to be minimal though.
I take minimal as including identity in the descriptor, but not expression.
We can be prescriptive in the examples as opposed to enumerating cases in the API. I want to hear what others think, since I am fine with adding identity as an option.
One reason to include identity is that the dynamic metadata key for JWT token identity is an implementation detail, and it would be better not to let users configure expressions with it. The same goes for mTLS, which relies on the Envoy CEL implementation.
I think it's easy to add domain-specific actions here as long as we have agreement to broaden the API surface.
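To make the trade-off concrete, a descriptor entry for caller identity could take either of these shapes; both the first-class `principal` entry and the expression fields below are hypothetical illustrations, not fields from the draft:

```yaml
descriptors:
- entries:
  # hypothetical first-class form:
  - principal: {}
  # vs. the expression form, which exposes the CEL attribute directly:
  - expression:
      key: principal
      value: request.auth.principal
```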
}
oneof entry_type {
  // Entry obtained from a header value.
  RequestHeader header = 1;
Does RequestHeader apply to local rate limit?
It does. The local rate limit has a descriptor version, too.
Hmm, I am a bit confused. I thought local rate limit applies with condition-match semantics (i.e. if header x has value y, then apply this rate limit). IIUC, RequestHeader is used to get the value of a header instead of matching it against some value, and I am not sure how that works with local. I think I will need some example to understand it better.
You would have something like this in the local rate limit:

token_bucket: { general rate limit, can be set very high }
descriptors:
- token_bucket: { rate limit for matching header x to y }
  entries:
  - key: x
    value: y

Then add a rate limit action for header `x` separately.
// Entry obtained by testing a header value against exact value, prefix, etc.
RequestHeaderMatch header_match = 2;
// Constant entry key and value.
ConstantEntry constant = 3;
Same here, I guess constant does not apply to local rate limit?
No, I think descriptors are the same in both. Local rate limit gets a descriptor, matches against any one in the policy, then uses the corresponding token bucket instead of the global token bucket.
// Defines configuration for a global rate limiting service provider.
// The service must implement Rate Limit Service (RLS):
// https://www.envoyproxy.io/docs/envoy/latest/api-v3/service/ratelimit/v3/rls.proto.
message RateLimitProvider {
Do we have a use case for multiple RLS instances? Or can everything be expressed as one?
I am not aware of a use case for multiple global RLS instances.
- canary services using a different RLS (for example, a canary version of the RLS)
- regional RLS
- different implementations or instances of RLS (for example, different Redis servers dedicated to one app or a set of apps)

The other question is: do we have a use case for a namespace-configured RLS instead of a mesh-wide, mesh-admin-configured one? And I think the answer is yes.
I think that is covered by multiple named provider instances, no?
// to unambiguously resolve a service in the service registry. The <Hostname> is a fully qualified host name of a
// service defined by the Kubernetes service or ServiceEntry.
//
// Example: "rls.default.svc.cluster.local" or "bar/rls.example.com".
The examples specifically make it sound like they have to be in-cluster; they don't have to be.
These are copied verbatim from all other providers. I don't have the context for why we use this naming scheme in providers.
The service implementations don't have to be in-cluster, but we do prefer the Service/ServiceEntry to be a declared KRM resource available to the control plane. Not strictly required though...
// Optional. The workload selector decides where to apply the server-side rate limit in the same namespace.
istio.type.v1beta1.WorkloadSelector workload = 1;
// Optional. The route selector decides which client route rules enforce the client-side rate limit.
RouteSelector route = 2;
TargetRef is a good idea. @howardjohn is that document nearing approval? Besides, "ref" is a better description than "selector".

Re the example that Zhonghu gave: how would we limit per destination?
// Route selector references a client networking resource to overlay the rate
// limit filter application. This can be a service, a virtual service, or a
// gateway resource. Route name designation is optional.
Named route: clarify whether it is the name of the xDS resource or the configuration resource?
Do we get to control the route name in a portable way?
Route is the inner name within the k8s resource. Routes are not separate k8s resources (the hosts are), so we have to add a separate qualifier to select a route within. It's portable since it's not using the Envoy route name.
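For instance, the selector's `route` qualifier would point at the inner `name` of an HTTP route inside a VirtualService. The VirtualService below uses the standard networking API; only its relationship to the draft's RouteSelector is an assumption:

```yaml
apiVersion: networking.istio.io/v1beta1
kind: VirtualService
metadata:
  name: reviews-vs
spec:
  hosts:
  - reviews
  http:
  - name: special-path      # inner route name a RouteSelector's `route` field would reference
    match:
    - uri:
        prefix: /special_path
    route:
    - destination:
        host: reviews
```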
Token buckets are standard for implementing rate limiting. The question, I suspect, is whether they should be exposed as part of the API, and whether users should be expected (or allowed) to mess with this.
The alternative is that we have no Istio API, but controllers for backends to program Envoy. What this means is that if someone uses Google Cloud Armor to configure their rate limits, we will use that config to correctly configure the proxy. There will be a similar RLS controller that will read backend RLS config and program Envoy. These controllers may require istiod extensions or output an EnvoyFilter.

Managed gRPC may or may not implement it, but is likely to implement RLS anyway.

The use cases that customers need, rate limiting based on service account or based on the presence and values of multiple headers, cannot be expressed in a simpler API. But using the alternative above, we can solve that problem for specific backends without adding an API. A least-common-denominator Istio API can be orthogonally added.
Right, I haven't heard a single use case that only needs service-to-service rate limits. The least-common-denominator API does not answer anyone's needs.

You are right, this is just a principled EnvoyFilter for rate limiting. But that can be said about authz and others; they are just crafted subsets of xDS with a better operational user experience. EnvoyFilters are incredibly error-prone.
// If no route name is provided, the rate limit applies to all routes. However,
// the rate limit for a named route takes priority over the rate limit for the
// entire resource.
Can we give an example of the priority behavior?
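A sketch of the intended precedence, assuming the RouteSelector fields from this draft (`name`, `kind`, and the inner `route` qualifier):

```yaml
# Resource-wide limit: no route name, so it applies to every route in the resource.
route:
  kind: VirtualService
  name: reviews-vs
---
# Named-route limit: for the special-path route this one wins over the resource-wide limit.
route:
  kind: VirtualService
  name: reviews-vs
  route: special-path
```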
option go_package = "istio.io/api/policy/v1alpha1";

package istio.policy.v1alpha1;
High-level concern with rate limiting in general - I think we need to ensure it works well with our existing APIs, for example retries and locality LB. If we have a single pod at the rate limit (constant 429) in our zone, and 1000s of pods outside the zone returning 200, we must fail over. Similarly with retries, we probably don't want to retry the pod returning 429s (if we retry on 429 at all)?
I think it depends. We don't want cascading failures, so 429 might need to be propagated downstream to shed load. This would be the primary purpose for this version of rate limiting, since that appears to be what users want. Failover is a different feature.
The behavior of rate limiting and locality load balancing can be extremely dangerous though.

Without rate limiting: a pod gets overloaded and starts returning 5xx, triggering outlier detection and failover.

With rate limiting: the pod sheds load with 429, not triggering outlier detection. 100% of requests are now failing, even though there are plenty of pods that could handle the load.

This is exacerbated when there is an uneven distribution between pods in different zones (i.e. 1 pod in priority 0, 1000s in priority 1).
503 and 429 are similar, and Envoy treats x-envoy-retry-grpc-on / resource-exhausted as retryable. It's OK to retry on 429/503, but the client needs to back off. I think what John suggested is just a regular LB which does not take back pressure into account. gRPC throttles itself on the client side, which is why it's considered "safe", although there have been some cascading failures due to rate-limited retries.
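For context, retries are already configurable per route in the existing networking API, and whether 429 belongs in `retryOn` is exactly the open question above. The policy below uses standard retry-on values and deliberately excludes 429:

```yaml
http:
- route:
  - destination:
      host: reviews
  retries:
    attempts: 3
    retryOn: "503,reset,connect-failure"  # does not retry 429 responses
```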
@kyessenov: PR needs rebase.
Signed-off-by: Kuat Yessenov [email protected]