
[monitoring] Adding influxDB 2.x version support #274 #584

Draft

praptisharma28 wants to merge 7 commits into master from influxdb2.x

Member

praptisharma28 commented May 31, 2024 •

edited


Fixes #274

Checks:

I have manually tested the proposed changes
I have written new test cases to avoid regressions (if necessary)
I have updated the documentation (e.g. README.rst)

praptisharma28 force-pushed the influxdb2.x branch from e2b2446 to 295446d Compare

May 31, 2024 17:39


          [monitoring] Adding influxDB 2.x version support #274

4a6ae39

Fixes #274

praptisharma28 force-pushed the influxdb2.x branch 2 times, most recently from 83694c5 to cfe8da0 Compare

June 1, 2024 15:30


          [fix] Workflow and test updated #274

d02db9c

Fixes #274

praptisharma28 force-pushed the influxdb2.x branch from 87f36b9 to d02db9c Compare

June 1, 2024 20:10


          [monitoring] Updated files and corrected failing qa checks #274

0aa3c6d

Fixes #274

praptisharma28 force-pushed the influxdb2.x branch from 5647cd4 to 0aa3c6d Compare

June 11, 2024 10:09

pandafy reviewed


README.rst

pandafy reviewed


openwisp_monitoring/db/backends/__init__.py Outdated

Member Author

praptisharma28 commented Jun 18, 2024

June 18th weekly call summary:

Get the ping check working against InfluxDB 2.0 on the local server, so that health statuses such as "CRITICAL" and "OK" are shown, not just "UNKNOWN".
Get the charts working; at the moment they show an error when I open them.
Note: Celery beat and the Celery worker will be running all the time alongside the server while I work locally on port 8000.


          [monitoring] Changes #274

cfc9b58

Fixes #274

nemesifier requested changes


Member

nemesifier left a comment

There are too many if statements in the code (e.g. if influxdb1 or influxdb2).
This is not the way to go. The logic for influxdb1 and influxdb2 must be encapsulated in the respective timeseries DB backend; the rest of the code should just be adapted to call that logic.

Please have a look at the work which was done for the elasticsearch backend to get an idea:
#164
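
A minimal sketch of the encapsulation described above, assuming a hypothetical shared interface. The class names, the `build_read_query` method and the `chart_query` helper are illustrative only and are not taken from the openwisp-monitoring code base. Each backend owns its query dialect, and the calling code contains no version checks:

```python
from abc import ABC, abstractmethod


class BaseTimeseriesClient(ABC):
    """Hypothetical common interface; not the project's actual base class."""

    @abstractmethod
    def build_read_query(self, key: str, days: str) -> str:
        """Return a backend-specific query for the given metric key."""


class InfluxDB1Client(BaseTimeseriesClient):
    def build_read_query(self, key, days):
        # InfluxQL syntax stays inside the 1.x backend.
        return f'SELECT * FROM "{key}" WHERE time >= now() - {days}'


class InfluxDB2Client(BaseTimeseriesClient):
    def __init__(self, bucket):
        self.bucket = bucket

    def build_read_query(self, key, days):
        # Flux syntax stays inside the 2.x backend.
        return (
            f'from(bucket: "{self.bucket}")'
            f' |> range(start: -{days})'
            f' |> filter(fn: (r) => r["_measurement"] == "{key}")'
        )


def chart_query(timeseries_db: BaseTimeseriesClient, key: str, days: str = '7d') -> str:
    # The caller relies only on the shared interface: no "if influxdb1/influxdb2".
    return timeseries_db.build_read_query(key, days)
```

With this shape, supporting another backend means adding one more class rather than another branch in the model code.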

praptisharma28 added 2 commits

June 29, 2024 17:40


          [monitoring] Avoided if statements #274

61300a6

Fixes #274


          [monitoring] Chart generation code modified #274

bff44f1

Fixes #274

pandafy requested changes


Member

pandafy left a comment

In our call today, @praptisharma28 walked me through the code and helped me understand the changes. In this process, we found some areas for improvement.

openwisp_monitoring/db/backends/influxdb2/client.py Outdated

+                      timezone=settings.TIME_ZONE
+                  ):
+                      bucket = self.bucket
+                      measurement = params.get('measurement')

Member

pandafy Jul 1, 2024

params.get('measurement') is equal to params.get('key'). Let's avoid duplicating values that are already passed to the method.

openwisp_monitoring/monitoring/base/models.py Outdated

+                          'start_date': start_date,
+                          'end_date': end_date,
+                          'measurement': self.config_dict.get('measurement', self.metric.key),
+                          'field_name': fields or self.config_dict.get('field_name'),

Member

pandafy Jul 1, 2024

If you are not using the field_name in the query, then please remove it.

Member Author

praptisharma28 Jul 1, 2024

I used pdb, got:

(Pdb) params
{'field_name': None, 'key': 'test_metric', 'time': '2024-06-25', 'days': '7d', 'content_type': 'openwisp_users.user', 'object_id': '4f5f77d3-3b30-4eca-97c6-1301217a4edc', 'start_date': None, 'end_date': None, 'measurement': 'test_metric'}

But if I remove it, I lose the summary of the uptime chart: the "mostly reachable", "unreachable" and "partially reachable" charts disappear.

Member

pandafy Jul 2, 2024

There should be only one summary in the uptime chart.

openwisp_monitoring/monitoring/base/models.py Outdated

+                      params.update({
+                          'start_date': start_date,
+                          'end_date': end_date,
+                          'measurement': self.config_dict.get('measurement', self.metric.key),

Member

pandafy Jul 1, 2024

This duplicates key; let's remove it.

openwisp_monitoring/monitoring/base/models.py

Comment on lines 703 to 708

                       return timeseries_db._get_top_fields(
-                          query=q,
+                          default_query=self._default_query,
+                          query=self.get_query(),
                           chart_type=self.type,
                           group_map=self._get_group_map(params['days']),
                           number=number,

Member

pandafy Jul 1, 2024

The InfluxDB 2 client does not accept a default_query keyword argument. We need to ensure that this works on par with InfluxDB 1 and add any required tests for this method.

openwisp_monitoring/monitoring/base/models.py Outdated

Comment on lines 772 to 780

+                              points = summary = timeseries_db._get_top_fields(
+                                  default_query=self._default_query,
+                                  chart_type=self.type,
+                                  group_map=self.GROUP_MAP,
+                                  number=self.top_fields,
+                                  params=self._get_query_params(self.DEFAULT_TIME),
+                                  time=time,
+                                  query=self.query,
+                                  get_fields=False,

Member

pandafy Jul 1, 2024

This change is wrong. Please revert it to what was here before.

openwisp_monitoring/monitoring/base/models.py Outdated

-                          summary = timeseries_db.get_list_query(summary_query)
+                              points = timeseries_db.get_list_query(data_query, key=self.metric.key)
+                              summary = timeseries_db.get_list_query(
+                                  summary_query, key=self.metric.key

Member

pandafy Jul 1, 2024

Suggested change

-                                summary_query, key=self.metric.key
+                                summary_query,

I don't see the key argument being used in the timeseries_db.get_list_query method. Remove it if it is not needed.

openwisp_monitoring/monitoring/base/models.py Outdated

                          for point in points:
                              time_value = point.get('time') or point.get('_time')

Member

pandafy Jul 1, 2024

Let's avoid doing this and rename the time field in the flux query.
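
One way to do this, sketched under assumptions: Flux provides a `rename()` function that can map the `_time` column to `time`, so the rows returned by the 2.x backend use the same key as the 1.x backend. The helper name `with_time_renamed` is hypothetical and not part of the PR:

```python
def with_time_renamed(flux_query: str) -> str:
    """Append a Flux rename() stage so rows expose 'time' instead of '_time'."""
    # Doubled braces produce literal { } in the f-string output.
    return f'{flux_query.rstrip()}\n  |> rename(columns: {{_time: "time"}})'
```

The model code can then read `point['time']` for both backends without falling back to `_time`.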

openwisp_monitoring/monitoring/base/models.py Outdated

Comment on lines 795 to 797

+                          if not time_value:
+                              logger.warning(f"Point missing time value: {point}")
+                              continue

Member

pandafy Jul 1, 2024

Will this code ever get executed? Isn't every point in the timeseries DB mapped to a timestamp?

openwisp_monitoring/monitoring/base/models.py Outdated

Comment on lines 783 to 809

    
                              if decimal_places and isinstance(value, (int, float)):

                              if decimal_places is not None and value is not None:

Member

pandafy Jul 1, 2024

Can you please verify the error you were getting here?
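
For reference while verifying, the two conditions quoted above are not equivalent. The sketch below compares them on a few inputs; the function names are hypothetical and only wrap the two expressions:

```python
def truthy_check(decimal_places, value):
    # Skips rounding when decimal_places is 0 or value is not a number.
    return bool(decimal_places and isinstance(value, (int, float)))


def none_check(decimal_places, value):
    # Accepts decimal_places == 0, but also lets non-numeric values through.
    return decimal_places is not None and value is not None


cases = [(2, 1.2345), (0, 1.5), (2, 'n/a'), (2, None), (None, 1.5)]
differences = [c for c in cases if truthy_check(*c) != none_check(*c)]
```

The two checks disagree when `decimal_places` is `0` and when `value` is a non-numeric, non-None object such as a string; in the latter case a subsequent `round()` call would raise `TypeError`.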

openwisp_monitoring/db/backends/influxdb2/client.py Outdated

Comment on lines 217 to 229

+                  def get_ping_data_query(self, bucket, start, stop, device_ids):
+                      device_filter = ' or '.join([f'r["object_id"] == "{id}"' for id in device_ids])
+                      query = f'''
+                      from(bucket: "{bucket}")
+                        |> range(start: {start}, stop: {stop})
+                        |> filter(fn: (r) => r["_measurement"] == "ping")
+                        |> filter(fn: (r) => r["_field"] == "loss" or r["_field"] == "reachable" or r["_field"] == "rtt_avg" or r["_field"] == "rtt_max" or r["_field"] == "rtt_min")
+                        |> filter(fn: (r) => r["content_type"] == "config.device")
+                        |> filter(fn: (r) => {device_filter})
+                        |> aggregateWindow(every: v.windowPeriod, fn: mean, createEmpty: false)
+                        |> yield(name: "mean")
+                      '''
+                      return query

Member

pandafy Jul 1, 2024

Please move this query to queries.py and use get_query() method to generate the final query.
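
A rough sketch of what that could look like, assuming a template with named placeholders that a `get_query()`-style function fills in. The constant name, the helper functions and the `window` parameter are all hypothetical; in particular, the sketch substitutes an explicit window for `v.windowPeriod`, which is an assumption rather than something decided in this thread:

```python
# Hypothetical template in the style of a queries.py entry (names are illustrative).
PING_DATA_QUERY = (
    'from(bucket: "{bucket}")\n'
    '  |> range(start: {start}, stop: {stop})\n'
    '  |> filter(fn: (r) => r["_measurement"] == "ping")\n'
    '  |> filter(fn: (r) => {field_filter})\n'
    '  |> filter(fn: (r) => r["content_type"] == "config.device")\n'
    '  |> filter(fn: (r) => {device_filter})\n'
    '  |> aggregateWindow(every: {window}, fn: mean, createEmpty: false)\n'
    '  |> yield(name: "mean")'
)

PING_FIELDS = ('loss', 'reachable', 'rtt_avg', 'rtt_max', 'rtt_min')


def flux_string(value):
    """Return value as a double-quoted Flux string literal."""
    escaped = str(value).replace('\\', '\\\\').replace('"', '\\"')
    return f'"{escaped}"'


def or_filter(column, values):
    """Build 'r["col"] == "a" or r["col"] == "b"' for a Flux filter body."""
    return ' or '.join(f'r["{column}"] == {flux_string(v)}' for v in values)


def get_query(template, **params):
    """Fill the named placeholders of a query template."""
    return template.format(**params)


def build_ping_query(bucket, start, stop, device_ids, window='1h'):
    if not device_ids:
        # An empty list would produce an invalid, empty filter body.
        raise ValueError('device_ids must not be empty')
    return get_query(
        PING_DATA_QUERY,
        bucket=bucket,
        start=start,
        stop=stop,
        field_filter=or_filter('_field', PING_FIELDS),
        device_filter=or_filter('object_id', device_ids),
        window=window,
    )
```

Keeping the template as data means the client method only supplies parameters, and escaping the device IDs in one place avoids interpolating raw values into the query text.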

pandafy reviewed


openwisp_monitoring/db/backends/influxdb2/client.py

+                      bucket = self.bucket
+                      # Start building the Flux query
+                      flux_query = f'from(bucket:"{bucket}")'

Member

pandafy Jul 4, 2024 •

edited


~~@praptisharma28 why are you building the query like this here? Why aren't you using the queries defined in queries.py?~~

~~This looks wrong. Charts for different metrics are handled differently. That's why there are individual queries for each chart.~~

Member

pandafy Jul 4, 2024

My bad, I got confused between the get_list_queries and read method. The current implementation is okay.


          [monitoring] Some changes updated #274

Fixes #274

