Defect #23318: #lock_nested_set very slow on mysql with thousands of subtasks - Redmine

Actions

Copy link

Defect #23318

closed

#lock_nested_set very slow on mysql with thousands of subtasks

Added by Stephane Evr about 9 years ago. Updated over 1 year ago.

Status:

Closed

Priority:

Normal

Assignee:

Jean-Philippe Lang

Category:

Database

Target version:

Start date:

Due date:

% Done:

Estimated time:

Resolution:

Duplicate

Affected version:

Description

I have a complex hierarchy of around 15000 issues in redmine, where an issue of this set could potentially have 3000 subtasks.

When doing CRUD operations on such issue, I notified significant slow downs, caused by the lock_nested_set function.

Here is the profiling for the query actually run in lock_nested_set:

SELECT `issues`.`id` FROM `issues` WHERE (root_id IN (SELECT root_id FROM issues WHERE id IN (70395,70389)))  ORDER BY `issues`.`id` ASC FOR UPDATE;

+-------+
|       |
.........
| 70371 |
| 70373 |
| 70375 |
| 70377 |
| 70379 |
| 70381 |
| 70383 |
| 70385 |
| 70387 |
| 70389 |
| 70391 |
| 70393 |
+-------+
2932 rows in set (2.70 sec)

mysql> show profile for QUERY 1;
+--------------------------------+----------+
| Status                         | Duration |
+--------------------------------+----------+
| starting                       | 0.000025 |
| Waiting for query cache lock   | 0.000004 |
| checking query cache for query | 0.000081 |
| checking permissions           | 0.000003 |
| checking permissions           | 0.000004 |
| Opening tables                 | 0.000038 |
| System lock                    | 0.000017 |
| init                           | 0.000056 |
| optimizing                     | 0.000013 |
| statistics                     | 0.000023 |
| preparing                      | 0.000009 |
| executing                      | 0.000002 |
| Sorting result                 | 0.000005 |
| Sending data                   | 0.000049 |
| optimizing                     | 0.000015 |
| statistics                     | 0.000047 |
| preparing                      | 2.690165 |
| end                            | 0.000009 |
| query end                      | 0.000067 |
| closing tables                 | 0.000010 |
| freeing items                  | 0.000038 |
| logging slow query             | 0.000002 |
| logging slow query             | 0.000163 |
| cleaning up                    | 0.000003 |
+--------------------------------+----------+
24 rows in set (0.00 sec)

It takes around 3 seconds to execute the whole query.

I think the main problem is with the nested SELECT statement. If I execute it separately, then paste its results directly into the main query, the query is much faster:

mysql> SELECT root_id FROM issues WHERE id IN (70395,70389);
+---------+
| root_id |
+---------+
|   45083 |
+---------+
1 row in set (0.00 sec)

SELECT `issues`.`id` FROM `issues` WHERE (root_id IN (45083))  ORDER BY `issues`.`id` ASC FOR UPDATE;

+-------+
|       |
.........
| 70371 |
| 70373 |
| 70375 |
| 70377 |
| 70379 |
| 70381 |
| 70383 |
| 70385 |
| 70387 |
| 70389 |
| 70391 |
| 70393 |
+-------+
2932 rows in set (0.01 sec)

I am not an expert in sql queries, and don't want to break anything... Shouldn't we use a JOIN instead?

Environment:
  Redmine version                3.3.0.stable
  Ruby version                   2.2.2-p95 (2015-04-13) [x86_64-linux]
  Rails version                  4.2.6
  Environment                    development
  Database adapter               Mysql2

Files

Download all files

issue_nested_set.patch (834 Bytes) issue_nested_set.patch		Stephane Evr, 2016-10-13 12:06
mysql-deadlock-02.diff (1.31 KB) mysql-deadlock-02.diff		Toshi MARUYAMA, 2017-03-21 16:35

Related issues

Actions

Copy link

Updated by Stephane Evr about 9 years ago

FYI, I was able to change the statement used in lock_nested_set

from:

self.class.reorder(:id).where("root_id IN (SELECT root_id FROM #{self.class.table_name} WHERE id IN (?))", sets_to_lock).lock.ids

to:

self.class.reorder(:id).joins("INNER JOIN #{self.class.table_name} t2 ON #{self.class.table_name}.root_id = t2.root_id").where("t2.id IN (?)", sets_to_lock).distinct.lock.ids

so far the later returns instantly, with the same results as the former. I don't know though how it affects the locking mechanism.

Actions

Copy link

Updated by Toshi MARUYAMA about 9 years ago

Category set to Database

Actions

Copy link

Updated by Go MAEDA almost 9 years ago

Target version set to Candidate for next major release

Passed all tests.

The following is a diff of changes made by Stephane Evr.

Index: lib/redmine/nested_set/issue_nested_set.rb
===================================================================
--- lib/redmine/nested_set/issue_nested_set.rb    (revision 15663)
+++ lib/redmine/nested_set/issue_nested_set.rb    (working copy)
@@ -158,7 +158,7 @@
           self.class.reorder(:id).where(:root_id => sets_to_lock).lock(lock).ids
         else
           sets_to_lock = [id, parent_id].compact
-          self.class.reorder(:id).where("root_id IN (SELECT root_id FROM #{self.class.table_name} WHERE id IN (?))", sets_to_lock).lock.ids
+          self.class.reorder(:id).joins("INNER JOIN #{self.class.table_name} t2 ON #{self.class.table_name}.root_id = t2.root_id").where("t2.id IN (?)", sets_to_lock).distinct.lock.ids
         end
       end

Actions

Copy link

Updated by Toshi MARUYAMA almost 9 years ago

Target version changed from Candidate for next major release to 3.4.0

Actions

Copy link

Updated by Jean-Philippe Lang almost 9 years ago

Subject changed from lock_nested_set very slow on mysql to #lock_nested_set very slow on mysql with thousands of subtasks

Go MAEDA wrote:

Passed all tests.

Did you run the tests with mysql? Because I get an error with Postgresql with the patch applied (FOR UPDATE not allowed with DISTINCT).
Tests pass without .distinct

Actions

Copy link

Updated by Go MAEDA almost 9 years ago

Jean-Philippe Lang wrote:

Did you run the tests with mysql? Because I get an error with Postgresql with the patch applied (FOR UPDATE not allowed with DISTINCT).
Tests pass without .distinct

Sorry, I run the tests only with sqlite.

Actions

Copy link

Updated by Jean-Philippe Lang almost 9 years ago

Category changed from Database to Performance
Status changed from New to Closed
Assignee set to Jean-Philippe Lang
Resolution set to Fixed

Patch committed without the .distinct call, thanks.

Actions

Copy link

Updated by Toshi MARUYAMA almost 9 years ago

Status changed from Closed to Reopened

MySQL "IssueNestedSetConcurrencyTest#test_concurrency" and "IssueNestedSetConcurrencyTest#test_concurrent_subtasks_creation" fail.
http://www.redmine.org/builds/logs/build_trunk_mysql_ruby-2.3_3063.html

It may be related with #19344.

Actions

Copy link

Updated by Toshi MARUYAMA almost 9 years ago

Related to Defect #19344: MySQL 5.6: IssueNestedSetConcurrencyTest#test_concurrency : always fails added

Actions

Copy link

#10

Updated by Jean-Philippe Lang almost 9 years ago

Assignee deleted (~~Jean-Philippe Lang~~)
Target version deleted (~~3.4.0~~)

r15891 reverted.

Actions

Copy link

#11

Updated by Stephane Evr almost 9 years ago

I found an alternative solution which passes the tests, although not as fast as with a JOIN. Here, I simply group by issue id in the original subquery:

SELECT `issues`.* FROM `issues` WHERE (root_id IN (SELECT root_id from issues WHERE id IN (96457,96455) GROUP BY id)) ORDER BY `issues`.`id` ASC;

So the statement would be rewritten as :

self.class.reorder(:id).where("root_id IN (SELECT root_id FROM #{self.class.table_name} WHERE id IN (?) GROUP BY id)", sets_to_lock).lock.ids

Actions

Copy link

#12

Updated by Toshi MARUYAMA almost 9 years ago

Target version set to 3.4.0

Actions

Copy link

#13

Updated by Stephane Evr almost 9 years ago

File issue_nested_set.patch issue_nested_set.patch added

I finally found how to refactor the JOIN Statement and keep the initial performance improvements. Please find attached a patch against the master branch. This passed the issue_nested_set_concurrency_test on MySQL.

Actions

Copy link

#14

Updated by Stephane Evr almost 9 years ago

I ran the test 10 times and all passed

Actions

Copy link

#15

Updated by Jean-Philippe Lang over 8 years ago

Target version changed from 3.4.0 to Candidate for next major release

Reverted r16053 because of SQL error with PostgreSQL.

Actions

Copy link

#16

Updated by Jean-Philippe Lang over 8 years ago

Assignee set to Jean-Philippe Lang

PostgreSQL does not accept the DISTINCT in the subquery:
http://www.redmine.org/builds/logs/build_trunk_postgresql_ruby-2.3_3124.html

One option would be to have a statement specfic to MySQL just like we already have one for SQL Server.

Actions

Copy link

#17

Updated by Stephane Evr over 8 years ago

Jean-Philippe Lang wrote:

PostgreSQL does not accept the DISTINCT in the subquery:
http://www.redmine.org/builds/logs/build_trunk_postgresql_ruby-2.3_3124.html

One option would be to have a statement specfic to MySQL just like we already have one for SQL Server.

Perhaps this is the solution. I don't know if PostgreSQL is affected by this problem.

Actions

Copy link

#18

Updated by Toshi MARUYAMA over 8 years ago

File mysql-deadlock-02.diff mysql-deadlock-02.diff added

Based #note-13 patch.
This patch reduces #19344 test failure times on my CentOS7 mariadb-5.5.52-1.el7.x86_64.
(Not always test pass. About 50% time test passes.)

Actions

Copy link

#19

Updated by Go MAEDA over 1 year ago

Related to Defect #39437: MySQL / MariaDB issue nested set deadlocks and consistency added

Actions

Copy link

#20

Updated by Marius BĂLTEANU over 1 year ago

Category changed from Performance to Database
Status changed from Reopened to Closed
Target version deleted (~~Candidate for next major release~~)
Resolution changed from Fixed to Duplicate

This issue is fixed in the upcoming Redmine 5.1.1 (it will be released soon) and 6.0.0, but it may require some changes to the database settings, please see all the comments from #39437 and the MySQL_configuration page.

I'm closing this as duplicate.

Actions

Copy link

Also available in: Atom PDF

Project

General

Profile

Redmine

Custom queries

Defect #23318

#lock_nested_set very slow on mysql with thousands of subtasks

Updated by Stephane Evr about 9 years ago

Updated by Toshi MARUYAMA about 9 years ago

Updated by Go MAEDA almost 9 years ago

Updated by Toshi MARUYAMA almost 9 years ago

Updated by Jean-Philippe Lang almost 9 years ago

Updated by Go MAEDA almost 9 years ago

Updated by Jean-Philippe Lang almost 9 years ago

Updated by Toshi MARUYAMA almost 9 years ago

Updated by Toshi MARUYAMA almost 9 years ago

Updated by Jean-Philippe Lang almost 9 years ago

Updated by Stephane Evr almost 9 years ago

Updated by Toshi MARUYAMA almost 9 years ago

Updated by Stephane Evr almost 9 years ago

Updated by Stephane Evr almost 9 years ago

Updated by Jean-Philippe Lang over 8 years ago

Updated by Jean-Philippe Lang over 8 years ago

Updated by Stephane Evr over 8 years ago

Updated by Toshi MARUYAMA over 8 years ago

Updated by Go MAEDA over 1 year ago

Updated by Marius BĂLTEANU over 1 year ago