2009-09-19 89 views
0

(使用MySQL 4.1.22)帮助优化MySQL查询

我不能让我的这个查询在大表上使用索引(200K +行),它做就可以了全表扫描。查询现在需要大约1.2秒。如果可能,我想把它缩短到不到0.2秒。

这里是我的查询:

SELECT st_issues.issue_id, st_issues.cat_id,st_categories.name AS cat_name, st_issues.status_id,st_statuses.name AS status_name, st_issues.priority_id,st_priorities.name AS priority_name,st_priorities.color AS color, st_issues.assigned_cid,assigned_u.firstname,assigned_u.lastname,assigned_u.screenname, message, rating, created_by_email,created_by_cid,created_by_uid,by_user.firstname AS by_firstname,by_user.lastname AS by_lastname,by_user.screenname AS by_screenname, st_issues.browser,from_url,created_by_store,created,st_issues.stamp 
FROM st_issues 
JOIN st_categories ON (st_issues.cat_id=st_categories.cat_id) 
JOIN st_statuses ON (st_issues.status_id=st_statuses.status_id) 
JOIN st_priorities ON (st_issues.priority_id=st_priorities.priority_id) 
LEFT JOIN users AS assigned_u ON (assigned_u.cid=st_issues.assigned_cid) 
LEFT JOIN users AS by_user ON (by_user.uid=st_issues.created_by_uid) 
LEFT JOIN st_issue_changes ON (st_issues.issue_id=st_issue_changes.issue_id AND change_id=0) 
WHERE st_issues.assigned_cid=0 

解释的结果:

1, 'SIMPLE', 'st_issues', 'ALL', '', '', , '', 4, 'Using where' 
1, 'SIMPLE', 'st_categories', 'eq_ref', 'PRIMARY', 'PRIMARY', 1, 'sg.st_issues.cat_id', 1, '' 
1, 'SIMPLE', 'st_priorities', 'eq_ref', 'PRIMARY', 'PRIMARY', 1, 'sg.st_issues.priority_id', 1, '' 
1, 'SIMPLE', 'assigned_u', 'ref', 'cid', 'cid', 8, 'sg.st_issues.assigned_cid', 1, '' 
1, 'SIMPLE', 'st_statuses', 'ALL', 'PRIMARY', '', , '', 4, 'Using where' 
1, 'SIMPLE', 'by_user', 'ALL', '', '', , '', 221623, '' 
1, 'SIMPLE', 'st_issue_changes', 'eq_ref', 'PRIMARY', 'PRIMARY', 6, 'sg.st_issues.issue_id,const', 1, '' 

显然问题出在加入上“by_user”,因为它不使用索引。

下面是一些“用户”表的定义:

CREATE TABLE `users` (
    `cid` double unsigned NOT NULL auto_increment, 
    `uid` varchar(20) NOT NULL default '', 
... 
    `firstname` varchar(20) default NULL, 
    `lastname` varchar(20) default NULL, 
... 
    PRIMARY KEY (`uid`), 
... 
) ENGINE=InnoDB 

任何人有为什么它不使用连接中的主键的任何想法?
任何人有任何想法或提示如何加快此查询更多?

(我可以添加其他表的表定义,如果需要/想)

编辑:

这里是st_issues表定义:

CREATE TABLE `st_issues` (
    `issue_id` int(10) unsigned NOT NULL auto_increment, 
    `cat_id` tinyint(3) unsigned NOT NULL default '0', 
    `status_id` tinyint(3) unsigned NOT NULL default '0', 
    `priority_id` tinyint(3) unsigned NOT NULL default '0', 
    `assigned_cid` int(10) unsigned NOT NULL default '0', 
    `rating` tinyint(4) default NULL, 
    `created_by_email` varchar(255) NOT NULL default '', 
    `created_by_cid` int(10) unsigned NOT NULL default '0', 
    `created_by_uid` varchar(20) NOT NULL default '', 
    `created_by_store` tinyint(3) unsigned NOT NULL default '0', 
    `browser` varchar(255) NOT NULL default '', 
    `from_url` varchar(255) NOT NULL default '', 
    `created` datetime NOT NULL default '0000-00-00 00:00:00', 
    `stamp` datetime NOT NULL default '0000-00-00 00:00:00', 
    PRIMARY KEY (`issue_id`), 
    KEY `idx_create_by_cid` (`created_by_cid`), 
    KEY `idx_create_by_uid` (`created_by_uid`) 
) ENGINE=InnoDB DEFAULT CHARSET=utf8; 
+0

你对st_issues有什么索引? – 2009-09-19 15:40:11

+0

您定义了哪些索引? – 2009-09-19 15:40:13

+0

我刚刚添加了st_issues的定义 – Echo 2009-09-19 15:44:40

回答

3

是整个的用户表的定义?

因为它说:

)ENGINE = InnoDB的

而st_issues说:

)ENGINE = InnoDB的默认字符集= UTF8;

如果你的两个表可以使用不同的排序规则,对uid和created_by_uid两个字符串数据类型是不同的,和MySQL必须做一个字符集的强制,才可以对它们进行比较,从而击败你的索引。

确保您对数据库中的所有文本使用相同的字符集/排序规则总是最好的。

+0

啊,我敢打赌这就是问题所在,谢谢。 – Echo 2009-09-19 16:33:32

0

我做了一些测试,发现了以下更改帮助:

  • st_issues.assigned_cid添加索引。

  • users表的主键改为cid而不是uid

  • 变化的连接条件by_user使用cid而不是uid

    LEFT JOIN users AS by_user ON (by_user.cid=st_issues.created_by_cid) 
    

然后我得到了以下EXPLAIN报告(虽然零行数据):

+----+-------------+------------------+--------+---------------+--------------+---------+-------------------------------+------+-------------+ 
| id | select_type | table   | type | possible_keys | key   | key_len | ref       | rows | Extra  | 
+----+-------------+------------------+--------+---------------+--------------+---------+-------------------------------+------+-------------+ 
| 1 | SIMPLE  | st_issues  | ref | assigned_cid | assigned_cid | 4  | const       | 1 |    | 
| 1 | SIMPLE  | st_categories | eq_ref | PRIMARY  | PRIMARY  | 1  | test.st_issues.cat_id   | 1 |    | 
| 1 | SIMPLE  | st_statuses  | eq_ref | PRIMARY  | PRIMARY  | 1  | test.st_issues.status_id  | 1 |    | 
| 1 | SIMPLE  | st_priorities | eq_ref | PRIMARY  | PRIMARY  | 1  | test.st_issues.priority_id | 1 |    | 
| 1 | SIMPLE  | assigned_u  | eq_ref | PRIMARY  | PRIMARY  | 8  | test.st_issues.assigned_cid | 1 |    | 
| 1 | SIMPLE  | by_user   | eq_ref | PRIMARY  | PRIMARY  | 8  | test.st_issues.created_by_cid | 1 |    | 
| 1 | SIMPLE  | st_issue_changes | eq_ref | PRIMARY  | PRIMARY  | 8  | test.st_issues.issue_id,const | 1 | Using index | 
+----+-------------+------------------+--------+---------------+--------------+---------+-------------------------------+------+-------------+ 

这表明优化器为每个表选择了一个索引,而不是在您的查询版本中。我不得不猜测你的查找表的定义。

另一件事,我建议是定义查找表st_categoriesst_statuses自然键,类别或状态的名称。然后从st_issues表中引用该自然键,而不是使用tinyint伪码。好处是您不必执行这些连接来获取类别或状态的名称;它已经在st_issues表中。