| ²é¿´: 1133 | »Ø¸´: 5 | |||
| µ±Ç°Ö÷ÌâÒѾ´æµµ¡£ | |||
wswswwsÒø³æ (ÕýʽдÊÖ)
|
[½»Á÷]
¹ØÁª¹æÔòÍÚ¾òËã·¨×ÛÊö
|
||
|
¹ØÁª¹æÔòÍÚ¾òËã·¨×ÛÊö ÄÚÈݼò½é£º±¾ÎĽéÉÜÁ˹ØÁª¹æÔòµÄ»ù±¾¸ÅÄîºÍ·ÖÀà·½·¨£¬ÁоÙÁËһЩ¹ØÁª¹æÔòÍÚ¾òËã·¨²¢¼òÒª·ÖÎöÁ˵äÐÍËã·¨£¬Õ¹ÍûÁ˹ØÁª¹æÔòÍÚ¾òµÄδÀ´Ñо¿·½Ïò¡£ 1 ÒýÑÔ ¹ØÁª¹æÔòÍÚ¾ò·¢ÏÖ´óÁ¿Êý¾ÝÖÐÏ֮¼äÓÐȤµÄ¹ØÁª»òÏà¹ØÁªÏµ¡£ËüÔÚÊý¾ÝÍÚ¾òÖÐÊÇÒ»¸öÖØÒªµÄ¿ÎÌ⣬×î½ü¼¸ÄêÒѱ»Òµ½çËù¹ã·ºÑо¿¡£ ¹ØÁª¹æÔòÍÚ¾òµÄÒ»¸öµäÐÍÀý×ÓÊǹºÎïÀº·ÖÎö¡£¹ØÁª¹æÔòÑо¿ÓÐÖúÓÚ·¢ÏÖ½»Ò×Êý¾Ý¿âÖв»Í¬ÉÌÆ·£¨Ï֮¼äµÄÁªÏµ£¬ÕÒ³ö¹Ë¿Í¹ºÂòÐÐΪģʽ£¬È繺ÂòÁËijһÉÌÆ·¶Ô¹ºÂòÆäËûÉÌÆ·µÄÓ°Ïì¡£·ÖÎö½á¹û¿ÉÒÔÓ¦ÓÃÓÚÉÌÆ·»õ¼Ü²¼¾Ö¡¢»õ´æ°²ÅÅÒÔ¼°¸ù¾Ý¹ºÂòģʽ¶ÔÓû§½øÐзÖÀà¡£ AgrawalµÈÓÚ1993ÄêÊ×ÏÈÌá³öÁËÍÚ¾ò¹Ë¿Í½»Ò×Êý¾Ý¿âÖÐÏ¼äµÄ¹ØÁª¹æÔòÎÊÌâ[AIS93b]£¬ÒÔºóÖî¶àµÄÑо¿ÈËÔ±¶Ô¹ØÁª¹æÔòµÄÍÚ¾òÎÊÌâ½øÐÐÁË´óÁ¿µÄÑо¿¡£ËûÃǵŤ×÷°üÀ¨¶ÔÔÓеÄËã·¨½øÐÐÓÅ»¯£¬ÈçÒýÈëËæ»ú²ÉÑù¡¢²¢ÐеÄ˼ÏëµÈ£¬ÒÔÌá¸ßËã·¨ÍÚ¾ò¹æÔòµÄЧÂÊ£»¶Ô¹ØÁª¹æÔòµÄÓ¦ÓýøÐÐÍÆ¹ã¡£ ×î½üÒ²ÓжÀÁ¢ÓÚAgrawalµÄƵ¼¯·½·¨µÄ¹¤×÷[HPY00]£¬ÒÔ±ÜÃâÆµ¼¯·½·¨µÄһЩȱÏÝ£¬Ì½Ë÷ÍÚ¾ò¹ØÁª¹æÔòµÄз½·¨¡£Ò²ÓÐһЩ¹¤×÷[KPR98]×¢ÖØÓÚ¶ÔÍÚ¾òµ½µÄģʽµÄ¼ÛÖµ½øÐÐÆÀ¹À£¬ËûÃÇÌá³öµÄÄ£Ðͽ¨ÒéÁËһЩֵµÃ¿¼ÂǵÄÑо¿·½Ïò¡£ 2 »ù±¾¸ÅÄî ÉèI={i1,i2,..,im}ÊÇÏ£¬ÆäÖÐik(k=1,2,¡,m)¿ÉÒÔÊǹºÎïÀºÖеÄÎïÆ·£¬Ò²¿ÉÒÔÊDZ£ÏÕ¹«Ë¾µÄ¹Ë¿Í¡£ÉèÈÎÎñÏà¹ØµÄÊý¾ÝDÊÇÊÂÎñ¼¯£¬ÆäÖÐÿ¸öÊÂÎñTÊÇÏ£¬Ê¹µÃTÍI¡£ÉèAÊÇÒ»¸öÏ£¬ÇÒAÍT¡£ ¹ØÁª¹æÔòÊÇÈçÏÂÐÎʽµÄÂß¼Ô̺£ºA Þ B£¬AÌI, AÌI£¬ÇÒA¡ÉB=F¡£¹ØÁª¹æÔò¾ßÓÐÈçÏÂÁ½¸öÖØÒªµÄÊôÐÔ£º Ö§³Ö¶È: P(A¡ÈB)£¬¼´AºÍBÕâÁ½¸öÏÔÚÊÂÎñ¼¯DÖÐͬʱ³öÏֵĸÅÂÊ¡£ ÖÃÐŶÈ: P(B£üA)£¬¼´ÔÚ³öÏÖÏAµÄÊÂÎñ¼¯DÖУ¬ÏBҲͬʱ³öÏֵĸÅÂÊ¡£ ͬʱÂú×ã×îС֧³Ö¶ÈãÐÖµºÍ×îСÖÃÐŶÈãÐÖµµÄ¹æÔò³ÆÎªÇ¿¹æÔò¡£¸ø¶¨Ò»¸öÊÂÎñ¼¯D£¬ÍÚ¾ò¹ØÁª¹æÔòÎÊÌâ¾ÍÊDzúÉúÖ§³Ö¶ÈºÍ¿ÉÐŶȷֱð´óÓÚÓû§¸ø¶¨µÄ×îС֧³Ö¶ÈºÍ×îС¿ÉÐŶȵĹØÁª¹æÔò£¬Ò²¾ÍÊDzúÉúÇ¿¹æÔòµÄÎÊÌâ¡£ 3 ¹ØÁª¹æÔòÖÖÀà 1) »ùÓÚ¹æÔòÖд¦ÀíµÄ±äÁ¿µÄÀà±ð£¬¹ØÁª¹æÔò¿ÉÒÔ·ÖΪ²¼¶ûÐͺÍÊýÖµÐÍ¡£ ²¼¶ûÐ͹ØÁª¹æÔò´¦ÀíµÄÖµ¶¼ÊÇÀëÉ¢µÄ¡¢ÖÖÀ໯µÄ£¬ËüÏÔʾÁËÕâЩ±äÁ¿Ö®¼äµÄ¹ØÏµ¡£ ÊýÖµÐ͹ØÁª¹æÔò¿ÉÒԺͶàά¹ØÁª»ò¶à²ã¹ØÁª¹æÔò½áºÏÆðÀ´£¬¶ÔÊýÖµÐÍ×ֶνøÐд¦Àí£¬½«Æä½øÐж¯Ì¬µÄ·Ö¸î£¬»òÕßÖ±½Ó¶ÔÔʼµÄÊý¾Ý½øÐд¦Àí£¬µ±È»ÊýÖµÐ͹ØÁª¹æÔòÖÐÒ²¿ÉÒÔ°üº¬ÖÖÀà±äÁ¿¡£ 2) »ùÓÚ¹æÔòÖÐÊý¾ÝµÄ³éÏó²ã´Î£¬¿ÉÒÔ·ÖΪµ¥²ã¹ØÁª¹æÔòºÍ¶à²ã¹ØÁª¹æÔò¡£ ÔÚµ¥²ã¹ØÁª¹æÔòÖУ¬ËùÓеıäÁ¿¶¼Ã»Óп¼Âǵ½ÏÖʵµÄÊý¾ÝÊǾßÓжà¸ö²»Í¬µÄ²ã´ÎµÄ¡£ ÔÚ¶à²ã¹ØÁª¹æÔòÖУ¬¶ÔÊý¾ÝµÄ¶à²ãÐÔÒѾ½øÐÐÁ˳ä·ÖµÄ¿¼ÂÇ¡£ 3) »ùÓÚ¹æÔòÖÐÉæ¼°µ½µÄÊý¾ÝµÄάÊý£¬¹ØÁª¹æÔò¿ÉÒÔ·ÖΪµ¥Î¬µÄºÍ¶àάµÄ¡£ ÔÚµ¥Î¬¹ØÁª¹æÔòÖУ¬ÎÒÃÇֻɿ¼°µ½Êý¾ÝµÄÒ»¸öά£¬ÈçÓû§¹ºÂòµÄÎïÆ· ÔÚ¶àά¹ØÁª¹æÔòÖУ¬Òª´¦ÀíµÄÊý¾Ý½«»áÉæ¼°¶à¸öά¡£ 4 Ëã·¨×ÛÊö 4.1 ¾µäµÄƵ¼¯Ëã·¨ AgrawalµÈÓÚ1994ÄêÌá³öÁËÒ»¸öÍÚ¾ò¹Ë¿Í½»Ò×Êý¾Ý¿âÖÐÏ¼äµÄ¹ØÁª¹æÔòµÄÖØÒª·½·¨ [AS94a, AS94b]£¬ÆäºËÐÄÊÇ»ùÓÚÁ½½×¶ÎƵ¼¯Ë¼ÏëµÄµÝÍÆËã·¨¡£¸Ã¹ØÁª¹æÔòÔÚ·ÖÀàÉÏÊôÓÚµ¥Î¬¡¢µ¥²ã¡¢²¼¶û¹ØÁª¹æÔò¡£ ËùÓÐÖ§³Ö¶È´óÓÚ×îС֧³Ö¶ÈµÄÏ³ÆÎªÆµ·±Ï£¬¼ò³ÆÆµ¼¯¡£ 4.1.1 Ëã·¨µÄ»ù±¾Ë¼Ïë Ê×ÏÈÕÒ³öËùÓÐµÄÆµ¼¯£¬ÕâЩÏ³öÏֵįµ·±ÐÔÖÁÉÙºÍÔ¤¶¨ÒåµÄ×îС֧³Ö¶ÈÒ»Ñù¡£È»ºóÓÉÆµ¼¯²úÉúÇ¿¹ØÁª¹æÔò£¬ÕâЩ¹æÔò±ØÐëÂú×ã×îС֧³Ö¶ÈºÍ×îС¿ÉÐŶȡ£ ÍÚ¾ò¹ØÁª¹æÔòµÄ×ÜÌåÐÔÄÜÓɵÚÒ»²½¾ö¶¨£¬µÚ¶þ²½Ïà¶ÔÈÝÒ×ʵÏÖ¡£ 4.1.2 AprioriºËÐÄËã·¨·ÖÎö ΪÁËÉú³ÉËùÓÐÆµ¼¯£¬Ê¹ÓÃÁ˵ÝÍÆµÄ·½·¨¡£ÆäºËÐÄ˼Ïë¼òÒªÃèÊöÈçÏ£º (1) L1 = {large 1-itemsets}; (2) for (k=2; Lk-1¹F; k++) do begin (3) Ck=apriori-gen(Lk-1); //еĺòÑ¡¼¯ (4) for all transactions tÎD do begin (5) Ct=subset(Ck,t); //ÊÂÎñtÖаüº¬µÄºòÑ¡¼¯ (6) for all candidates cÎ Ct do (7) c.count++; (8) end (9) Lk={cÎ Ck |c.count³minsup} (10) end (11) Answer=¡ÈkLk; Ê×ÏȲúÉúƵ·±1-ÏL1£¬È»ºóÊÇÆµ·±2-ÏL2£¬Ö±µ½ÓÐij¸örֵʹµÃLrΪ¿Õ£¬ÕâʱË㷨ֹͣ¡£ÕâÀïÔÚµÚk´ÎÑ»·ÖУ¬¹ý³ÌÏȲúÉúºòÑ¡k-ÏµÄ¼¯ºÏCk£¬CkÖеÄÿһ¸öÏÊǶÔÁ½¸öÖ»ÓÐÒ»¸öÏͬµÄÊôÓÚLk-1µÄƵ¼¯×öÒ»¸ö(k-2)-Á¬½ÓÀ´²úÉúµÄ¡£CkÖеÄÏÊÇÓÃÀ´²úÉúƵ¼¯µÄºòÑ¡¼¯£¬×îºóµÄƵ¼¯Lk±ØÐëÊÇCkµÄÒ»¸ö×Ó¼¯¡£CkÖеÄÿ¸öÔªËØÐèÔÚ½»Ò×Êý¾Ý¿âÖнøÐÐÑéÖ¤À´¾ö¶¨ÆäÊÇ·ñ¼ÓÈëLk£¬ÕâÀïµÄÑéÖ¤¹ý³ÌÊÇËã·¨ÐÔÄܵÄÒ»¸öÆ¿¾±¡£Õâ¸ö·½·¨ÒªÇó¶à´ÎɨÃè¿ÉÄܴܺóµÄ½»Ò×Êý¾Ý¿â£¬¼´Èç¹ûƵ¼¯×î¶à°üº¬10¸öÏÄÇô¾ÍÐèҪɨÃè½»Ò×Êý¾Ý¿â10±é£¬ÕâÐèÒªºÜ´óµÄI/O¸ºÔØ¡£ ¿ÉÄܲúÉú´óÁ¿µÄºòÑ¡¼¯,ÒÔ¼°¿ÉÄÜÐèÒªÖØ¸´É¨ÃèÊý¾Ý¿â£¬ÊÇAprioriËã·¨µÄÁ½´óȱµã¡£ 4.1.3 Ëã·¨µÄÓÅ»¯ ΪÁËÌá¸ßËã·¨µÄЧÂÊ£¬MannilaµÈÒýÈëÁËÐÞ¼ô¼¼ÊõÀ´¼õСºòÑ¡¼¯CkµÄ´óС[MTV94]£¬ÓÉ´Ë¿ÉÒÔÏÔÖøµØ¸Ä½øÉú³ÉËùÓÐÆµ¼¯Ëã·¨µÄÐÔÄÜ¡£Ëã·¨ÖÐÒýÈëµÄÐÞ¼ô²ßÂÔ»ùÓÚÕâÑùÒ»¸öÐÔÖÊ£ºÒ»¸öÏÊÇÆµ¼¯µ±ÇÒ½öµ±ËüµÄËùÓÐ×Ó¼¯¶¼ÊÇÆµ¼¯¡£ÄÇô£¬Èç¹ûCkÖÐij¸öºòÑ¡ÏÓÐÒ»¸ö(k-1)-×Ó¼¯²»ÊôÓÚLk-1£¬ÔòÕâ¸öÏ¿ÉÒÔ±»ÐÞ¼ôµô²»ÔÙ±»¿¼ÂÇ£¬Õâ¸öÐÞ¼ô¹ý³Ì¿ÉÒÔ½µµÍ¼ÆËãËùÓеĺòÑ¡¼¯µÄÖ§³Ö¶ÈµÄ´ú¼Û¡£ 4.2 ¸Ä½øµÄƵ¼¯Ëã·¨ 4.2.1É¢ÁÐ ¸ÃËã·¨ÓÉParkµÈÔÚ1995ÄêÌá³ö[PCY95b]¡£Í¨¹ýʵÑé·¢ÏÖѰÕÒÆµ·±ÏµÄÖ÷Òª¼ÆËãÊÇÔÚÉú³ÉƵ·±2ÏL2ÉÏ£¬Park¾ÍÊÇÀûÓÃÕâ¸öÐÔÖÊÒýÈëÉ¢Áм¼ÊõÀ´¸Ä½ø²úÉúƵ·±2ÏµÄ·½·¨¡£ Æä»ù±¾Ë¼ÏëÊÇ£ºµ±É¨ÃèÊý¾Ý¿âÖÐÿ¸öÊÂÎñ£¬ÓÉC1ÖеĺòÑ¡1Ï²úÉúƵ·±1ÏL1ʱ£¬¶Ôÿ¸öÊÂÎñ²úÉúËùÓеÄ2Ï£¬½«ËüÃÇÉ¢Áе½É¢Áбí½á¹¹µÄ²»Í¬Í°ÖУ¬²¢Ôö¼Ó¶ÔÓ¦µÄͰ¼ÆÊý£¬ÔÚÉ¢ÁбíÖжÔÓ¦µÄͰ¼ÆÊýµÍÓÚÖ§³Ö¶ÈãÐÖµµÄ2Ï²»¿ÉÄÜÊÇÆµ·±2Ï£¬¿É´ÓºòÑ¡2ÏÖÐɾ³ý£¬ÕâÑù¾Í¿É´ó´óѹËõÁËÒª¿¼ÂǵÄ2Ï¡£ 4.2.2 ÊÂÎñѹËõ AgrawalµÈÌá³öѹËõ½øÒ»²½µü´úɨÃèµÄÊÂÎñÊýµÄ·½·¨[AS94b, HF95]¡£ÒòΪ²»°üº¬ÈκÎKÏµÄÊÂÎñ£¬²»¿ÉÄܰüº¬ÈκΣ¨K+1£©Ï£¬¿É¶ÔÕâЩÊÂÎñ¼ÓÉÏɾ³ý±êÖ¾,ɨÃèÊý¾Ý¿âʱ²»ÔÙ¿¼ÂÇ¡£ 4.2.3 ÔÓ´Õ Ò»¸ö¸ßЧµØ²úÉúƵ¼¯µÄ»ùÓÚÔÓ´ÕµÄËã·¨ÓÉParkµÈÌá³ö[PCY95a]¡£Í¨¹ýʵÑéÎÒÃÇ¿ÉÒÔ·¢ÏÖѰÕÒÆµ¼¯Ö÷ÒªµÄ¼ÆËãÊÇÔÚÉú³ÉƵ·±2-ÏLkÉÏ£¬ParkµÈ¾ÍÊÇÀûÓÃÁËÕâ¸öÐÔÖÊÒýÈëÔÓ´Õ¼¼ÊõÀ´¸Ä½ø²úÉúƵ·±2-ÏµÄ·½·¨¡£ 4.2.4 »®·Ö SavasereµÈÉè¼ÆÁËÒ»¸ö»ùÓÚ»®·ÖµÄËã·¨[SON95]£¬Õâ¸öËã·¨ÏȰÑÊý¾Ý¿â´ÓÂß¼ÉϷֳɼ¸¸ö»¥²»ÏཻµÄ¿é£¬Ã¿´Îµ¥¶À¿¼ÂÇÒ»¸ö·Ö¿é²¢¶ÔËüÉú³ÉËùÓÐµÄÆµ¼¯£¬È»ºó°Ñ²úÉúµÄƵ¼¯ºÏ²¢£¬ÓÃÀ´Éú³ÉËùÓпÉÄܵįµ¼¯£¬×îºó¼ÆËãÕâЩÏµÄÖ§³Ö¶È¡£ÕâÀï·Ö¿éµÄ´óСѡÔñҪʹµÃÿ¸ö·Ö¿é¿ÉÒÔ±»·ÅÈëÖ÷´æ£¬Ã¿¸ö½×¶ÎÖ»Ð豻ɨÃèÒ»´Î¡£¶øËã·¨µÄÕýÈ·ÐÔÊÇÓÉÿһ¸ö¿ÉÄܵįµ¼¯ÖÁÉÙÔÚijһ¸ö·Ö¿éÖÐÊÇÆµ¼¯±£Ö¤µÄ¡£ÉÏÃæËùÌÖÂÛµÄËã·¨ÊÇ¿ÉÒԸ߶Ȳ¢Ðеģ¬¿ÉÒÔ°Ñÿһ·Ö¿é·Ö±ð·ÖÅä¸øÄ³Ò»¸ö´¦ÀíÆ÷Éú³ÉƵ¼¯¡£²úÉúƵ¼¯µÄÿһ¸öÑ»·½áÊøºó£¬´¦ÀíÆ÷Ö®¼ä½øÐÐͨÐÅÀ´²úÉúÈ«¾ÖµÄºòÑ¡k-Ï¡£Í¨³£ÕâÀïµÄͨÐŹý³ÌÊÇËã·¨Ö´ÐÐʱ¼äµÄÖ÷Ҫƿ¾±£»¶øÁíÒ»·½Ã棬ÿ¸ö¶ÀÁ¢µÄ´¦ÀíÆ÷Éú³ÉƵ¼¯µÄʱ¼äÒ²ÊÇÒ»¸öÆ¿¾±¡£ÆäËûµÄ·½·¨»¹ÓÐÔÚ¶à´¦ÀíÆ÷Ö®¼ä¹²ÏíÒ»¸öÔÓ´ÕÊ÷À´²úÉúƵ¼¯¡£¸ü¶àµÄ¹ØÓÚÉú³ÉƵ¼¯µÄ²¢Ðл¯·½·¨¿ÉÒÔÔÚÎÄÏ×[AS96]ÖÐÕÒµ½¡£ 4.2.5 Ñ¡Ñù »ù±¾Ë¼ÏëÊÇÔÚ¸ø¶¨Êý¾ÝµÄÒ»¸ö×Ó¼¯ÍÚ¾ò¡£¶Ôǰһ±éɨÃèµÃµ½µÄÐÅÏ¢£¬×ÐϸµØ×éºÏ·ÖÎö£¬¿ÉÒԵõ½Ò»¸ö¸Ä½øµÄËã·¨£¬MannilaµÈÏÈ¿¼ÂÇÁËÕâÒ»µã[MTV94]£¬ËûÃÇÈÏΪ²ÉÑùÊÇ·¢ÏÖ¹æÔòµÄÒ»¸öÓÐЧ;¾¶¡£ËæºóÓÖÓÉToivonen½øÒ»²½·¢Õ¹ÁËÕâ¸ö˼Ïë[Toi96]£¬ÏÈʹÓôÓÊý¾Ý¿âÖгéÈ¡³öÀ´µÄ²ÉÑùµÃµ½Ò»Ð©ÔÚÕû¸öÊý¾Ý¿âÖпÉÄܳÉÁ¢µÄ¹æÔò£¬È»ºó¶ÔÊý¾Ý¿âµÄÊ£Óಿ·ÖÑéÖ¤Õâ¸ö½á¹û¡£ToivonenµÄËã·¨Ï൱¼òµ¥²¢ÏÔÖøµØ¼õÉÙÁËI/O´ú¼Û£¬µ«ÊÇÒ»¸öºÜ´óµÄȱµã¾ÍÊDzúÉúµÄ½á¹û²»¾«È·£¬¼´´æÔÚËùνµÄÊý¾ÝŤÇú(data skew)¡£·Ö²¼ÔÚÍ¬Ò»Ò³ÃæÉϵÄÊý¾Ýʱ³£ÊǸ߶ÈÏà¹ØµÄ£¬¿ÉÄܲ»ÄܱíʾÕû¸öÊý¾Ý¿âÖÐģʽµÄ·Ö²¼£¬Óɴ˶øµ¼ÖµÄÊDzÉÑù5%µÄ½»Ò×Êý¾ÝËù»¨·ÑµÄ´ú¼Û¿ÉÄÜͬɨÃèÒ»±éÊý¾Ý¿âÏà½ü¡£ 4.2.6 ¶¯Ì¬Ï¼ÆÊý BrinµÈÈ˸ø³ö¸ÃËã·¨[BMUT97]¡£¶¯Ì¬Ï¼ÆÊý¼¼Êõ½«Êý¾Ý¿â»®·ÖΪ±ê¼Ç¿ªÊ¼µãµÄ¿é¡£²»ÏóApriori½öÔÚÿ´ÎÍêÕûµÄÊý¾Ý¿âɨÃè֮ǰȷ¶¨ÐµĺòÑ¡£¬ÔÚÕâÖÖ±äÐÎÖУ¬¿ÉÒÔÔÚÈκοªÊ¼µãÌí¼ÓеĺòÑ¡Ï¡£¸Ã¼¼Êõ¶¯Ì¬µØÆÀ¹ÀÒÔ±»¼ÆÊýµÄËùÓÐÏµÄÖ§³Ö¶È£¬Èç¹ûÒ»¸öÏµÄËùÓÐ×Ó¼¯ÒÔ±»È·¶¨ÎªÆµ·±µÄ£¬ÔòÌí¼ÓËü×÷ΪеĺòÑ¡¡£½á¹ûËã·¨ÐèÒªµÄÊý¾Ý¿âɨÃè±ÈApriori ÉÙ¡£ 4.3 FP-Ê÷Ƶ¼¯Ëã·¨ Õë¶ÔAprioriËã·¨µÄ¹ÌÓÐȱÏÝ£¬J. HanµÈÌá³öÁ˲»²úÉúºòÑ¡ÍÚ¾òƵ·±ÏµÄ·½·¨¡ªFP-Ê÷Ƶ¼¯Ëã·¨[HPY00]¡£²ÉÓ÷ֶøÖÎÖ®µÄ²ßÂÔ£¬ÔÚ¾¹ýµÚÒ»±éɨÃèÖ®ºó£¬°ÑÊý¾Ý¿âÖÐµÄÆµ¼¯Ñ¹Ëõ½øÒ»¿ÃƵ·±Ä£Ê½Ê÷£¨FP-tree£©£¬Í¬Ê±ÒÀÈ»±£ÁôÆäÖеĹØÁªÐÅÏ¢£¬ËæºóÔÙ½«FP-tree·Ö»¯³ÉһЩÌõ¼þ¿â£¬Ã¿¸ö¿âºÍÒ»¸ö³¤¶ÈΪ1µÄƵ¼¯Ïà¹Ø£¬È»ºóÔÙ¶ÔÕâЩÌõ¼þ¿â·Ö±ð½øÐÐÍÚ¾ò¡£µ±ÔʼÊý¾ÝÁ¿ºÜ´óµÄʱºò£¬Ò²¿ÉÒÔ½áºÏ»®·ÖµÄ·½·¨,ʹµÃÒ»¸öFP-tree¿ÉÒÔ·ÅÈëÖ÷´æÖС£ÊµÑé±íÃ÷£¬FP-growth¶Ô²»Í¬³¤¶ÈµÄ¹æÔò¶¼ÓкܺõÄÊÊÓ¦ÐÔ£¬Í¬Ê±ÔÚЧÂÊÉϽÏÖ®aprioriËã·¨Óо޴óµÄÌá¸ß¡£ 4.4 ¶à²ã¹ØÁª¹æÔòÍÚ¾ò ¶ÔÓںܶàµÄÓ¦ÓÃÀ´Ëµ£¬ÓÉÓÚÊý¾Ý·Ö²¼µÄ·ÖÉ¢ÐÔ£¬ËùÒÔºÜÄÑÔÚÊý¾Ý×îϸ½ÚµÄ²ã´ÎÉÏ·¢ÏÖһЩǿ¹ØÁª¹æÔò¡£µ±ÎÒÃÇÒýÈë¸ÅÄî²ã´Îºó£¬¾Í¿ÉÒÔÔڽϸߵIJã´ÎÉϽøÐÐÍÚ¾ò[HF95, SA95]¡£ËäÈ»½Ï¸ß²ã´ÎÉϵóöµÄ¹æÔò¿ÉÄÜÊǸüÆÕͨµÄÐÅÏ¢£¬µ«ÊǶÔÓÚÒ»¸öÓû§À´ËµÊÇÆÕͨµÄÐÅÏ¢£¬¶ÔÓÚÁíÒ»¸öÓû§È´Î´±ØÈç´Ë¡£ËùÒÔÊý¾ÝÍÚ¾òÓ¦¸ÃÌṩÕâÑùÒ»ÖÖÔÚ¶à¸ö²ã´ÎÉϽøÐÐÍÚ¾òµÄ¹¦ÄÜ¡£ ¶à²ã¹ØÁª¹æÔòµÄ·ÖÀࣺ¸ù¾Ý¹æÔòÖÐÉæ¼°µ½µÄ²ã´Î£¬¶à²ã¹ØÁª¹æÔò¿ÉÒÔ·ÖΪͬ²ã¹ØÁª¹æÔòºÍ²ã¼ä¹ØÁª¹æÔò¡£ ¶à²ã¹ØÁª¹æÔòµÄÍÚ¾ò»ù±¾ÉÏ¿ÉÒÔÑØÓá°Ö§³Ö¶È-¿ÉÐŶȡ±µÄ¿ò¼Ü¡£²»¹ý£¬ÔÚÖ§³Ö¶ÈÉèÖõÄÎÊÌâÉÏÓÐһЩҪ¿¼ÂǵĶ«Î÷¡£ ͬ²ã¹ØÁª¹æÔò¿ÉÒÔ²ÉÓÃÁ½ÖÖÖ§³Ö¶È²ßÂÔ£º 1) ͳһµÄ×îС֧³Ö¶È¡£¶ÔÓÚ²»Í¬µÄ²ã´Î£¬¶¼Ê¹ÓÃͬһ¸ö×îС֧³Ö¶È¡£ÕâÑù¶ÔÓÚÓû§ºÍË㷨ʵÏÖÀ´Ëµ¶¼±È½ÏµÄÈÝÒ×£¬µ«ÊDZ׶ËÒ²ÊÇÏÔÈ»µÄ¡£ 2) µÝ¼õµÄ×îС֧³Ö¶È¡£Ã¿¸ö²ã´Î¶¼Óв»Í¬µÄ×îС֧³Ö¶È£¬½ÏµÍ²ã´ÎµÄ×îС֧³Ö¶ÈÏà¶Ô½ÏС¡£Í¬Ê±»¹¿ÉÒÔÀûÓÃÉϲãÍÚ¾òµÃµ½µÄÐÅÏ¢½øÐÐһЩ¹ýÂ˵Ť×÷¡£ ²ã¼ä¹ØÁª¹æÔò¿¼ÂÇ×îС֧³Ö¶ÈµÄʱºò£¬Ó¦¸Ã¸ù¾Ý½ÏµÍ²ã´ÎµÄ×îС֧³Ö¶ÈÀ´¶¨¡£ 4.5 ¶àά¹ØÁª¹æÔòÍÚ¾ò ¶ÔÓÚ¶àάÊý¾Ý¿â¶øÑÔ£¬³ýάÄڵĹØÁª¹æÔòÍ⣬»¹ÓÐÒ»Àà¶àάµÄ¹ØÁª¹æÔò¡£ÀýÈ磺 ÄêÁ䣨X£¬¡°20...30¡±£© Ö°Òµ£¨X,¡°Ñ§Éú¡±£©==> ¹ºÂò(X£¬¡°±Ê¼Ç±¾µçÄÔ¡±) ÔÚÕâÀïÎÒÃǾÍÉæ¼°µ½Èý¸öάÉϵÄÊý¾Ý£ºÄêÁä¡¢Ö°Òµ¡¢¹ºÂò¡£ ¸ù¾ÝÊÇ·ñÔÊÐíͬһ¸öÎ¬ÖØ¸´³öÏÖ£¬¿ÉÒÔÓÖϸ·ÖΪά¼äµÄ¹ØÁª¹æÔò£¨²»ÔÊÐíÎ¬ÖØ¸´³öÏÖ£©ºÍ»ìºÏά¹ØÁª¹æÔò£¨ÔÊÐíάÔÚ¹æÔòµÄ×óÓÒͬʱ³öÏÖ£©¡£ ÄêÁ䣨X£¬¡°20...30¡±£© ¹ºÂò(X£¬¡°±Ê¼Ç±¾µçÄÔ¡±) ==> ¹ºÂò(X£¬¡°´òÓ¡»ú¡±) Õâ¸ö¹æÔò¾ÍÊÇ»ìºÏά¹ØÁª¹æÔò¡£ ÔÚÍÚ¾òά¼ä¹ØÁª¹æÔòºÍ»ìºÏά¹ØÁª¹æÔòµÄʱºò£¬»¹Òª¿¼ÂDz»Í¬µÄ×Ö¶ÎÖÖÀࣺÖÖÀàÐͺÍÊýÖµÐÍ¡£ ¶ÔÓÚÖÖÀàÐ͵Ä×ֶΣ¬ÔÏȵÄËã·¨¶¼¿ÉÒÔ´¦Àí¡£¶ø¶ÔÓÚÊýÖµÐ͵Ä×ֶΣ¬ÐèÒª½øÐÐÒ»¶¨µÄ´¦ÀíÖ®ºó²Å¿ÉÒÔ½øÐÐ[KHC97]¡£´¦ÀíÊýÖµÐÍ×ֶεķ½·¨»ù±¾ÉÏÓÐÒÔϼ¸ÖÖ£º 1) ÊýÖµ×ֶα»·Ö³ÉһЩԤ¶¨ÒåµÄ²ã´Î½á¹¹¡£ÕâÐ©Çø¼ä¶¼ÊÇÓÉÓû§Ô¤Ïȶ¨ÒåµÄ¡£µÃ³öµÄ¹æÔòÒ²½Ð×ö¾²Ì¬ÊýÁ¿¹ØÁª¹æÔò¡£ 2) ÊýÖµ×ֶθù¾ÝÊý¾ÝµÄ·Ö²¼·Ö³ÉÁËһЩ²¼¶û×ֶΡ£Ã¿¸ö²¼¶û×ֶζ¼±íʾһ¸öÊýÖµ×ֶεÄÇø¼ä£¬ÂäÔÚÆäÖÐÔòΪ1£¬·´Ö®Îª0¡£ÕâÖÖ·Ö·¨ÊǶ¯Ì¬µÄ¡£µÃ³öµÄ¹æÔò½Ð²¼¶ûÊýÁ¿¹ØÁª¹æÔò¡£ 3) ÊýÖµ×ֶα»·Ö³ÉһЩÄÜÌåÏÖËüº¬ÒåµÄÇø¼ä¡£Ëü¿¼ÂÇÁËÊý¾ÝÖ®¼äµÄ¾àÀëµÄÒòËØ¡£µÃ³öµÄ¹æÔò½Ð»ùÓÚ¾àÀëµÄ¹ØÁª¹æÔò¡£ 4) Ö±½ÓÓÃÊýÖµ×Ö¶ÎÖеÄÔʼÊý¾Ý½øÐзÖÎö¡£Ê¹ÓÃһЩͳ¼ÆµÄ·½·¨¶ÔÊýÖµ×ֶεÄÖµ½øÐзÖÎö£¬²¢ÇÒ½áºÏ¶à²ã¹ØÁª¹æÔòµÄ¸ÅÄÔÚ¶à¸ö²ã´ÎÖ®¼ä½øÐбȽϴӶøµÃ³öһЩÓÐÓõĹæÔò¡£µÃ³öµÄ¹æÔò½Ð¶à²ãÊýÁ¿¹ØÁª¹æÔò¡£ 5 Õ¹Íû ¶ÔÓÚ¹ØÁª¹æÔòÍÚ¾òÁìÓòµÄ·¢Õ¹£¬±ÊÕßÈÏΪ¿ÉÒÔÔÚÈçÏÂһЩ·½ÏòÉϽøÐÐÉîÈëÑо¿£ºÔÚ´¦Àí¼«´óÁ¿µÄÊý¾Ýʱ£¬ÈçºÎÌá¸ßË㷨ЧÂÊ£»¶ÔÓÚÍÚ¾òѸËÙ¸üеÄÊý¾ÝµÄÍÚ¾òËã·¨µÄ½øÒ»²½Ñо¿£»ÔÚÍÚ¾òµÄ¹ý³ÌÖУ¬ÌṩһÖÖÓëÓû§½øÐн»»¥µÄ·½·¨£¬½«Óû§µÄÁìÓò֪ʶ½áºÏÔÚÆäÖУ»¶ÔÓÚÊýÖµÐÍ×Ö¶ÎÔÚ¹ØÁª¹æÔòÖеĴ¦ÀíÎÊÌ⣻Éú³É½á¹ûµÄ¿ÉÊÓ»¯£¬µÈµÈ¡£ ²Î¿¼ÎÄÏ× [AIS93b] R. Agrawal, T. Imielinski, and A. Swami. Mining association rules between sets of items in large databases. Proceedings of the ACM SIGMOD Conference on Management of data, p.p. 207-216, May 1993. [AS94a] R. Agrawal, and R. Srikant. Fast algorithms for mining association rules in large database. Technical Report FJ9839, IBM Almaden Research Center, San Jose, CA, Jun. 1994. [AS94b] R. Agrawal, and R. Srikant. Fast algorithms for mining association rules. In Proc. 1994 Int. Conf. Very Large Databases(VLDB¡¯94), Sep. 1994. [AS96] R. Agrawal, and J. Shafer. Parallel mining of association rules: Design, Implementation, and Experience. IEEE Trans. Knowledge and data engineering, 8:962-969, Jan. 1996. [BMUT97] S. Brin, R. Motwani, J. D. Ullman, and S. Tsur. Dynamic itemset counting and implication rules for market basket data. In ACM SIGMOD International Conference On the Management of Data, p.p. 255-264, May 1997. [HF95] J, Han and Y. Fu. Discovery of multiple-level association rules from large databases. In Proc. 1995 Int. Conf. Very Large Databases(VLDB¡¯95), p.p. 402-431, Sep. 1995. [HPY00] J.Han,J.Pei, and Y.Yin.Mining. Frequent patterns without candidate generation. In Proc. 2000 ACM-SIGMOD Int. Conf. Management of Data (SIGMOD¡¯00), p.p. 1-12, May 2000. [KHC97] M. Kamber, J. Han, J. Chang. Metarule-guided mining of multi-dimensional association rules using data cubes. In Proc. 1997 Int. Conf. Khowledge Discovery and Data Mining(KDD¡¯97), p.p. 207-210, Aug. 1997 [KPR98] J. Kleinberg, C. Papadimitriou, and P. Raghavan. Segmentation problems. Proceedings of the 30th Annual Symposium on Theory of Computing, ACM. Sep. 1998. [MTV94] H. Mannila, H. Toivonen, and A. Verkamo. Efficient algorithm for discovering association rules. AAAI Workshop on Knowledge Discovery in Databases, p.p. 181-192, Jul. 1994. [PCY95a] J. S. Park, M. S. Chen, and P. S. Yu. An effective hash-based algorithm for mining association rules. Proceedings of ACM SIGMOD International Conference on Management of Data, p.p. 175-186, May 1995. [PCY95b] J. S. Park, M. S. Chen, and P. S. Yu. Efficient parallel data mining of association rules. 4th International Conference on Information and Knowledge Management, Baltimore, Maryland, Nov. 1995. [SA95] R. Srikant, and R. Agrawal. Mining generalized association rules. Proceedings of the 21st International Conference on Very Large Database, p.p. 407-419, Sep.1995. [SON95] A. Savasere, E. Omiecinski, and S. Navathe. An efficient algorithm for mining association rules in large databases. Proceedings of the 21st International Conference on Very Large Database, p.p. 432-443, Sep. 1995. [Toi96] H. Toivonen. Sampling large databases for association rules. Proceedings of the 22nd International Conference on Very Large Database, Bombay, India, p.p. 134-145, Sep. 1996. [ Last edited by »ÃÓ°ÎÞºÛ on 2006-11-4 at 07:38 ] |
» ²ÂÄãϲ»¶
295Çóµ÷¼Á¡£Ò»Ö¾Ô¸±¨¿¼Ö£ÖÝ´óѧ»¯Ñ§¹¤ÒÕѧ˶£¬×Ü·Ö295·Ö
ÒѾÓÐ4È˻ظ´
289Çóµ÷¼Á
ÒѾÓÐ4È˻ظ´
½ËÕʡũ¿ÆÔºÕе÷¼Á1Ãû
ÒѾÓÐ4È˻ظ´
290·Ö²ÄÁϹ¤³Ì085601Çóµ÷¼Á Êý¶þÓ¢Ò»
ÒѾÓÐ3È˻ظ´
0856µ÷¼Á
ÒѾÓÐ7È˻ظ´
»¯¹¤×¨Ë¶348£¬Ò»Ö¾Ô¸985Çóµ÷¼Á
ÒѾÓÐ9È˻ظ´
0856»¯¹¤×¨Ë¶Çóµ÷¼Á
ÒѾÓÐ14È˻ظ´
282Çóµ÷¼Á
ÒѾÓÐ5È˻ظ´
³ÏÕÐũҵ²©Ê¿
ÒѾÓÐ3È˻ظ´
²ÄÁÏѧ˶318Çóµ÷¼Á
ÒѾÓÐ9È˻ظ´
yuefour
½ð³æ (ÕýʽдÊÖ)
- Ó¦Öú: 0 (Ó×¶ùÔ°)
- ¹ó±ö: 9.5
- ½ð±Ò: 1310.7
- Ìû×Ó: 838
- ÔÚÏß: 17Сʱ
- ³æºÅ: 64621
- ×¢²á: 2005-04-16
- ÐÔ±ð: GG
- רҵ: ÖÐÒ½ÄÚ¿Æ
2Â¥2005-06-21 13:34:18
3Â¥2006-02-08 19:26:44
wawlpa
ľ³æ (ÕýʽдÊÖ)
- Ó¦Öú: 6 (Ó×¶ùÔ°)
- ½ð±Ò: 3699
- É¢½ð: 1043
- Ìû×Ó: 796
- ÔÚÏß: 282.2Сʱ
- ³æºÅ: 188067
- ×¢²á: 2006-02-17
- רҵ: ÎïÁ÷Ó빩ӦÁ´¹ÜÀí
4Â¥2006-02-20 18:10:48
Microsoft
ľ³æ (СÓÐÃûÆø)
- Ó¦Öú: 0 (Ó×¶ùÔ°)
- ½ð±Ò: 2726.4
- Ìû×Ó: 115
- ÔÚÏß: 15.5Сʱ
- ³æºÅ: 130304
- ×¢²á: 2005-12-11
- רҵ: »úе¶¯Á¦Ñ§
5Â¥2006-03-12 10:52:01
0.5
|
6Â¥2006-03-13 08:48:38













»Ø¸´´ËÂ¥
10