24小时热门版块排行榜    

Znn3bq.jpeg
查看: 735  |  回复: 1

nuptsww

木虫 (小有名气)

[求助] 帮 翻译一段 论文

(Google 或 baidu 的翻译就不要回了)
T
HE practices of data-driven management and decision making have been pervasive and widely used in today’s industrial, business and governmental applications after initial successes of big data techniques in internet business. The data quality is regarded as a significant issue of industrial process, market success and decision-making activities .
However, more than 41% of the relevant projects would fail if only the original data were used due to the poor or insufficient quality of raw data according to a study by the Meta Group. Missing data which means that electronic data during some period is lost or hidden by uncontrollable factors is one of the major potential flaws in raw data and could result in severe failure. Therefore, the engineers have to sacrifice much time to retrieve this kind of data for further analysis. As a consequence, (semi-)automatic missing data prediction methods have been proposed.
A large collection of data mining and statistical methods have been proposed to improve data quality due to missing data. For example, Ma’s team proposed a good method for missing data prediction. The algorithm focused on recommender systems using improved collaborative filtering method which outperformed the traditional collaborative filtering method. Nogueira et alsolved a practical problem based on the Fast Fuzzy Clustering Algorithm in real world: the prediction of bankruptcy, in which the used data set has missing values. Lei and Wang presents a method for pre-processing the missing observed data by adopting the multiple imputation technique for Macau air pollution index (API) prediction using the Adaptive Neuro-Fuzzy Inference System (ANFIS). The API forecasting performance after missing data pre-processing is better than the conventional case without pre-processing.
In power grid systems, data missing happens so frequently due to the harsh working condition of sensors that classic methods often fail to handle. Expensive critical equipment such as main power transformers are monitored by multiple sensors. Unfortunately, these sensors are not as reliable as the equipment in the harsh open air working condition under the workload of 7*24 hours. Moreover, sensors in remote rural areas such as mountains are usually maintained at an even worse level by workers who received less training than workers in city. Thus, it is normal and inevitable for the sensor system to produce flaw data sets, which lost or hidden some necessary information . These losses affect the data quality so badly that classic data mining and statistical methods alone cannot process these data properly.
已阅   回复此楼   关注TA 给TA发消息 送TA红花 TA的回帖

baoshanqiu

至尊木虫 (著名写手)

【答案】应助回帖

数据驱动管理和决策的运用在互联网业务的大数据技术中取得最初成功以后,已经得到普及,现在广泛用于工业、商业和政府部门。数据的质量被认为是工业生产过程、市场成功以及决策活动的关键。
然而根据Meta集团的研究,如果直接使用原始数据,有超过41%的相关项目会因原始数据质量差或不足而失败。缺失数据是指因某一时期电子数据丢失或因无法控制的因素造成隐匿,是原始数据最主要的潜在缺陷之一,可造成严重失效。因此工程师必须花大量时间去恢复这样的数据作进一步分析。结果就提出了(半)自动化的缺失数据预测方法。
缺失数据的存在产生了大量的数据挖掘和统计学方法以改善数据质量。如Ma的团队提出了一种预测缺失数据的好方法。其算法集中于采纳一种比传统协同筛选法优越的改良协同筛选法的推荐系统。Nogueira等利用快速模糊聚类算法解决了现实生活中的一个实际问题:预测破产,其中使用的数据组有缺失值。Lei 和 Wang报告了一种采用多重填补技术预处理缺失观察数据的方法,利用自适应神经模糊推理系统(ANFIS)对澳门空气污染指数(API)进行预测。经缺失数据预处理后API预测性能比未经预处理的常规方法更好。
在电网中,由于传感器处于恶劣的工作条件而造成数据缺失屡见不鲜,惯常的处理方法难以奏效。像电力变压器这类昂贵的关键设备是由多个传感器监控的。不幸的是,在恶劣的露天工作条件和周7天24小时工作负荷下,这些传感器并没有该设备那么可靠。此外,在偏远的农村地区比如山区这些传感器得到的维护甚至更差。那里参与维护的工人接受的训练比城里的工人少。因此这样的传感器系统产生丢失或隐藏了一些必要信息的缺陷数据组是正常的、不可避免的。这些缺失严重影响数据的质量,单纯用传统的数据挖掘和统计方法不能恰当地处理这些数据。
2楼2015-06-24 04:58:06
已阅   回复此楼   关注TA 给TA发消息 送TA红花 TA的回帖
相关版块跳转 我要订阅楼主 nuptsww 的主题更新
最具人气热帖推荐 [查看全部] 作者 回/看 最后发表
[基金申请] 青C资助名额大幅增加! +9 西葫芦炒鸡蛋 2026-05-13 13/650 2026-05-15 00:18 by jackeychen7922
[文学芳草园] 风把牡丹吹跑了 +4 myrtle 2026-05-12 7/350 2026-05-14 23:58 by myrtle
[教师之家] 教学课件你会给同学吗 +8 硕士研究生吗 2026-05-13 8/400 2026-05-14 22:23 by 常规沥青
[考博] 26应届毕业生考博求助 +3 wo一定上岸 2026-05-13 3/150 2026-05-14 21:47 by 明海天涯
[基金申请] 重磅!青年科学基金项目(C类)资助增幅预计超过50% +5 水和泥不是水泥 2026-05-13 7/350 2026-05-14 20:57 by 水和泥不是水泥
[有机交流] 求助2,4-二氯-5-嘧啶甲醛的合成方法 20+3 光吃不拉 2026-05-14 5/250 2026-05-14 20:15 by 一切都是空工
[高分子] 本人最近太闲了,谁有问题可以提,每天会统一回复 +8 一切都是空工 2026-05-12 19/950 2026-05-14 20:03 by 一切都是空工
[考博] 申博自荐 +4 食品的橙子 2026-05-09 6/300 2026-05-14 16:05 by great1919
[基金申请] 这年头没有找到涵评专家,还有中面上的可能吗 +7 dd921ww 2026-05-12 8/400 2026-05-14 14:22 by dd921ww
[考博] 材料类只有一篇综述能申博么 +4 乐逍遥谷 2026-05-13 4/200 2026-05-14 12:05 by zhyzzh
[基金申请] 请问大佬b0816评完了吗 +3 市民华南虎 2026-05-12 7/350 2026-05-14 07:41 by 市民华南虎
[基金申请] 精华III评审感受-评审感受-评审感受 +12 ferrarichen 2026-05-11 16/800 2026-05-14 07:33 by 2000zf36392
[论文投稿] 有带发论文的吗 +3 山楂之术 2026-05-09 3/150 2026-05-13 17:56 by Cyhcl2629
[硕博家园] 导师各种操作恶心咋办 +11 苍白的小青天 2026-05-09 13/650 2026-05-13 17:11 by 六两废铜
[论文投稿] 护理论文 晋升 +5 Taylor1990, 2026-05-08 5/250 2026-05-13 14:40 by tegsgjy20
[论文投稿] 求助大佬sci投稿哪个好中 +3 江沅188 2026-05-12 4/200 2026-05-13 14:35 by 江沅188
[考博] 西南大学考核制博士 +3 lijunjie84 2026-05-11 6/300 2026-05-12 18:09 by lijunjie84
[文学芳草园] 窗边初夏的小雨 +7 阿美_Lml888 2026-05-09 10/500 2026-05-12 15:27 by 阿美_Lml888
[考博] 现在不知道怎么办,感觉很痛苦 +4 qweww 2026-05-11 5/250 2026-05-11 20:23 by Oversize
[考博] 生物学博士 +3 17749024330 2026-05-08 6/300 2026-05-11 14:29 by 17749024330
信息提示
请填处理意见