A growing number of academic researchers are mining social media data to learn about both online and offline human behavior. In recent years, studies have claimed the ability to predict everything from summer blockbusters to fluctuations in the stock market. But mounting evidence of flaws in many of these studies points to a need for researchers to be wary of serious pitfalls that arise when working with huge social media data sets, according to computer scientists at McGill University in Montreal and Carnegie Mellon University in Pittsburgh.
Such erroneous results can have huge implications: thousands of research papers each year are now based on data gleaned from social media. "Many of these papers are used to inform and justify decisions and investments among the public and in industry and government," says Derek Ruths, an assistant professor in McGill's School of Computer Science.
In an article published in the Nov. 28 issue of the journal Science, Ruths and Jürgen Pfeffer of Carnegie Mellon's Institute for Software Research highlight several issues involved in using social media data sets -- along with strategies to address them. Among the challenges:
* Different social media platforms attract different users -- Pinterest, for example, is dominated by females aged 25-34 -- yet researchers rarely correct for the distorted picture these populations can produce.
* Publicly available data feeds used in social media research don't always provide an accurate representation of the platform's overall data -- and researchers are generally in the dark about when and how social media providers filter their data streams.
* The design of social media platforms can dictate how users behave and, therefore, what behavior can be measured. For instance, on Facebook the absence of a "dislike" button makes negative responses to content harder to detect than positive "likes."
* Large numbers of spammers and bots, which masquerade as normal users on social media, get mistakenly incorporated into many measurements and predictions of human behavior.
* Researchers often report results for groups of easy-to-classify users, topics, and events, making new methods seem more accurate than they actually are. For instance, efforts to infer the political orientation of Twitter users achieve barely 65% accuracy for typical users -- even though studies (focusing on politically active users) have claimed 90% accuracy.
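The last point can be made concrete with a small sketch. All of the counts below are invented for illustration; the idea is simply that a classifier evaluated only on an "easy" subset (such as politically active accounts) can report a high accuracy that does not carry over to the full user population.

```python
# Hypothetical illustration of accuracy inflation from easy-to-classify subsets.
# True = the classifier's prediction was correct for that user.

def accuracy(results):
    """Fraction of correct predictions in a list of booleans."""
    return sum(results) / len(results)

# Invented evaluation results: 100 "easy" (politically active) users,
# 100 "typical" users. The classifier does much better on the easy group.
easy_users = [True] * 90 + [False] * 10      # 90/100 correct
typical_users = [True] * 65 + [False] * 35   # 65/100 correct

subset_acc = accuracy(easy_users)
overall_acc = accuracy(easy_users + typical_users)

print(f"accuracy on easy subset: {subset_acc:.1%}")   # reported in some studies
print(f"accuracy on all users:   {overall_acc:.1%}")  # closer to reality
```

Reporting only `subset_acc` (90%) overstates how the method performs on the platform as a whole, where the overall figure here is 77.5%.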
Many of these problems have well-known solutions from other fields such as epidemiology, statistics, and machine learning, Ruths and Pfeffer write. "The common thread in all these issues is the need for researchers to be more acutely aware of what they're actually analyzing when working with social media data," Ruths says.
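One such standard remedy from survey statistics is post-stratification weighting: reweight each demographic group in a skewed platform sample so that the groups match their known shares of the general population. The numbers below are invented (a Pinterest-like skew toward women aged 25-34), purely to show the mechanics.

```python
# Minimal post-stratification sketch with invented numbers.
# Group shares in the platform sample vs. the general population.
platform_share = {"f25_34": 0.60, "other": 0.40}    # hypothetical platform skew
population_share = {"f25_34": 0.15, "other": 0.85}  # hypothetical census shares

# Observed mean of some metric of interest within each group (invented).
group_mean = {"f25_34": 0.70, "other": 0.30}

# Naive estimate: weight groups by their share of the platform sample.
naive = sum(platform_share[g] * group_mean[g] for g in group_mean)

# Post-stratified estimate: weight groups by their population share instead.
weighted = sum(population_share[g] * group_mean[g] for g in group_mean)

print(f"naive platform estimate:  {naive:.2f}")
print(f"post-stratified estimate: {weighted:.2f}")
```

With these invented figures, the naive platform estimate (0.54) overshoots the population-weighted estimate (0.36) because the over-represented group also scores higher on the metric.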
Social scientists have honed their techniques and standards to deal with this sort of challenge before. "The infamous 'Dewey Defeats Truman' headline of 1948 stemmed from telephone surveys that under-sampled Truman supporters in the general population," Ruths notes. "Rather than permanently discrediting the practice of polling, that glaring error led to today's more sophisticated techniques, higher standards, and more accurate polls. Now, we're poised at a similar technological inflection point. By tackling the issues we face, we'll be able to realize the tremendous potential for good promised by social media-based research."