Time Limit: 1000MS | Memory Limit: 65536K | |
Total Submissions: 2269 | Accepted: 710 |
Description
.
As can be inferred from the above formula, a passage with a high Flesch Reading Ease score tends to favor shorter sentences and words, which is in compliance with commonsense in spite of partial accuracy. (Think of, for instance, the word "television". Long as it may seem, it is indeed one of the first words that any individual who studies English learns.) A related Wikipedia entry on Flesch Reading Ease [1] suggests that passages scoring 90~100 are comprehensible for an average American 5th grader, and 8th and 9th graders possess the ability to follow passages with a score in the range of 60~70, whereas passages not exceeding 30 in the score are best suitable for college graduates. The text of this problem, all sections taken into account, scores roughly 50 as per the calculation of Google Documents.
Despite the simplicity in its ideas, several aspects of its definition remains vague for any real-world implementation of Flesch Reading Ease. For the sake of precision and uniformity, the following restrictions adapted from [2] are adopted for this problem, to which you are to write a solution that effectively computes the Flesch Reading Ease score of a given passage of English text.
- Periods, explanation points, colons and semicolons serve as sentence delimiters.
- Each group of continuous non-blank characters with beginning and ending punctuation removed counts as a word.
- Each vowel (one of a, e, i, o, u and y) in a word is considered one syllable subject to that
- -es, -ed and -e (except -le) endings are ignored,
- words of three letters or shorter count as single syllables,
- consecutive vowels count as one syllable.
References
- Wikipedia contributors. Flesch-Kincaid Readability Test. Wikipedia, The Free Encyclopedia. August 30, 2007, 01:57 UTC. Available at: http://en.wikipedia.org/w/index.php?title=Flesch-Kincaid_Readability_Test&oldid=154509512. Accessed September 5, 2007.
- Talburt, J. 1985. The Flesch index: An easily programmable readability analysis algorithm. In Proceedings of the 4th Annual international Conference on Systems Documentation. SIGDOC '85. ACM Press, New York, NY, 114-122.
Input
Output
Sample Input
Flesch Reading Ease, a readability test named after its deviser Rudolf Flesch, is among most ubiquitously used readability tests, which are principally employed for assessment of the difficulty to understand a reading passage written in English. The Flesch Reading Ease score of a passage relies solely on three statistics, namely the total numbers of sentences, words and syllables, of the passage.
Sample Output
26.09
Source
大致题意:
给出一篇规范的文章,求其 句子数、单词数 和 音节数把这3个值代入题目给出的公式,输出其结果,保留2位小数。
标记单词分隔符: 逗号(,) 和 空格( )
句子分隔符:句号(.) 问号(?) 冒号(:) 分号(;) 感叹号(!)
音节处理要求:
(1)当单词总长度<=3时,音节数无条件+1
(2)
当单词总长度>3时,单词中每出现一个元音字母(a、e、i、o、u、y),音节数+1,但是连续的(>=2)元音字母只按1个音节计算,且当单词后缀为-es、-ed和-e时,后缀的元 音字母e不列为音节数计算。但是后缀-le例外,要计算音节数。
解题思路:
注意:
由于用while(cin>>msg)输入文章,因此是按 空字符
把文章分开若干片段,直到出现EOF时才结束输入,因此msg中的单词分隔符不会出现空格,只要当msg最后一个字符为字母时,就说明此时的单词分隔符为空格。
音节数是最难处理的,其规律如下:
(1) 当单词总长度<=3时,音节数无条件+1
(2)
当单词总长度>3时,单词中每出现一个元音字母(a、e、i、o、u、y),音节数+1,但是连续的元音字母只按1个音节计算,且当单词后缀为-es、-ed和-e时,后缀的元音字母e不列为音节数计算。但是后缀-le例外,要计算音节数。
注意:
(1)元音字母要判断12个,6个小写,6个大写。
(2)输入的文章每个字符只能扫描一次,若重复扫描会超时。
(3)G++用%.2f ; C++用%.2lf
AC代码+详细注释:
#include<cstdio> #include<iostream> using namespace std; char msg[1000]; int word=0;//单词数 int sentance=0;//句子数 int syllable=0;//音标数 bool isalpha(char ch){//检查字符ch是否为字母 if(ch>='A'&&ch<='Z') return 1; if(ch>='a'&&ch<='z') return 1; return 0; } bool isvowel(char ch){//检查字符ch是否为元音字母 if(ch=='a'||ch=='e'||ch=='i'||ch=='o'||ch=='u'||ch=='y') return 1; if(ch=='A'||ch=='E'||ch=='I'||ch=='O'||ch=='U'||ch=='Y') return 1; return 0; } bool isword(char ch){//检查字符ch是否为单词分隔符 if(ch==',') return 1; return 0; } bool issentance(char ch){//检查字符ch是否为句子分隔符 if(ch=='.'||ch=='?' ||ch==':'||ch==';'||ch=='!') return 1; return 0; } int main(){ while(cin>>msg){//以空格为标记,截取文章片段 int wordlen=0; bool flag_frevowel=0;//当当前字符为元音时,检查前一字符是否为元音的标记 int syl=0;//假设当前单词长度>3时,记录音节数的变化量。若单词实际长度<=3,则syllable-syl int i; for(i=0;msg[i];i++){ if(isalpha(msg[i])){//当前字符为 字母 wordlen++;//当前所处理的单词的已知长度 (已知长度<=实际长度) if(wordlen<=3){//当 已知长度<=3 时 if(!isalpha(msg[i+1])){//检查单词实际长度是否<=3 syllable++;//当实际长度<=3时,syllable无条件+1 syllable-=syl;//实际音节数调整,单词实际长度<=3,则syllable减去 "假设单词长度>3时" 音节数的变化量syl syl=0; continue; } } if(isvowel(msg[i])){//当前字母为 元音字母 if(msg[i]=='e'){ if(!isalpha(msg[i+1])&&msg[i-1]=='l'){//-le syllable++; syl++;//由于不知道单词的实际长度,因此总音节数syllable与音节数变化量syl同时递增 continue; } else if(!isalpha(msg[i+1])) continue;// -e else if((msg[i+1]=='d'||msg[i+1]=='s')&&!isalpha(msg[i+2])) continue;// -ed -es } /*处理连续或单个元音*/ if(!flag_frevowel){//当前字母为元音,但前一字符不是元音 flag_frevowel=1; syllable++; syl++; continue; } else continue;//当前字母为元音,但前一字母也是元音,即出现连续元音,syllable不计数 } flag_frevowel=0;//当前字母不是元音 } else if(isword(msg[i])){//当前字符为 单词分隔符 flag_frevowel=0; wordlen=0;//当前单词操作已结束,长度清零,计算下一单词 syl=0; word++; } else if(issentance(msg[i])){//当前字符为 句子分隔符 flag_frevowel=0; wordlen=0;//当前单词操作已结束,长度清零,计算下一单词 word++; syl=0; sentance++; } } if(isalpha(msg[i-1])) word++;//当前文章片段最后一个字符为 字母 } printf("%.2lf",206.835-1.015*(double)word/(double)sentance-84.6*(double)syllable/(double)word); return 0; }