poj3371

zoukankan html css js c++ java

poj3371
Flesch Reading Ease

Time Limit: 1000MS 　 Memory Limit: 65536K

Total Submissions: 2269 　 Accepted: 710

Description
Flesch Reading Ease, a readability test named after its deviser Rudolf Flesch, is among most ubiquitously used readability tests, which are principally employed for assessment of the difficulty to understand a reading passage written in English. The Flesch Reading Ease score of a passage relies solely on three statistics, namely the total numbers of sentences, words and syllables, of the passage. Specifically, the score is defined by the following formula:
$206.835-1.015\times\frac{\mathrm{words}}{\mathrm{sentences}}-84.6\times\frac{\mathrm{syllables}}{\mathrm{words}}$ .

As can be inferred from the above formula, a passage with a high Flesch Reading Ease score tends to favor shorter sentences and words, which is in compliance with commonsense in spite of partial accuracy. (Think of, for instance, the word "television". Long as it may seem, it is indeed one of the first words that any individual who studies English learns.) A related Wikipedia entry on Flesch Reading Ease [1] suggests that passages scoring 90~100 are comprehensible for an average American 5th grader, and 8th and 9th graders possess the ability to follow passages with a score in the range of 60~70, whereas passages not exceeding 30 in the score are best suitable for college graduates. The text of this problem, all sections taken into account, scores roughly 50 as per the calculation of Google Documents.

Despite the simplicity in its ideas, several aspects of its definition remains vague for any real-world implementation of Flesch Reading Ease. For the sake of precision and uniformity, the following restrictions adapted from [2] are adopted for this problem, to which you are to write a solution that effectively computes the Flesch Reading Ease score of a given passage of English text.

Periods, explanation points, colons and semicolons serve as sentence delimiters.

Each group of continuous non-blank characters with beginning and ending punctuation removed counts as a word.

Each vowel (one of a, e, i, o, u and y) in a word is considered one syllable subject to that

-es, -ed and -e (except -le) endings are ignored,

words of three letters or shorter count as single syllables,

consecutive vowels count as one syllable.

References

Wikipedia contributors. Flesch-Kincaid Readability Test. Wikipedia, The Free Encyclopedia. August 30, 2007, 01:57 UTC. Available at: http://en.wikipedia.org/w/index.php?title=Flesch-Kincaid_Readability_Test&oldid=154509512. Accessed September 5, 2007.

Talburt, J. 1985. The Flesch index: An easily programmable readability analysis algorithm. In Proceedings of the 4th Annual international Conference on Systems Documentation. SIGDOC '85. ACM Press, New York, NY, 114-122.
Input

The input contains a passage in English whose Flesch Reading Ease score is to be computed. Only letters of the English alphabet (both lowercase and uppercase), common punctuation marks (periods, question and exclamation marks, colons, semicolons as well as commas, quotation marks, hyphens and apostrophes), and spaces appear in the passage. The passage is of indefinite length and possibly occupies multiple lines. Additionally, it is guaranteed to be correct in punctuation.
　

Output

Output the Flesch Reading Ease score of the given passage rounded to two digits beyond decimal point.
　

Sample Input
```
Flesch Reading Ease, a readability test named after its deviser Rudolf Flesch,
is among most ubiquitously used readability tests, which are principally
employed for assessment of the difficulty to understand a reading passage
written in English. The Flesch Reading Ease score of a passage relies solely
on three statistics, namely the total numbers of sentences, words and
syllables, of the passage.
```
Sample Output
```
26.09
```
Source

POJ Monthly--2007.09.09, frkstyc

大致题意：
给出一篇规范的文章，求其句子数、单词数和音节数把这3个值代入题目给出的公式，输出其结果，保留2位小数。
　　标记单词分隔符：逗号（,）和空格（）
　　句子分隔符：句号（.）问号（?）冒号（:）分号（;）感叹号（!）
　　音节处理要求：
　　（1）当单词总长度<=3时，音节数无条件+1
　　（2）当单词总长度>3时，单词中每出现一个元音字母（a、e、i、o、u、y），音节数+1，但是连续的（>=2）元音字母只按1个音节计算，且当单词后缀为-es、-ed和-e时，后缀的元　　音字母e不列为音节数计算。但是后缀-le例外，要计算音节数。

解题思路：

注意：
由于用while(cin>>msg)输入文章，因此是按空字符把文章分开若干片段，直到出现EOF时才结束输入，因此msg中的单词分隔符不会出现空格，只要当msg最后一个字符为字母时，就说明此时的单词分隔符为空格。

音节数是最难处理的，其规律如下：
（1）当单词总长度<=3时，音节数无条件+1
（2）当单词总长度>3时，单词中每出现一个元音字母（a、e、i、o、u、y），音节数+1，但是连续的元音字母只按1个音节计算，且当单词后缀为-es、-ed和-e时，后缀的元音字母e不列为音节数计算。但是后缀-le例外，要计算音节数。

注意：
(1)元音字母要判断12个，6个小写，6个大写。

(2)输入的文章每个字符只能扫描一次，若重复扫描会超时。

(3)G++用%.2f ; C++用%.2lf

AC代码+详细注释：
#include<cstdio> #include<iostream> using namespace std; char msg[1000]; int word=0;//单词数 int sentance=0;//句子数 int syllable=0;//音标数 bool isalpha(char ch){//检查字符ch是否为字母 if(ch>='A'&&ch<='Z') return 1; if(ch>='a'&&ch<='z') return 1; return 0; } bool isvowel(char ch){//检查字符ch是否为元音字母 if(ch=='a'||ch=='e'||ch=='i'||ch=='o'||ch=='u'||ch=='y') return 1; if(ch=='A'||ch=='E'||ch=='I'||ch=='O'||ch=='U'||ch=='Y') return 1; return 0; } bool isword(char ch){//检查字符ch是否为单词分隔符 if(ch==',') return 1; return 0; } bool issentance(char ch){//检查字符ch是否为句子分隔符 if(ch=='.'||ch=='?' ||ch==':'||ch==';'||ch=='!') return 1; return 0; } int main(){ while(cin>>msg){//以空格为标记，截取文章片段 int wordlen=0; bool flag_frevowel=0;//当当前字符为元音时，检查前一字符是否为元音的标记 int syl=0;//假设当前单词长度>3时，记录音节数的变化量。若单词实际长度<=3，则syllable-syl int i; for(i=0;msg[i];i++){ if(isalpha(msg[i])){//当前字符为字母 wordlen++;//当前所处理的单词的已知长度 (已知长度<=实际长度) if(wordlen<=3){//当已知长度<=3 时 if(!isalpha(msg[i+1])){//检查单词实际长度是否<=3 syllable++;//当实际长度<=3时，syllable无条件+1 syllable-=syl;//实际音节数调整，单词实际长度<=3，则syllable减去 "假设单词长度>3时" 音节数的变化量syl syl=0; continue; } } if(isvowel(msg[i])){//当前字母为元音字母 if(msg[i]=='e'){ if(!isalpha(msg[i+1])&&msg[i-1]=='l'){//-le syllable++; syl++;//由于不知道单词的实际长度，因此总音节数syllable与音节数变化量syl同时递增 continue; } else if(!isalpha(msg[i+1])) continue;// -e else if((msg[i+1]=='d'||msg[i+1]=='s')&&!isalpha(msg[i+2])) continue;// -ed -es } /*处理连续或单个元音*/ if(!flag_frevowel){//当前字母为元音，但前一字符不是元音 flag_frevowel=1; syllable++; syl++; continue; } else continue;//当前字母为元音，但前一字母也是元音，即出现连续元音，syllable不计数 } flag_frevowel=0;//当前字母不是元音 } else if(isword(msg[i])){//当前字符为单词分隔符 flag_frevowel=0; wordlen=0;//当前单词操作已结束，长度清零，计算下一单词 syl=0; word++; } else if(issentance(msg[i])){//当前字符为句子分隔符 flag_frevowel=0; wordlen=0;//当前单词操作已结束，长度清零，计算下一单词 word++; syl=0; sentance++; } } if(isalpha(msg[i-1])) word++;//当前文章片段最后一个字符为字母 } printf("%.2lf",206.835-1.015*(double)word/(double)sentance-84.6*(double)syllable/(double)word); return 0; }
查看全文

相关阅读:
一道比较有趣的题
 笑话两则
 时钟
 组策略简单介绍
 网页乱码问题ASP.NET
同性恋的公鸡
 SQL中CASE函数_可解决编程中空表检索的一些问题
 百万网？
黑客故事
 word有趣问题集锦

原文地址：https://www.cnblogs.com/shenben/p/5997999.html

Time Limit: 1000MS		Memory Limit: 65536K
Total Submissions: 2269		Accepted: 710

References