zoukankan      html  css  js  c++  java
  • poj3371

    Flesch Reading Ease
    Time Limit: 1000MS   Memory Limit: 65536K
    Total Submissions: 2269   Accepted: 710

    Description

    Flesch Reading Ease, a readability test named after its deviser Rudolf Flesch, is among most ubiquitously used readability tests, which are principally employed for assessment of the difficulty to understand a reading passage written in English. The Flesch Reading Ease score of a passage relies solely on three statistics, namely the total numbers of sentences, words and syllables, of the passage. Specifically, the score is defined by the following formula:

    .

    As can be inferred from the above formula, a passage with a high Flesch Reading Ease score tends to favor shorter sentences and words, which is in compliance with commonsense in spite of partial accuracy. (Think of, for instance, the word "television". Long as it may seem, it is indeed one of the first words that any individual who studies English learns.) A related Wikipedia entry on Flesch Reading Ease [1] suggests that passages scoring 90~100 are comprehensible for an average American 5th grader, and 8th and 9th graders possess the ability to follow passages with a score in the range of 60~70, whereas passages not exceeding 30 in the score are best suitable for college graduates. The text of this problem, all sections taken into account, scores roughly 50 as per the calculation of Google Documents.

    Despite the simplicity in its ideas, several aspects of its definition remains vague for any real-world implementation of Flesch Reading Ease. For the sake of precision and uniformity, the following restrictions adapted from [2] are adopted for this problem, to which you are to write a solution that effectively computes the Flesch Reading Ease score of a given passage of English text.

    1. Periods, explanation points, colons and semicolons serve as sentence delimiters.
    2. Each group of continuous non-blank characters with beginning and ending punctuation removed counts as a word.
    3. Each vowel (one of a, e, i, o, u and y) in a word is considered one syllable subject to that
      1. -es, -ed and -e (except -le) endings are ignored,
      2. words of three letters or shorter count as single syllables,
      3. consecutive vowels count as one syllable.

    References

    1. Wikipedia contributors. Flesch-Kincaid Readability Test. Wikipedia, The Free Encyclopedia. August 30, 2007, 01:57 UTC. Available at: http://en.wikipedia.org/w/index.php?title=Flesch-Kincaid_Readability_Test&oldid=154509512. Accessed September 5, 2007.
    2. Talburt, J. 1985. The Flesch index: An easily programmable readability analysis algorithm. In Proceedings of the 4th Annual international Conference on Systems Documentation. SIGDOC '85. ACM Press, New York, NY, 114-122.

    Input

    The input contains a passage in English whose Flesch Reading Ease score is to be computed. Only letters of the English alphabet (both lowercase and uppercase), common punctuation marks (periods, question and exclamation marks, colons, semicolons as well as commas, quotation marks, hyphens and apostrophes), and spaces appear in the passage. The passage is of indefinite length and possibly occupies multiple lines. Additionally, it is guaranteed to be correct in punctuation. 
     

    Output

    Output the Flesch Reading Ease score of the given passage rounded to two digits beyond decimal point. 
     

    Sample Input

    Flesch Reading Ease, a readability test named after its deviser Rudolf Flesch,
    is among most ubiquitously used readability tests, which are principally
    employed for assessment of the difficulty to understand a reading passage
    written in English. The Flesch Reading Ease score of a passage relies solely
    on three statistics, namely the total numbers of sentences, words and
    syllables, of the passage.

    Sample Output

    26.09

    Source

    大致题意:
    给出一篇规范的文章,求其 句子数、单词数 和 音节数把这3个值代入题目给出的公式,输出其结果,保留2位小数。
      标记单词分隔符: 逗号(,) 和 空格( )
      句子分隔符:句号(.) 问号(?) 冒号(:) 分号(;) 感叹号(!)
      音节处理要求:
      (1)当单词总长度<=3时,音节数无条件+1
      (2) 当单词总长度>3时,单词中每出现一个元音字母(a、e、i、o、u、y),音节数+1,但是连续的(>=2)元音字母只按1个音节计算,且当单词后缀为-es、-ed和-e时,后缀的元  音字母e不列为音节数计算。但是后缀-le例外,要计算音节数。


    解题思路:


    注意:
    由于用while(cin>>msg)输入文章,因此是按 空字符 把文章分开若干片段,直到出现EOF时才结束输入,因此msg中的单词分隔符不会出现空格,只要当msg最后一个字符为字母时,就说明此时的单词分隔符为空格。

    音节数是最难处理的,其规律如下:
    (1) 当单词总长度<=3时,音节数无条件+1
    (2) 当单词总长度>3时,单词中每出现一个元音字母(a、e、i、o、u、y),音节数+1,但是连续的元音字母只按1个音节计算,且当单词后缀为-es、-ed和-e时,后缀的元音字母e不列为音节数计算。但是后缀-le例外,要计算音节数。

    注意:
    (1)元音字母要判断12个,6个小写,6个大写。

    (2)输入的文章每个字符只能扫描一次,若重复扫描会超时。

    (3)G++用%.2f ; C++用%.2lf

    AC代码+详细注释:

    #include<cstdio>
    #include<iostream>
    using namespace std;
    char msg[1000];
    int word=0;//单词数
    int sentance=0;//句子数
    int syllable=0;//音标数
    bool isalpha(char ch){//检查字符ch是否为字母
        if(ch>='A'&&ch<='Z') return 1;
        if(ch>='a'&&ch<='z') return 1;
        return 0;
    }
    bool isvowel(char ch){//检查字符ch是否为元音字母
        if(ch=='a'||ch=='e'||ch=='i'||ch=='o'||ch=='u'||ch=='y') return 1;
        if(ch=='A'||ch=='E'||ch=='I'||ch=='O'||ch=='U'||ch=='Y') return 1;
        return 0;
    }
    bool isword(char ch){//检查字符ch是否为单词分隔符
        if(ch==',')
            return 1;
        return 0;
    }
    bool issentance(char ch){//检查字符ch是否为句子分隔符
        if(ch=='.'||ch=='?' ||ch==':'||ch==';'||ch=='!') return 1;
        return 0;
    }
    int main(){
        while(cin>>msg){//以空格为标记,截取文章片段
            int wordlen=0;
            bool flag_frevowel=0;//当当前字符为元音时,检查前一字符是否为元音的标记
            int syl=0;//假设当前单词长度>3时,记录音节数的变化量。若单词实际长度<=3,则syllable-syl
            int i;
            for(i=0;msg[i];i++){
                if(isalpha(msg[i])){//当前字符为 字母
                    wordlen++;//当前所处理的单词的已知长度 (已知长度<=实际长度)
                    if(wordlen<=3){//当 已知长度<=3 时
                        if(!isalpha(msg[i+1])){//检查单词实际长度是否<=3
                            syllable++;//当实际长度<=3时,syllable无条件+1
                            syllable-=syl;//实际音节数调整,单词实际长度<=3,则syllable减去 "假设单词长度>3时" 音节数的变化量syl
                            syl=0;
                            continue;
                        }
                    }
                    if(isvowel(msg[i])){//当前字母为 元音字母
                        if(msg[i]=='e'){
                            if(!isalpha(msg[i+1])&&msg[i-1]=='l'){//-le
                                syllable++;
                                syl++;//由于不知道单词的实际长度,因此总音节数syllable与音节数变化量syl同时递增
                                continue;
                            }
                            else if(!isalpha(msg[i+1])) continue;// -e
                            else if((msg[i+1]=='d'||msg[i+1]=='s')&&!isalpha(msg[i+2])) continue;// -ed  -es
                        }
                        /*处理连续或单个元音*/
                        if(!flag_frevowel){//当前字母为元音,但前一字符不是元音
                            flag_frevowel=1;
                            syllable++;
                            syl++;
                            continue;
                        }
                        else continue;//当前字母为元音,但前一字母也是元音,即出现连续元音,syllable不计数
                    }
                    flag_frevowel=0;//当前字母不是元音
                }
                else if(isword(msg[i])){//当前字符为 单词分隔符
                    flag_frevowel=0;
                    wordlen=0;//当前单词操作已结束,长度清零,计算下一单词
                    syl=0;
                    word++;
                }
                else if(issentance(msg[i])){//当前字符为 句子分隔符
                    flag_frevowel=0;
                    wordlen=0;//当前单词操作已结束,长度清零,计算下一单词
                    word++;
                    syl=0;
                    sentance++;
                }
            }
            if(isalpha(msg[i-1])) word++;//当前文章片段最后一个字符为 字母
        }
        printf("%.2lf",206.835-1.015*(double)word/(double)sentance-84.6*(double)syllable/(double)word);
        return 0;
    }
  • 相关阅读:
    删除IE缓存中指定的文件
    新破天一剑合区SQL(半成品)
    获取DOS命令的返回值.
    截取 a.exe 向 b.exe 传递 的启动参数
    五、Delphi 2009 泛型容器单元(Generics.Collections): TDictionary<T>
    C#结构体
    Delphi使用Zlib
    判断数据库是否存在
    去掉CodeGear的Welcome page.
    Indy10中IdMappedPortTCP的用法
  • 原文地址:https://www.cnblogs.com/shenben/p/5997999.html
Copyright © 2011-2022 走看看