  • Natural Language Processing with Python

    Steven Bird, Ewan Klein, and Edward Loper

    Table of Contents

    Preface

    1. Language Processing and Python

      1.1  Computing with Language: Texts and Words

      1.2  A Closer Look at Python: Texts as Lists of Words

      1.3  Computing with Language: Simple Statistics

      1.4  Back to Python: Making Decisions and Taking Control

      1.5  Automatic Natural Language Understanding

      1.6  Summary

      1.7  Further Reading

      1.8  Exercises

    2. Accessing Text Corpora and Lexical Resources

      2.1  Accessing Text Corpora

      2.2  Conditional Frequency Distributions

      2.3  More Python: Reusing Code

      2.4  Lexical Resources

      2.5  WordNet

      2.6  Summary

      2.7  Further Reading

      2.8  Exercises

    3. Processing Raw Text

      3.1  Accessing Text from the Web and from Disk

      3.2  Strings: Text Processing at the Lowest Level

      3.3  Text Processing with Unicode

      3.4  Regular Expressions for Detecting Word Patterns

      3.5  Useful Applications of Regular Expressions

      3.6  Normalizing Text

      3.7  Regular Expressions for Tokenizing Text

      3.8  Segmentation

      3.9  Formatting: From Lists to Strings

      3.10  Summary

      3.11  Further Reading

      3.12  Exercises

    4. Writing Structured Programs

      4.1  Back to the Basics

      4.2  Sequences

      4.3  Questions of Style

      4.4  Functions: The Foundation of Structured Programming

      4.5  Doing More with Functions

      4.6  Program Development

      4.7  Algorithm Design

      4.8  A Sample of Python Libraries

      4.9  Summary

      4.10  Further Reading

      4.11  Exercises

    5. Categorizing and Tagging Words

      5.1  Using a Tagger

      5.2  Tagged Corpora

      5.3  Mapping Words to Properties Using Python Dictionaries

      5.4  Automatic Tagging

      5.5  N-Gram Tagging

      5.6  Transformation-Based Tagging

      5.7  How to Determine the Category of a Word

      5.8  Summary

      5.9  Further Reading

      5.10  Exercises

    6. Learning to Classify Text

      6.1  Supervised Classification

      6.2  Further Examples of Supervised Classification

      6.3  Evaluation

      6.4  Decision Trees

      6.5  Naive Bayes Classifiers

      6.6  Maximum Entropy Classifiers

      6.7  Modeling Linguistic Patterns

      6.8  Summary

      6.9  Further Reading

      6.10  Exercises

    7. Extracting Information from Text

      7.1  Information Extraction

      7.2  Chunking

      7.3  Developing and Evaluating Chunkers

      7.4  Recursion in Linguistic Structure

      7.5  Named Entity Recognition

      7.6  Relation Extraction

      7.7  Summary

      7.8  Further Reading

      7.9  Exercises

    8. Analyzing Sentence Structure

      8.1  Some Grammatical Dilemmas

      8.2  What’s the Use of Syntax?

      8.3  Context-Free Grammar

      8.4  Parsing with Context-Free Grammar

      8.5  Dependencies and Dependency Grammar

      8.6  Grammar Development

      8.7  Summary

      8.8  Further Reading

      8.9  Exercises

    9. Building Feature-Based Grammars

      9.1  Grammatical Features

      9.2  Processing Feature Structures

      9.3  Extending a Feature-Based Grammar

      9.4  Summary

      9.5  Further Reading

      9.6  Exercises

    10. Analyzing the Meaning of Sentences

      10.1  Natural Language Understanding

      10.2  Propositional Logic

      10.3  First-Order Logic

      10.4  The Semantics of English Sentences

      10.5  Discourse Semantics

      10.6  Summary

      10.7  Further Reading

      10.8  Exercises

    11. Managing Linguistic Data

      11.1  Corpus Structure: A Case Study

      11.2  The Life Cycle of a Corpus

      11.3  Acquiring Data

      11.4  Working with XML

      11.5  Working with Toolbox Data

      11.6  Describing Language Resources Using OLAC Metadata

      11.7  Summary

      11.8  Further Reading

      11.9  Exercises

    Afterword: The Language Challenge

    Bibliography

    NLTK Index

    General Index

    Natural language processing enthusiast; feel free to get in touch. QQ: 7214218
  • Original article: https://www.cnblogs.com/z-cm/p/14918804.html