  • Python (Head First) Study Notes: Part 4

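    4 Persistent storage: saving data to files, reading and writing

       Saving data to files: while working through this chapter I kept hitting the same error, SyntaxError: invalid syntax.

       This is a syntax error. After some searching I found the cause: Python 2.7 and Python 3.5 are not compatible. I had been typing these small examples into the Ubuntu terminal, which became awkward once the programs grew, so I installed IDLE with the command sudo apt-get install idle. After starting it and loading my Python file, I found that it was running Python 2.7. Running the code line by line produced errors, even though the same code had worked before. I then installed idle3 with sudo apt-get install idle3 and started it with idle3; the example code ran without problems.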
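    One way to notice this kind of interpreter mismatch early is to check the version at the top of a script. The following is only a minimal sketch; the helper name require_python3 is hypothetical and not part of the book's code. Note that a real SyntaxError is raised when a file is compiled, so a check like this only helps if the file containing it still parses under Python 2.

```python
import sys

def require_python3():
    """Return the running major version, raising if it is older than Python 3."""
    if sys.version_info[0] < 3:
        # Features used in these notes, such as print(..., file=...),
        # are not available in Python 2.7.
        raise RuntimeError('These examples need Python 3, found ' + sys.version.split()[0])
    return sys.version_info[0]

major = require_python3()
print('Running under Python', major)
```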

    Example 1:

    Python 3.5.2 (default, Nov 17 2016, 17:05:23) 
    [GCC 5.4.0 20160609] on linux
    Type "copyright", "credits" or "license()" for more information.
    >>> import os
    >>> os.getcwd()
    '/home/user'
    >>> os.chdir('/home/user/project/python_model/HeadFirstPython/chapter3')
    >>> os.getcwd()
    '/home/user/project/python_model/HeadFirstPython/chapter3'
    >>> man=[]
    >>> other=[]
    >>> try:
        data=open('sketch.txt')
        for each_line in data:
            try:
                (role,line_spoken)=each_line.split(':',1)
                line_spoken=line_spoken.strip()
                if role=='Man':
                    man.append(line_spoken)
                elif role=='Other Man':
                    other.append(line_spoken)
            except ValueError:
                pass
        data.close()
    except IOError:
        print('The datafile is missing!')

    >>> print(man)
    ['Is this the right room for an argument?', "No you haven't!", 'When?', "No you didn't!", "You didn't!", 'You did not!', 'Ah! (taking out his wallet and paying) Just the five minutes.', 'You most certainly did not!', "Oh no you didn't!", "Oh no you didn't!", "Oh look, this isn't an argument!", "No it isn't!", "It's just contradiction!", 'It IS!', 'You just contradicted me!', 'You DID!', 'You did just then!', '(exasperated) Oh, this is futile!!', 'Yes it is!']
    >>> print(other)
    ["I've told you once.", 'Yes I have.', 'Just now.', 'Yes I did!', "I'm telling you, I did!", "Oh I'm sorry, is this a five minute argument, or the full half hour?", 'Just the five minutes. Thank you.', 'Anyway, I did.', "Now let's get one thing quite clear: I most definitely told you!", 'Oh yes I did!', 'Oh yes I did!', 'Yes it is!', "No it isn't!", 'It is NOT!', "No I didn't!", 'No no no!', 'Nonsense!', "No it isn't!"]
    >>> 

     Opening a file in write mode

      When you open a disk file with the open() BIF, you can specify the access mode. The help text for open() is shown below:

    >>> help(open)
    Help on built-in function open in module io:

    open(file, mode='r', buffering=-1, encoding=None, errors=None, newline=None, closefd=True, opener=None)
        Open file and return a stream.  Raise IOError upon failure.

        file is either a text or byte string giving the name (and the path
        if the file isn't in the current working directory) of the file to
        be opened or an integer file descriptor of the file to be
        wrapped. (If a file descriptor is given, it is closed when the
        returned I/O object is closed, unless closefd is set to False.)

        mode is an optional string that specifies the mode in which the file
        is opened. It defaults to 'r' which means open for reading in text
        mode.  Other common values are 'w' for writing (truncating the file if
        it already exists), 'x' for creating and writing to a new file, and
        'a' for appending (which on some Unix systems, means that all writes
        append to the end of the file regardless of the current seek position).
        In text mode, if encoding is not specified the encoding used is platform
        dependent: locale.getpreferredencoding(False) is called to get the
        current locale encoding. (For reading and writing raw bytes use binary
        mode and leave encoding unspecified.) The available modes are:

        ========= ===============================================================
        Character Meaning
        --------- ---------------------------------------------------------------
        'r'       open for reading (default)
        'w'       open for writing, truncating the file first
        'x'       create a new file and open it for writing
        'a'       open for writing, appending to the end of the file if it exists
        'b'       binary mode
        't'       text mode (default)
        '+'       open a disk file for updating (reading and writing)
        'U'       universal newline mode (deprecated)
        ========= ===============================================================

        The default mode is 'rt' (open for reading text). For binary random
        access, the mode 'w+b' opens and truncates the file to 0 bytes, while
        'r+b' opens the file without truncation. The 'x' mode implies 'w' and
        raises an `FileExistsError` if the file already exists.

        Python distinguishes between files opened in binary and text modes,
        even when the underlying operating system doesn't. Files opened in
        binary mode (appending 'b' to the mode argument) return contents as
        bytes objects without any decoding. In text mode (the default, or when
        't' is appended to the mode argument), the contents of the file are
        returned as strings, the bytes having been first decoded using a
        platform-dependent encoding or using the specified encoding if given.

        'U' mode is deprecated and will raise an exception in future versions
        of Python.  It has no effect in Python 3.  Use newline to control
        universal newlines mode.

        buffering is an optional integer used to set the buffering policy.
        Pass 0 to switch buffering off (only allowed in binary mode), 1 to select
        line buffering (only usable in text mode), and an integer > 1 to indicate
        the size of a fixed-size chunk buffer.  When no buffering argument is
        given, the default buffering policy works as follows:

        * Binary files are buffered in fixed-size chunks; the size of the buffer
          is chosen using a heuristic trying to determine the underlying device's
          "block size" and falling back on `io.DEFAULT_BUFFER_SIZE`.
          On many systems, the buffer will typically be 4096 or 8192 bytes long.

        * "Interactive" text files (files for which isatty() returns True)
          use line buffering.  Other text files use the policy described above
          for binary files.

        encoding is the name of the encoding used to decode or encode the
        file. This should only be used in text mode. The default encoding is
        platform dependent, but any encoding supported by Python can be
        passed.  See the codecs module for the list of supported encodings.

        errors is an optional string that specifies how encoding errors are to
        be handled---this argument should not be used in binary mode. Pass
        'strict' to raise a ValueError exception if there is an encoding error
        (the default of None has the same effect), or pass 'ignore' to ignore
        errors. (Note that ignoring encoding errors can lead to data loss.)
        See the documentation for codecs.register or run 'help(codecs.Codec)'
        for a list of the permitted encoding error strings.

        newline controls how universal newlines works (it only applies to text
        mode). It can be None, '', '\n', '\r', and '\r\n'.  It works as
        follows:

        * On input, if newline is None, universal newlines mode is
          enabled. Lines in the input can end in '\n', '\r', or '\r\n', and
          these are translated into '\n' before being returned to the
          caller. If it is '', universal newline mode is enabled, but line
          endings are returned to the caller untranslated. If it has any of
          the other legal values, input lines are only terminated by the given
          string, and the line ending is returned to the caller untranslated.

        * On output, if newline is None, any '\n' characters written are
          translated to the system default line separator, os.linesep. If
          newline is '' or '\n', no translation takes place. If newline is any
          of the other legal values, any '\n' characters written are translated
          to the given string.

        If closefd is False, the underlying file descriptor will be kept open
        when the file is closed. This does not work when a file name is given
        and must be True in that case.

        A custom opener can be used by passing a callable as *opener*. The
        underlying file descriptor for the file object is then obtained by
        calling *opener* with (*file*, *flags*). *opener* must return an open
        file descriptor (passing os.open as *opener* results in functionality
        similar to passing None).

        open() returns a file object whose type depends on the mode, and
        through which the standard file operations such as reading and writing
        are performed. When open() is used to open a file in a text mode ('w',
        'r', 'wt', 'rt', etc.), it returns a TextIOWrapper. When used to open
        a file in a binary mode, the returned class varies: in read binary
        mode, it returns a BufferedReader; in write binary and append binary
        modes, it returns a BufferedWriter, and in read/write mode, it returns
        a BufferedRandom.

        It is also possible to use a string or bytearray as a file for both
        reading and writing. For strings StringIO can be used like a file
        opened in a text mode, and for bytes a BytesIO can be used like a file
        opened in a binary mode.
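    The most common modes from the help text can be tried out safely in a temporary directory. This is only a small sketch; the file name demo.txt is made up for illustration.

```python
import os
import tempfile

# Work in a throwaway directory so no real files are touched.
with tempfile.TemporaryDirectory() as tmp_dir:
    path = os.path.join(tmp_dir, 'demo.txt')

    # 'w' creates the file, or truncates it if it already exists.
    with open(path, 'w') as f:
        f.write('first\n')

    # 'a' keeps the existing contents and appends to the end.
    with open(path, 'a') as f:
        f.write('second\n')

    # 'r' (the default) opens the file for reading in text mode.
    with open(path) as f:
        lines = f.read().splitlines()

    # 'x' refuses to overwrite: it raises FileExistsError if the file exists.
    try:
        open(path, 'x')
        exclusive_failed = False
    except FileExistsError:
        exclusive_failed = True

print(lines)             # ['first', 'second']
print(exclusive_failed)  # True
```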

     Example 2:

    import os

    print(os.getcwd())  # show the starting directory
    os.chdir('/home/user/project/python_model/HeadFirstPython/chapter3')
    man = []
    other = []
    try:
        data = open('sketch.txt')
        for each_line in data:
            try:
                (role, line_spoken) = each_line.split(':', 1)
                line_spoken = line_spoken.strip()
                if role == 'Man':
                    man.append(line_spoken)
                elif role == 'Other Man':
                    other.append(line_spoken)
            except ValueError:
                # lines without a ':' cannot be split into two parts; skip them
                pass
        data.close()
    except IOError:
        print('The datafile is missing!')
    try:
        # mode 'w' creates the file if it does not exist and truncates it if it does
        man_file = open('man_data.txt', 'w')
        other_file = open('other_data.txt', 'w')
        print(man, file=man_file)      # write the man list to man_data.txt
        print(other, file=other_file)  # write the other list to other_data.txt
        man_file.close()
        other_file.close()
    except IOError:
        print('File error')

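    Note: if an exception occurs, the files stay open.

          To make sure the files are closed even when an exception occurs, use finally.

    Extending try with finally

      Append the following to the end of Example 2:

        finally:
            man_file.close()
            other_file.close()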
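    To see that the finally suite really runs after an exception, here is a small self-contained sketch. The simulated failure and the events list are illustrative assumptions, and pre-assigning man_file = None is just one way to guard against the file never having been opened.

```python
import os
import tempfile

events = []  # records the order in which things happen

with tempfile.TemporaryDirectory() as tmp_dir:
    path = os.path.join(tmp_dir, 'man_data.txt')
    man_file = None
    try:
        man_file = open(path, 'w')
        events.append('opened')
        raise IOError('simulated write failure')  # stand-in for a real I/O error
    except IOError as err:
        events.append('handled: ' + str(err))
    finally:
        # This suite runs whether or not an exception occurred.
        if man_file is not None:
            man_file.close()
        events.append('closed')

print(events)           # ['opened', 'handled: simulated write failure', 'closed']
print(man_file.closed)  # True
```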

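           In Python, strings are immutable, because you can never know which other variables refer to a particular string;

      although you can assign a value to a Python variable, the variable does not actually contain the assigned data, it only refers to it;

      tuples cannot be changed either; a tuple is an immutable list;

      all numeric types are immutable as well.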
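    A short sketch of what immutability means in practice. The variable names here are made up for illustration.

```python
greeting = 'hello'
alias = greeting             # both names refer to the same string object

greeting = greeting.upper()  # builds a NEW string; the original is untouched
print(greeting)              # HELLO
print(alias)                 # hello

# Strings and tuples both reject in-place modification with a TypeError.
string_is_immutable = False
try:
    alias[0] = 'H'
except TypeError:
    string_is_immutable = True

tuple_is_immutable = False
point = (1, 2)
try:
    point[0] = 10
except TypeError:
    tuple_is_immutable = True

print(string_is_immutable, tuple_is_immutable)  # True True
```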

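    Knowing the type of error is not enough

      To find out the specific cause of an error, the exception handler needs to capture the exception object.

      Suppose we try to open a file, missing.txt, that does not exist:

      try:
          data = open('missing.txt')
          print(data.readline(), end='')
      except IOError:
          print('File error')
      finally:
          if 'data' in locals():
              data.close()

    A further improvement:

      except IOError as err:                # give the exception object a name
          print('File error: ' + str(err))  # then use it as part of the error message

    Running the code now prints: File error: [Errno 2] No such file or directory: 'missing.txt'

    With more code, though, this style of handling becomes tedious, which is where with comes in.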
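    The pieces above (the named exception object and the locals() check) can be combined into one function. This is a sketch only; read_first_line is a hypothetical helper, and the missing file is looked up in a temporary directory so that it is guaranteed not to exist.

```python
import os
import tempfile

def read_first_line(filename):
    """Return (line, error_message); exactly one of the two is None."""
    try:
        data = open(filename)
        return data.readline(), None
    except IOError as err:
        return None, 'File error: ' + str(err)
    finally:
        # 'data' only exists in the local scope if open() succeeded.
        if 'data' in locals():
            data.close()

with tempfile.TemporaryDirectory() as tmp_dir:
    line, message = read_first_line(os.path.join(tmp_dir, 'missing.txt'))

print(message)
```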

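    Handling files with with

      The following code can replace the try/except/finally code above:

      try:
          with open('its.txt', 'w') as data:
              print("It's...", file=data)
      except IOError as err:
          print('File error: ' + str(err))

      Note: when you use with, you do not have to worry about closing the file; the Python interpreter takes care of it automatically.

        Under the hood, the with statement uses a Python technique called the context management protocol.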
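    To make the context management protocol less abstract, here is a toy class that implements it. The class name Tracked is invented for this sketch; file objects implement the same two methods, __enter__ and __exit__, with __exit__ closing the file.

```python
class Tracked:
    """A toy context manager that records when it is entered and exited."""

    def __init__(self, log):
        self.log = log

    def __enter__(self):
        self.log.append('enter')
        return self

    def __exit__(self, exc_type, exc_value, traceback):
        # Called when the with block ends, even if it ended with an exception.
        self.log.append('exit')
        return False  # do not swallow the exception

log = []
try:
    with Tracked(log):
        log.append('body')
        raise ValueError('boom')
except ValueError:
    log.append('caught')

print(log)  # ['enter', 'body', 'exit', 'caught']
```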

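    Next, modify the print_lol() function from the chapter 2 notes

      In Python, standard output is sys.stdout, which can be imported from the standard library's sys module.

      Example 3

    Changes to the print_lol function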

    def print_lol(the_list, indent=False, level=0, fh=sys.stdout):
        for each_item in the_list:
            if isinstance(each_item, list):
                print_lol(each_item, indent, level + 1, fh)
            else:
                if indent:  # only indent nested items when asked to
                    for tab_stop in range(level):
                        print('\t', end='', file=fh)
                print(each_item, file=fh)

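       For some reason, after adding the fourth parameter fh=sys.stdout to print_lol, running import sys followed by import nester raised an error:

    Traceback (most recent call last):
      File "<pyshell#9>", line 1, in <module>
        import nester
      File "/home/user/project/python_model/nester/nester.py", line 1, in <module>
        def print_lol(the_list,indent=False,level=0,fh=sys.stdout ):
    NameError: name 'sys' is not defined

       Searching online did not turn up a fix at the time, which was frustrating; I was stuck for two days and moved on. The traceback does point at the cause, though: the default value fh=sys.stdout is evaluated when the def statement runs, inside nester.py's own namespace. Importing sys in the shell does not make it visible there, so nester.py itself needs import sys before the function definition.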
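    A sketch of how the module could look with the import in place. This is an assumed version of nester.py, not the author's file; io.StringIO is used here only to capture the output for inspection.

```python
import io
import sys  # must be imported in the module that defines print_lol

def print_lol(the_list, indent=False, level=0, fh=sys.stdout):
    """Print each item of a (possibly nested) list on its own line to fh."""
    for each_item in the_list:
        if isinstance(each_item, list):
            print_lol(each_item, indent, level + 1, fh)
        else:
            if indent:
                # one tab per nesting level
                print('\t' * level, end='', file=fh)
            print(each_item, file=fh)

buffer = io.StringIO()
print_lol(['a', ['b', ['c']]], indent=True, fh=buffer)
print(repr(buffer.getvalue()))  # 'a\n\tb\n\t\tc\n'
```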

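    Custom code review

    "Pickling" data

      Python ships with a standard library module called pickle, which can save and load almost any Python data object, including lists.

      Pickled data can be stored on disk, put into a database, or sent over the network to another computer.

    Save with dump, restore with load

      Using pickle is simple. Just import the module: import pickle;

                   save data with dump();

                   restore data with load().

      Note: the only requirement when working with pickled data is that the files must be opened in binary access mode.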
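    A minimal round trip with dump() and load(), using the binary modes 'wb' and 'rb'. The file name man_data.pickle and the shortened list are assumptions for this sketch.

```python
import os
import pickle
import tempfile

man = ['Is this the right room for an argument?', "No you haven't!"]

with tempfile.TemporaryDirectory() as tmp_dir:
    path = os.path.join(tmp_dir, 'man_data.pickle')  # hypothetical file name

    with open(path, 'wb') as man_file:  # 'wb': write, binary
        pickle.dump(man, man_file)

    with open(path, 'rb') as man_file:  # 'rb': read, binary
        restored = pickle.load(man_file)

print(restored == man)  # True: same contents
print(restored is man)  # False: load() builds a new list object
```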

    What if something goes wrong?

      If something goes wrong while pickling or unpickling data, the pickle module raises an exception of type PickleError.
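    A small sketch of catching that exception. The helper try_load and the deliberately invalid input are made up for illustration. pickle.UnpicklingError is a subclass of pickle.PickleError, so the handler below catches it; note that unpickling can also raise other exception types (for example EOFError on truncated data), which this sketch does not handle.

```python
import pickle

def try_load(raw_bytes):
    """Return (value, error_message); exactly one of the two is None."""
    try:
        return pickle.loads(raw_bytes), None
    except pickle.PickleError as perr:
        return None, 'Pickling error: ' + str(perr)

good = pickle.dumps(['When?'])
print(try_load(good))  # (['When?'], None)

value, message = try_load(b'<not a pickle>')
print(message)
```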

      Example 4: pickling and restoring file data

    >>> print(man)
    ['Is this the right room for an argument?', "No you haven't!", 'When?', "No you didn't!", "You didn't!", 'You did not!', 'Ah! (taking out his wallet and paying) Just the five minutes.', 'You most certainly did not!', "Oh no you didn't!", "Oh no you didn't!", "Oh look, this isn't an argument!", "No it isn't!", "It's just contradiction!", 'It IS!', 'You just contradicted me!', 'You DID!', 'You did just then!', '(exasperated) Oh, this is futile!!', 'Yes it is!']
    >>> try:
        man_file=open('man_data.txt','w')
        other_file=open('other_data.txt','w')
        print(man,file=man_file)
        print(other,file=other_file)
        man_file.close()
        other_file.close()
    except IOError:
        print('File error')
    
        
    >>> import pickle
    >>> try:
        with open('man_data1.txt','wb') as man_file:
            pickle.dump(man,man_file)
    except IOError as err:
        print('File error:'+str(err))
    except pickle.PickleError as perr:
        print('Pickling error:'+str(perr))
    
        
    >>> new_man=[]
    >>> try:
        with open('man_data1.txt','rb') as man_file:
            new_man = pickle.load(man_file)
    except IOError as err:
        print('File error:'+str(err))
    except pickle.PickleError as perr:
        print('Pickling error:'+str(perr))
    
        
    >>> import nester01
    >>> nester01.print_lol(new_man)
    Is this the right room for an argument?
    No you haven't!
    When?
    No you didn't!
    You didn't!
    You did not!
    Ah! (taking out his wallet and paying) Just the five minutes.
    You most certainly did not!
    Oh no you didn't!
    Oh no you didn't!
    Oh look, this isn't an argument!
    No it isn't!
    It's just contradiction!
    It IS!
    You just contradicted me!
    You DID!
    You did just then!
    (exasperated) Oh, this is futile!!
    Yes it is!

     Finally, display the first and last lines of the data:

    >>> print(new_man[0]) # show the first line
    Is this the right room for an argument?
    >>> print(new_man[-1]) # show the last line
    Yes it is!

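    Summary

      Generic file I/O with pickle is the way to go! Hehe~

      Let Python take care of the file I/O details so you can concentrate on what the code actually does.

      There is now a workable, reliable mechanism for processing, saving, and restoring list data in Python.

      The main tools used in this chapter:

                  strip(): removes unwanted whitespace from a string;

                  print(): BIF whose file argument controls where the data is sent or saved;

                  finally: a suite that always runs, whether or not an exception occurred;

                  except: receives an exception object, which can be bound to an identifier with as;

                  str(): BIF that returns the string representation of any data object;

                  locals(): BIF that returns the collection of variables in the current scope;

                  in: operator that tests for membership;

                  +: concatenates two strings or adds two numbers;

                  with: automatically closes the files it has opened, even if an exception occurs;

                  sys.stdout: standard output in Python; requires importing the sys module;

                  pickle module: efficiently saves Python data objects to disk (in binary) and restores them, using dump() to save and load() to restore.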

    -------------------------------------------The End of Fourth Chapter-------------------------------------------              

  • Original article: https://www.cnblogs.com/heart2futrue/p/7560931.html