zoukankan      html  css  js  c++  java
  • 抓取登录后的数据

        这次是应一个客户需要,抓取另外一个网站的数据,包括数据提交。这些操作需要在登录之后完成。技术上没有什么难点。关键都是用fiddler找到参数和url。

    记住登录状态

        HttpClient能够记住登录状态的,登录完了可以讲Httpclient保存起来。

     private HttpClient _client;
            public HttpClient HttpClient
            {
                get
                {
                    if (_client == null)
                    {
                        if (Session["Client"]!= null)
                        {
                            _client = Session["Client"] as HttpClient;
                        }
                        else
                        {
                            var handler = new HttpClientHandler
                            {
                                AutomaticDecompression = DecompressionMethods.GZip,
                                UseCookies = true,
                                Proxy =
                                    new WebProxy("http://ip:8080/", true, null,
                                        new NetworkCredential("username", "pwd", "domain"))
                            };//代理 
                            _client = new HttpClient(handler);
                            _client.DefaultRequestHeaders.Add("user-agent", "Mozilla/5.0 (Windows NT 6.1; WOW64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/36.0.1985.143 Safari/537.36");
                            ClientLogin(new ClientLogoModel());
                            Session["Client"] = _client;
                        }
                      
                    }
    
                    return _client;
                }
            }

     因为目标网站都是用的json传的参数。也是用json返回的参数。不是form提交的格式。所以post之前也要将参数转成json。

     public object ClientLogin(ClientLogoModel logoModel)
            {
                if (logoModel == null)
                {
                    logoModel=new ClientLogoModel();
    
                }
                var data = JsonConvert.SerializeObject(logoModel); ;
                var logoParams = new List<KeyValuePair<string, string>>();
                logoParams.Add(new KeyValuePair<string, string>("data", data));
                var response = _client.PostAsync(new Uri(LogonUrl), new FormUrlEncodedContent(logoParams)).Result;
                var result = response.Content.ReadAsStringAsync().Result;
                return result;
            }

    返回数据转化

    从Fiddler左边获得Url,右边TextView上方是参数格式,下方是返回的数据格式。 

    每次都要转换,写成泛型。

        public T GetTList<T>(object obj, string url)
            {
                var data = JsonConvert.SerializeObject(obj); ;
                var paramList = new List<KeyValuePair<String, String>> { new KeyValuePair<string, string>("data", data) };
                var response = HttpClient.PostAsync(new Uri(url), new FormUrlEncodedContent(paramList)).Result;
    
                var result = response.Content.ReadAsStringAsync().Result;
                return JsonConvert.DeserializeObject<T>(result);
            }

    调用:

        public ActionResult TradePage(TradeQueryParm param)
            {
           var data = GetTList<TradeRequstResult>(obj, tradeListUrl);
                return PartialView(data);
            }

     前端再将参数传递过来。

      $.post("/Trade/TradePage", {
                        agentName: agentName, shortName: shortName,
                        startDate: startDate, endDate: endDate, page: cpage
                    }, function (data) {
                        $("#mtable").html(data);
      }
    HttpClient 上传图片:
       private string UploadImage(string fileName,string path)
            {
                FileStream aFile = new FileStream(path, FileMode.Open);
                MultipartFormDataContent form = new MultipartFormDataContent();
                var  content = new StreamContent(aFile);
                content.Headers.ContentType = new MediaTypeHeaderValue("image/jpeg");
                content.Headers.ContentDisposition = new ContentDispositionHeaderValue("form-data")
                {
                    Name = "protocolFile",
                    FileName = fileName
                };
                form.Add(content);
                var response = HttpClient.PostAsync(imgLoadUrl, form).Result;
                return response.Content.ReadAsStringAsync().Result;
            }

  • 相关阅读:
    物联网相关期刊会议
    Java O O第十一天
    mac下安装伪分布hadoop2.6.0和hbase1.0.1.1
    Ubuntu下使用openssl为apache2配置ssl
    ubuntu下安装hadoop
    ubuntu 添加应用到Dash启动器
    Ubuntu12.04中安装Oracle JDK和NetBeans的方法
    Ubuntu12.04中在桌面建立指向网页的链接文件的方法
    Ubuntu12.04中Gvim无法固定到启动器的解决办法
    Ubuntu中apt-get安装或更新软件错误的解决办法
  • 原文地址:https://www.cnblogs.com/stoneniqiu/p/5005890.html
Copyright © 2011-2022 走看看