为什么百度文库的东西能看到 但是无法用爬虫下载呢?
看了一下 这个样子的, 这是怎么做到的 看上去和原始 pdf 的效果一样。
<v id="pageNo-2" data-page-no="2" data-mate-width="892.949" data-mate-height="1262.85" data-scale="0.707090311596785" class="reader-page" style="height: 1336.46px; position: relative;" data-render="1"><div class="reader-parent-9ac4300002d276a201292e7d reader-parent " style="position: relative;top: 0;left: 0;-webkit-transform: scale(1);-webkit-transform-origin: left top;">
<div class="reader-wrap9ac4300002d276a201292e7d" style="position: absolute;top: 0;left: 0;width: 100%;height: 100%;">
<div class="reader-main-9ac4300002d276a201292e7d" style="position: absolute;top: 0;left: 0;width: 100%;height: 100%;">
<div class="reader-pic-layer" style="z-index:1">
<div class="ie-fix">
<p class="reader-pic-item" style="width: 893px;height: 1194px;z-index: 0;left: 26.003963349999974px;top: 42.21320000000001px;opacity: 1;-webkit-transform: scale(1.058239559574468, 1.0583);;position: absolute;overflow: hidden;">
<img width="893" height="1199" style="position: absolute;top: -0px;left: -0px;clip: rect(0px, 893px, 1199px, 0px);" src="
https://wkbjcloudbos.bdimg.com/v1/docconvert6751/wk/8aa8433afd853765cb3d2880eb888ca0/0.png?responseContentType=image%2Fpng&responseCacheControl=max-age%3D3888000&responseExpires=Wed%2C%2003%20Mar%202021%2020%3A39%3A00%20%2B0800&authorization=bce-auth-v1%2Ffa1126e91489401fa7cc85045ce7179e%2F2021-01-17T12%3A39%3A00Z%2F3600%2Fhost%2Fa88691505b9b4d7885f744ce9029e740796e6a117c4b8de14c45b84c555baa10&x-bce-range=21754-43507&token=eyJ0eXAiOiJKSVQiLCJ2ZXIiOiIxLjAiLCJhbGciOiJIUzI1NiIsImV4cCI6MTYxMDg5MDc0MCwidXJpIjp0cnVlLCJwYXJhbXMiOlsicmVzcG9uc2VDb250ZW50VHlwZSIsInJlc3BvbnNlQ2FjaGVDb250cm9sIiwicmVzcG9uc2VFeHBpcmVzIiwieC1iY2UtcmFuZ2UiXX0%3D.dmoXmRxIEhGxeSVZ4w5dFc1eD8x0T35qwtIJg2law%2BY%3D.1610890740">
</p>