this post was submitted on 19 Jun 2023
11 points (100.0% liked)

FREEMEDIAHECKYEAH


🍿 📺 🎵 🎮 📗 📱


🏴‍☠️ Wiki / 💬 Chat


Rules

1. Please be kind and helpful to one another.

2. No racism, sexism, ableism, homophobia, transphobia, spam.

3. Linking to piracy sites is fine, but please keep links directly to pirated content in DMs.

founded 1 year ago

I recently bought an online course. Their website says notes will be provided, but what they offer for download under that name is only a 2-page summary; the original note PDFs are not downloadable. I tried to download the PDF from their site with IDM, but the file came out corrupted. I also tried force-copying the text and pasting it into another document, but that failed because it pastes some encoded gibberish. The only way I have found is to screenshot every single page on my PC and paste the images into Word, which is very time-consuming. Is there another way to download the PDF from their site? Edit: it looks like the best way is to capture it through screenshots.

top 21 comments
godless@latte.isnot.coffee 8 points 1 year ago (1 children)

Without telling us the exact site, the only answer you can expect to get is a resounding "maybe".

inosuke@lemmy.fmhy.ml 2 points 1 year ago (1 children)

It is an Indian website, and all the content is DRM protected.

You're not helping if you don't tell us the exact name of the website. Every DRM implementation works differently.

lavafroth@programming.dev 4 points 1 year ago (2 children)

Try reverse engineering the DRM. Most DRMs are stupid simple to break. Look at the JS code or the network tab under devtools. Maybe you can get the key.

If all else fails, log in through a headless browser session using Python + Selenium and use some quick and dirty scripting to screenshot each page.
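A minimal sketch of that fallback, under several assumptions: the viewer URL and the page selector below are hypothetical placeholders (the site was never named in this thread), the login step is left as a comment because it is site-specific, and the helper names are made up for illustration. It only uses the Selenium calls `find_elements`, `execute_script` and `WebElement.screenshot`. Viewers that render pages lazily may need extra waiting or scrolling that is not shown here.

```python
from pathlib import Path

# Hypothetical placeholders: the real site and markup were not shared in the thread.
VIEWER_URL = "https://example.com/course/notes"
PAGE_SELECTOR = "div.page"


def capture_pages(driver, selector, out_dir):
    """Save one PNG per element matching `selector`; return the file paths."""
    out = Path(out_dir)
    out.mkdir(parents=True, exist_ok=True)
    saved = []
    # "css selector" is the string value of selenium's By.CSS_SELECTOR.
    pages = driver.find_elements("css selector", selector)
    for number, page in enumerate(pages, start=1):
        # Bring the page into view so it has a chance to render before capture.
        driver.execute_script("arguments[0].scrollIntoView();", page)
        target = out / f"page_{number:03d}.png"
        page.screenshot(str(target))
        saved.append(target)
    return saved


def main():
    # Imported here so the helper above can be used without Selenium installed.
    from selenium import webdriver

    options = webdriver.FirefoxOptions()
    options.add_argument("-headless")
    driver = webdriver.Firefox(options=options)
    try:
        driver.get(VIEWER_URL)
        # Site-specific login would go here (fill in the form, submit, wait).
        for path in capture_pages(driver, PAGE_SELECTOR, "pages"):
            print("saved", path)
    finally:
        driver.quit()
```

Calling `main()` after filling in the two placeholders would write `pages/page_001.png`, `pages/page_002.png`, and so on.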

inosuke@lemmy.fmhy.ml 2 points 1 year ago (1 children)

https://drive.google.com/drive/folders/1ussgvci_rGPZLrm4eLTEfkbJRv5Sgh3s?usp=sharing Take a look at this screenshot. It looks like every page is in a div container and every piece of text is in a span container, but in a distorted format.

lavafroth@programming.dev 3 points 1 year ago (1 children)

I can't view this because the screenshot is not public and I don't have a Google account to request access. In general, you will want to right click the div, copy its selector (or, if you wanna be specific, the XPath) and use that in Selenium to save the text or the image content.

inosuke@lemmy.fmhy.ml 1 points 1 year ago (1 children)
lavafroth@programming.dev 3 points 1 year ago* (last edited 1 year ago) (1 children)

Seems like you can remove the header and the sidebar HTML nodes, set the width of the pdfViewer class to 100% and hit Ctrl+P. Alternatively, you may save one "page" at a time using Ctrl+Shift+S, as described by @tumulus_scrolls.
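The same cleanup can be scripted instead of done by hand in devtools. This is only a sketch: `.pdfViewer` is the class named above, while the `header` and `#sidebar` selectors and the function name are guesses that would have to be replaced with whatever the real page uses. It relies on Selenium's `execute_script`, which exposes extra Python arguments to the page script as `arguments[0]`, `arguments[1]`, and so on.

```python
# JavaScript run inside the page: delete the unwanted nodes, then widen the viewer.
CLEANUP_JS = """
for (const sel of arguments[0]) {
  document.querySelectorAll(sel).forEach(node => node.remove());
}
document.querySelectorAll(arguments[1]).forEach(viewer => {
  viewer.style.width = "100%";
});
"""


def prepare_for_print(driver,
                      remove_selectors=("header", "#sidebar"),  # assumed selectors
                      viewer_selector=".pdfViewer"):             # class from the thread
    """Strip page chrome and stretch the viewer so printing shows only the document."""
    driver.execute_script(CLEANUP_JS, list(remove_selectors), viewer_selector)
```

After `prepare_for_print(driver)` the page should be in the state described above, ready for Ctrl+P in a visible browser window or for a screenshot pass.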

inosuke@lemmy.fmhy.ml 2 points 1 year ago

Yes, I think the best and easiest way is to take screenshots. The document is secured and the text is enciphered. I tried to download their videos, but they blocked me from their site too.

dekatron@lemmy.fmhy.ml 2 points 1 year ago

I found another site that seems to use the same backend and pdf renderer. I looked at the network tab, and it downloads a .epdf file, but I'm not sure how to open it. If you want to have a go at cracking it, here's a link to it: https://pastebin.com/w6JkNDYH
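A cheap first check on a file like that is whether it is an ordinary PDF under a different extension. The sketch below only looks for the standard `%PDF-` header near the start of the file; the function names and the `notes.epdf` file name are made up for illustration, and nothing here is known about the actual .epdf format. If the header is missing, the file is probably encrypted or wrapped in some custom container, and this check says nothing about how.

```python
from pathlib import Path


def looks_like_pdf(data: bytes) -> bool:
    """Return True if the standard PDF header appears near the start of `data`."""
    # A PDF starts with "%PDF-<version>"; lenient readers tolerate some leading bytes.
    return b"%PDF-" in data[:1024]


def describe(path):
    """Read a downloaded file and report whether it looks like a plain PDF."""
    data = Path(path).read_bytes()
    if looks_like_pdf(data):
        return f"{path}: has a PDF header, try renaming it to .pdf"
    return f"{path}: no PDF header in the first 1024 bytes, likely encrypted or wrapped"
```

For example, `print(describe("notes.epdf"))` on the file saved from the network tab.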

theskyisfalling@lemmy.fmhy.ml 3 points 1 year ago (2 children)
inosuke@lemmy.fmhy.ml 2 points 1 year ago

The PDF only opens on their website, in a window. It is DRM-protected content.

inosuke@lemmy.fmhy.ml 2 points 1 year ago* (last edited 1 year ago)

I tried, but it's not working. It is not a PDF; somehow all the pages are in separate div containers, and the text inside each page is in span containers, but in a distorted form.

dekatron@lemmy.fmhy.ml 3 points 1 year ago (1 children)

If you are not able to share the name of the website, could you maybe share a screenshot of how the PDF is displayed on the website? That would be a bit more helpful towards solving your problem.

inosuke@lemmy.fmhy.ml 3 points 1 year ago* (last edited 1 year ago) (1 children)

https://temp-file.org/jcwX1twb7yjgByb/preview https://temp-file.org/hIjgiABXlvinKlh/preview https://temp-file.org/TiC5Kp8vpya5YsM/preview https://temp-file.org/Z3df1Yzh47CvtIu/preview It looks like this. Every page seems to be in a div container, and every element or piece of text on a page is in a separate span container; even inside the span, the words are encoded and not shown accurately.

dekatron@lemmy.fmhy.ml 3 points 1 year ago (1 children)

Looks like the link you shared is not set to public. Just a heads-up: Google Drive will show your account name and profile picture for shared files. If you don't want that, you could upload the image directly here in a Lemmy comment, or use something like Pixelfed.

administrator@lemmy.pro 3 points 1 year ago (1 children)

Search the internet for the file name, maybe?

inosuke@lemmy.fmhy.ml 2 points 1 year ago

Their notes are like a compilation of several books for a particular exam. I can find the books online, but not their notes.

tumulus_scrolls@lemmy.fmhy.ml 3 points 1 year ago* (last edited 1 year ago) (1 children)

They may not expose the actual PDF to you at all, just some software rendering of it. In that case I'd focus on making screenshotting efficient. Find a program that lets you save the whole screen to a file automatically at once (one button press), or use Firefox's Ctrl+Shift+S -> click on the page area -> save -> Enter.

inosuke@lemmy.fmhy.ml 1 points 1 year ago* (last edited 1 year ago)

I think the same, because I have tried every way that I know. In the developer tools, every page shows up inside a div container, and every element on that page, word or text, is in a span container. Even there the text is not written directly in the span; the words are distorted. https://temp-file.org/jcwX1twb7yjgByb/preview https://temp-file.org/hIjgiABXlvinKlh/preview https://temp-file.org/TiC5Kp8vpya5YsM/preview https://temp-file.org/Z3df1Yzh47CvtIu/preview Have a look.
