umami_wasbi

joined 1 year ago
[–] umami_wasbi@lemmy.ml 5 points 2 days ago* (last edited 2 days ago) (1 children)

I don't have a single guide for you, but I can lay out a road map.

  1. A programming language. I prefer Python.
  2. Basic HTML syntax and CSS selectors
  3. HTTP, specifically methods, status codes (no need to memorize them all cuz you can look them up), and cookies
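To give a feel for item 3, here is a rough sketch using only Python's standard library. It looks up status codes instead of memorizing them and picks the name/value pair out of a `Set-Cookie` header. The helper names `describe_status` and `parse_set_cookie` and the sample cookie are made up for illustration.

```python
from http import HTTPStatus
from http.cookies import SimpleCookie


def describe_status(code: int) -> str:
    """Return a readable label such as '404 Not Found' for a status code."""
    status = HTTPStatus(code)
    return f"{status.value} {status.phrase}"


def parse_set_cookie(header_value: str) -> dict[str, str]:
    """Turn a Set-Cookie header value into a {name: value} dict.

    Attributes such as Path or HttpOnly are cookie metadata, not cookies,
    so they do not show up as keys.
    """
    cookie = SimpleCookie()
    cookie.load(header_value)
    return {name: morsel.value for name, morsel in cookie.items()}


if __name__ == "__main__":
    # Look the codes up instead of memorizing them.
    for code in (200, 301, 404, 429):
        print(describe_status(code))
    # Hypothetical header value, as a server might send it.
    print(parse_set_cookie("session=abc123; Path=/; HttpOnly"))
```

Once this makes sense, the headers you see in your browser's network tab should start to look familiar.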

Once you have those foundations ready, you can go on and try to build a web scraper. I advise against using Scrapy. Not because it is bad, but because it is too overwhelming and abstracted for a beginner. Instead, I advise you to use requests for HTTP and BeautifulSoup4 for HTML parsing. You will build a more solid foundation and can transition to Scrapy later when you need its advanced functions.
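As a starting point, a minimal requests + BeautifulSoup4 scraper could look roughly like the sketch below. It assumes both packages are installed (`pip install requests beautifulsoup4`). The function names, the `a[href]` selector, and the example.com URL are placeholders I picked, so swap in whatever your target page needs.

```python
import requests
from bs4 import BeautifulSoup


def fetch_html(url: str) -> str:
    """Download a page and return its HTML, raising on 4xx/5xx responses."""
    response = requests.get(url, timeout=10)
    response.raise_for_status()
    return response.text


def extract_links(html: str, selector: str = "a[href]") -> list[tuple[str, str]]:
    """Return (link text, href) pairs for every element matching the CSS selector."""
    soup = BeautifulSoup(html, "html.parser")
    return [(a.get_text(strip=True), a["href"]) for a in soup.select(selector)]


if __name__ == "__main__":
    # Placeholder URL; replace it with a page you are allowed to scrape.
    html = fetch_html("https://example.com/")
    for text, href in extract_links(html):
        print(text, href)
```

Keeping the download and the parsing in separate functions means you can practice the parsing part on a saved HTML file without hitting the site over and over.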

When you get stuck, don't be afraid to pause your attempt and read the tutorials again. Head to the Python Community on Discord to get interactive help. We welcome noobs, as we were once noobs too. Just don't ever mention scraping there, as they can't help if they suspect you're trying to do something inappropriate, malicious, or illegal. They are notoriously against yt-dlp, which frustrates me a bit. Phrase your question nicely and in a generic way. I will be there occasionally offering help.

[–] umami_wasbi@lemmy.ml 9 points 2 days ago (3 children)

There is no shortcut of the kind you're looking for. It seems you don't have a programming background. If you really need to scrape something, you need to learn a programming language, HTTP, HTML, and maybe JavaScript. AFAIK, there is no easy way or point-and-click scraper building tool. You will need to invest time and learn. Don't worry, you should be able to get it done in 2-3 months if you put the time in.

[–] umami_wasbi@lemmy.ml 2 points 2 days ago

It is an OK tool to get things started.

[–] umami_wasbi@lemmy.ml 1 points 1 week ago

Oops. Missed that part.

[–] umami_wasbi@lemmy.ml 2 points 1 week ago* (last edited 1 week ago)

I use BTRFS for snapshots and automatic compression. Maybe it can be done with RAID on top of LVM? AFAIK BTRFS redundancy is basically the same as traditional RAID, similar to using mdadm. Still, you would want a backup strategy instead of relying on disk redundancy. I learned that the hard way.

[–] umami_wasbi@lemmy.ml 3 points 1 week ago* (last edited 1 week ago) (11 children)

I would just skip RAID, add all the disks to a single BTRFS filesystem, and use the built-in profiles for (meta)data redundancy.

I don't know much about caching, tho.

https://btrfs.readthedocs.io/en/latest/btrfs-device.html

[–] umami_wasbi@lemmy.ml 2 points 1 week ago (6 children)

Is this finally the dusk of SO? It helps a lot, but also sucks a lot.

[–] umami_wasbi@lemmy.ml 39 points 1 week ago* (last edited 1 week ago) (2 children)

Yay, more subscriptions.

👌Adobe, I am sticking to my Affinity Photo 1.

[–] umami_wasbi@lemmy.ml 7 points 1 week ago (1 children)

It is their job to find evidence, not my responsibility to provide it.

[–] umami_wasbi@lemmy.ml 3 points 1 week ago

I'm on S21FE and it does NOT.

[–] umami_wasbi@lemmy.ml 3 points 1 week ago* (last edited 1 week ago)

Not that hard to deal with, honestly. Rebooting at night while I'm sleeping does not reduce any functionality, cuz I'm not using it. If someone needs to reach me during the night, he'd better call me, cuz I won't wake up to a notification, which is also suppressed by DND. Yeah, it is not designed for security, but a solution is better than none.

Furthermore, rebooting the device periodically is good for security, especially for non-persistent fileless malware.

[–] umami_wasbi@lemmy.ml 7 points 1 week ago* (last edited 1 week ago) (9 children)

It does, labeled "Auto Restart", but only when "performance issues detected" or at a specified time. Apple is quite late on this feature.

Screenshot of Android Auto Restart Settings page

 

There are reports in The Register's comment section that Malaysia didn't only redirect DNS traffic, but also took active measures to block VPNs and to MITM DoH, with Cloudflare's DoH returning a local ISP certificate.

In fact, some ISPs like Maxis and Yes were already blocking VPNs (I see a lot of complaints on Lowyat.net about Maxis blocking VPNs, and I was using Yes WiMax and experienced the blocking firsthand. I couldn't connect to PPTP endpoints, and L2TP endpoints caused the modem to disconnect from the network and reboot).

They were outright trying a MITM redirect attack on those using DoH. Many reported error messages saying that Cloudflare's DoH servers were practically returning the certificate for Telekom Malaysia's DNS servers.

Even with many new technologies, I realized that I am not as safe and free as I want to be, and maybe neither are you.

 

If $70 + $10/mo can get me through all those annoying CAPTCHAs, I will gladly pay. Of course, if a cheaper or even free solution exists, I will use it. My only requirement is that it works 90%+ of the time.

27
submitted 3 months ago* (last edited 3 months ago) by umami_wasbi@lemmy.ml to c/linux@lemmy.ml
 

I want to check if my Lenovo T480 is affected by the recent PKfail, but I have no idea how to extract the BIOS firmware for validation. Can someone detail the steps? Thanks.

40
submitted 4 months ago* (last edited 4 months ago) by umami_wasbi@lemmy.ml to c/selfhosted@lemmy.world
 

Just wondering what happens if my mail server goes offline for some period and the sending party can't deliver.

Will there be any consequences, other than me not getting the mail? I tried searching, but the results are all from the perspective of a sender getting a bounce, rather than the other way around.

20
submitted 4 months ago* (last edited 4 months ago) by umami_wasbi@lemmy.ml to c/selfhosted@lemmy.world
 

Saw they have a promotion: £1/mo with no setup fee when you pay for a 12-month contract on the lowest-end VPS. Has anyone used them before?

Just planning to run frp on it. https://github.com/fatedier/frp

 

Lesson learnt: don't ever buy a used server from Quanta

Also, doesn't Epyc have an eFuse that will pair it with the mobo?

 

LOL

 

archive.is

Should we trust an LM to write legal definitions, of a deepfake in this case? It seems the state rep. is unable to proofread the model output, as he is "really struggling with the technical aspects of how to define what a deepfake was."

 

If a stamp has a barcode, why not just let people who have printers at home print it on the envelope directly? This eliminates the need to buy a physical stamp, and thus the chance of buying counterfeit stamps.

 

I want to host a small game server for friends and myself at home, but I don't want to open up the firewall. Are there any tunneling solutions that support UDP? Thanks.
