I’m currently parsing the sentence “imAM vAva te jitiM jeSyanti yeyaM mama” in lines 2-3 of the bottom selection. My difficulty is with “yeyaM.” I considered that maybe the phrase broke down into “ye” and “yam” but I don’t think that’s right. Any help?
👍 6
👀 u/tattvamasi108
📅 Apr 16 2019
V8 7.6 Release: "In V8 v7.6, we’ve overhauled our JSON parser to be much faster at scanning and parsing JSON. This results in up to 2.7× faster parsing of data served by popular web pages." v8.dev/blog/v8-release-76
👍 402
👀 u/OlanValesco
📅 Jun 29 2019
Is there a way to speed up parsing a massive list?

I made a playlist of classical albums that have been in my library for at least 2 years, and have less than 2 plays. This is about 9300 tracks. What I want to do is run a shortcut that presents a list of composers, then after selecting one of those, a list of albums with music by that composer on it. I pick an album from that list and it plays the album from the beginning.

I have it all working; however, it takes about 3.5 minutes to generate the list of composers because it has to cycle through all 2500+ dictionaries to pull the data for the lists. Is there any way to do this faster?

FYI: I'm stupid at dictionaries and just barely above stupid at shortcuts.

Here's a version of the dictionary parsing part: https://www.icloud.com/shortcuts/4ff2a5d34dff4b31a149e5892d25e715. I put the JSON dictionaries in a text field at the top; but, in my version, I load it from a file on iCloud Drive. Beware: if you run it, it's going to take a while to complete.

👍 2
👀 u/itsbynnereel
📅 Jan 15 2020
[Kushner] Sources: The Pelicans process to hire a new general manager is expected to ramp up next week. A search firm has been parsing out candidates and is expected to provide ownership a list of 5-10 names next week. From there, they can conduct interviews, pick finalists. twitter.com/ScottDKushner…
👍 82
👀 u/deadskin
📅 Mar 13 2019
Speeding up XML parsing

I am trying to parse some XML files that are up to 10 GB in size.

I have tried xml.etree.ElementTree's iterparse as well as lxml's iterative parsing, since loading files this large directly into memory doesn't feel like the best way to go about it.

The end goal is to get a bunch of this data into a SQLite database, because querying an SQL database will probably be much faster and easier, and a local solution doesn't need many of the features of other SQL solutions like PostgreSQL, or a NoSQL solution.

The problem is that just parsing the XML file takes forever. I'm testing it on a smaller set of data, but it has been running for over 4 hours and still hasn't finished.

Is there some way to improve the speed of this process? Also, when I get my desired data, should I execute inserts one at a time, or build a query string to insert every 1000 rows or so? Which would be faster?

I'm not sure if it's possible to speed things up by multi-threading the operation; I've got limited experience with this.

Any ideas or suggestions are welcome, even if it's telling me that this idea is wrong! Thank you!
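
For readers with the same problem, here is a minimal sketch of one common approach, not a tested fix for the poster's files: stream the document with `xml.etree.ElementTree.iterparse`, discard elements once they have been read, and write rows in batches with `executemany` inside a single transaction. The `record` tag, the `id` attribute, the `name` child, the `records` table, and the batch size of 1000 are all hypothetical placeholders that would need to match the real data.

```python
import io
import sqlite3
import xml.etree.ElementTree as ET

BATCH_SIZE = 1000  # hypothetical batch size; tune for the real data


def load_records(xml_source, conn, tag="record"):
    """Stream <record> elements into SQLite without building the full tree.

    The tag, attribute, child, and table names are illustrative assumptions.
    """
    conn.execute("CREATE TABLE IF NOT EXISTS records (id TEXT, name TEXT)")
    batch = []
    total = 0
    # Ask for "start" events too, so the first event hands us the root element.
    context = ET.iterparse(xml_source, events=("start", "end"))
    _, root = next(context)
    for event, elem in context:
        if event == "end" and elem.tag == tag:
            batch.append((elem.get("id"), elem.findtext("name")))
            total += 1
            # Drop already-processed children so memory use stays roughly flat.
            root.clear()
            if len(batch) >= BATCH_SIZE:
                conn.executemany("INSERT INTO records VALUES (?, ?)", batch)
                batch.clear()
    if batch:
        conn.executemany("INSERT INTO records VALUES (?, ?)", batch)
    conn.commit()  # one commit at the end instead of one per row
    return total


if __name__ == "__main__":
    sample = io.BytesIO(
        b"<data><record id='1'><name>a</name></record>"
        b"<record id='2'><name>b</name></record></data>"
    )
    connection = sqlite3.connect(":memory:")
    print(load_records(sample, connection))
```

Committing once at the end, rather than after every insert, is usually where most of the SQLite time is saved. Whether multi-threading helps depends on where the time actually goes, so profiling the parse loop first is probably worthwhile.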

👍 2
👀 u/penfold1992
📅 Jan 13 2020
Investing with Python code and write up (parsing XML and RSS feeds, emailing and texting in Python and price monitoring using requests and Yahoo Finance)

I thought I'd post my write up and code for using Python to drive an investment strategy based on monitoring and parsing an SEC (Securities and Exchange Commission) database in case it helps any beginning algorithmic investors out there! The code involves parsing both XML and RSS feeds, automatic emailing and texting, and portfolio monitoring.

I would love to hear any thoughts you have on the code itself (it's gotta be terribly inefficient), the strategy, or any tips for improvement! This is my fourth major project, so I'm still quite the beginner.


Link to the write up:

http://hofdata.com/2016/03/18/investing-with-python/

Link to the complete code:

https://github.com/ericlighthofmann/EdgarScrape/blob/master/edgar_email.py
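
For anyone who has not parsed an RSS feed before, here is a minimal sketch of the general idea using only the standard library. It is not taken from the linked code: the sample feed, the `parse_items` and `filter_by_form` helpers, and the assumption that each item title starts with a form type are all made up for illustration.

```python
import xml.etree.ElementTree as ET

# A made-up RSS 2.0 document standing in for a real feed.
SAMPLE_RSS = """<rss version="2.0">
  <channel>
    <title>Example filings feed</title>
    <item>
      <title>Form 4 - EXAMPLE CORP</title>
      <link>https://example.com/filing/1</link>
      <pubDate>Fri, 18 Mar 2016 12:00:00 GMT</pubDate>
    </item>
    <item>
      <title>Form 8-K - SAMPLE INC</title>
      <link>https://example.com/filing/2</link>
      <pubDate>Fri, 18 Mar 2016 13:00:00 GMT</pubDate>
    </item>
  </channel>
</rss>"""


def parse_items(rss_text):
    """Return one dict per <item> with its title, link, and publication date."""
    root = ET.fromstring(rss_text)
    return [
        {
            "title": item.findtext("title"),
            "link": item.findtext("link"),
            "published": item.findtext("pubDate"),
        }
        for item in root.iter("item")
    ]


def filter_by_form(items, form):
    """Keep items whose title starts with the given form type (an assumed convention)."""
    prefix = form + " "
    return [item for item in items if item["title"].startswith(prefix)]


if __name__ == "__main__":
    for entry in filter_by_form(parse_items(SAMPLE_RSS), "Form 4"):
        print(entry["title"], entry["link"])
```

A real feed would be downloaded first (the post mentions `requests`), and real feeds may use XML namespaces, in which case the tag names passed to `iter` and `findtext` would need the namespace prefix.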

👍 48
👀 u/HomerG
📅 Mar 21 2016
I was testing some of my parsing functions and came up with some neat-looking text versions of the maps.

No real backstory, I just needed to see where the walls were for my python client and noticed that it looked pretty neato-burrito. Here's a few of the maps for your viewing pleasure.

Edit: If you're looking at this on your phone, it probably won't look right.

Boombox

                  #############################                  
                ##                             ##                
                #                               #                
                #                               #                
                #                               #                
  ###############                               ###############  
 #             ##                               ##             # 
#               #################################               #
#                   |||||||     #     |||||||                   #
#      P            |||||||           |||||||            P      #
#                _  |||||||           |||||||  _                #
#                   #######           #######                   #
#                               X                               #
#                               X                               #
#                               X                               #
#       _           ########         ########           _       #
#                  #                         #                  #
#|||||| ###                                           ### ||||||#
#|||||| ###                                           ### ||||||#
#                          X         X                          #
# # +                                                       + # #
#  #             +            # ? #            +             #  #
#   #              #                         #              #   #
#?             ####                           ####             ?#
 ######                                                   ###### 
       #                                                 #       
        #                       X                       #        
         ###############################################       

Geokoala

         #####                           #####         
  ########xxx##                         ##xxx########  
 ##O         x#                         #x         O## 
##         _ x#                         #x _         ##

👍 24
👀 u/AMorpork
📅 Apr 09 2014
I bought a new deodorant yesterday. The instructions said "Remove cap and push up bottom"

It hurt like hell, but my farts smell great.

👍 35
👀 u/HellsJuggernaut
📅 Jan 25 2020
A President So Unhinged There Is No Bottom; A Tweet So Bizarre It Cannot Be Parsed. twitter.com/realDonaldTru…
👍 62
👀 u/FowelBallz
📅 Nov 25 2018
TIL: If you gently press the bottom right corner of your eyeball through the eyelid, you see a little dark spot in the top left of your vision. This is because we see the world upside down, but our brain translates it so it looks right side up to us. mentalfloss.com/article/9…
👍 1k
👀 u/jellyjack
📅 Apr 21 2020
New moderator - welcome aboard u/parsing_trees

We are extremely happy to welcome /u/parsing_trees to our mod team. PT has been an amazing contributor to this community, sharing his time and his knowledge with us, and we are very excited to have him working with us to better this sub and bring some cool new features out.

Welcome aboard PT :)

👍 106
👀 u/Henry_Haberdasher
📅 Apr 09 2020
What do we use for PDF parsing these days?

I have around 1000 PDFs of roughly 1000 pages each, and would like to do text analysis. I therefore need to extract the text into a string, which I can then split into sentences, words, etc.

I have tried PyPDF2, and while everything is working as intended, the result is not satisfactory as words are often extracted with no space between them, and the text generally isn't very "clean".

EDIT: I really love reddit, thank you so much for all the answers, read every one of them! I made tika work in a new conda environment, and this has produced results that are just good enough for the task at hand - I might return to this later as I am sure other methods of parsing could give even better results. Thank you!!

Anyone have experience with effective ways of doing text extraction from PDFs?
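
Whichever extractor produces the string, the "split into sentences/words" step can be sketched with the standard library alone. This is a rough illustration, not what the poster ran: the helper names are hypothetical, the sentence splitter is a naive regex that will stumble on abbreviations, and it cannot repair words that the extractor has already glued together.

```python
import re


def clean_text(raw):
    """Tidy extracted text: rejoin hyphenated line breaks, collapse whitespace."""
    text = re.sub(r"(\w)-\n(\w)", r"\1\2", raw)  # "extrac-\ntion" -> "extraction"
    text = re.sub(r"\s+", " ", text)             # newlines and runs of spaces -> one space
    return text.strip()


def split_sentences(text):
    """Naive split after '.', '!' or '?' followed by whitespace."""
    return [s for s in re.split(r"(?<=[.!?])\s+", text) if s]


def split_words(sentence):
    """Return runs of letters, digits, and apostrophes."""
    return re.findall(r"[A-Za-z0-9']+", sentence)


if __name__ == "__main__":
    raw = "Text extrac-\ntion is  messy.\nIt often needs\ncleaning! Does it work?"
    for sentence in split_sentences(clean_text(raw)):
        print(split_words(sentence))
```

For serious text analysis, a dedicated tokenizer would likely do better than these regexes; the sketch only shows where such a step sits after extraction.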

👍 195
👀 u/Doitforafrica
📅 Mar 08 2020
It's not super obvious but I've lost 21kgs with 16:8 (into the 4th month now). Fallen off the wagon plenty of times but hey - once you've hit bottom, the only way is up right? Not even close to my goal weight but it's progress.
👍 7k
👀 u/kimikokash
📅 Nov 03 2019
Tape the bottom of your recliner with packing tape to keep out small pets from hiding under and getting injured. The sticky side will face up, and they won’t like the stickiness on their paws, and avoid the area.
👍 4k
👀 u/fog-mann
📅 Jan 22 2020
Any chance ampersand parsing in tweet previews can be fixed? (“&amp;” > “&”)
👍 475
👀 u/MyCatEdwin
📅 Dec 17 2019
In celebration of Ruby Weapon's trial being called Cinder Drift, here's "Parsing in the 90s"
👍 521
👀 u/FluffyViera
📅 Feb 11 2020
LEGO Reverse Trike 2.0. I overhauled the back and bottom and I’m much happier with it. Minifig is a WIP. May design some decals. Will post instructions and parts list once I figure out how on Stud.io. Thoughts on the clear rims? They are empty in game but light up when you drive. Critiques welcome.
👍 2k
👀 u/ProperTurnip
📅 Mar 04 2020
PSA from SideloadVR - We have upgraded our server for the holidays, but when you get a "parsing error", it is because the internet connection was unstable

Happy Holidays from http://www.SideloadVR.com!

We have upgraded our Amazon EC2 instance to handle the increased load. This should reduce the build time for an .apk. However, the downloader in the Android app is not that sophisticated and when the internet connection is unstable (which most are without people even noticing), the download gets corrupted and a "Parsing Error" will be thrown.

Two ways to fix that:

  • Try again. Once the build phase is done, it's really just a matter of hoping you get the file before a packet drops.

  • if you have access to the original apk (i.e. StreamTheater), then download and use the standalone injector (also on SideloadVR.com)

Our EC2 instance is located in the western USA, and you can try pinging 54.201.16.115 to see if you get dropped packets.


Also, if you have problems setting up your signature (either automatic or manual), try the manual setup using the mobile Chrome browser. We suspect that the Samsung Internet Browser is truncating uploaded signature files. If nothing else helps, set up the signature manually on your computer and upload it here: http://54.201.16.115/apkbaker/uploadID.html


SideloadVR is a free project solely funded by donations. We wish we could spend more time on development to improve the Android app, etc, but currently we are in the crunch to finish our games for Rift and Vive launch. Until we have some time or money to hire additional programmers, we hope you patiently retry the download if you get any parsing errors :).

Cheers,

Mark from http://www.SideloadVR.com

👍 3
👀 u/skyworxx
📅 Dec 27 2015
I made an Up-N-Atomizer Prop ! ( top is game, bottom is mine )
👍 9k
👀 u/TheApropalypse
📅 Sep 03 2019
Customs is a great map but for beginners and early quests, it's terrible. If the devs would open up the bottom area (red) and add a few more holes in walls (yellow), it would be a much better area imo
👍 2k
👀 u/Kanista17
📅 Nov 12 2019
GitHub - lemire/simdjson: Parsing gigabytes of JSON per second github.com/lemire/simdjso…
👍 2k
👀 u/dgryski
📅 Feb 21 2019
STL header token parsing benchmarks for VS2017 and VS2019

One of the MSVC devs poked me to update the results from https://www.reddit.com/r/cpp/comments/860hya/stl_header_token_parsing_benchmarks_for_vs2008/ for VS2019:

Graph of VS2019: https://raw.githubusercontent.com/ned14/stl-header-heft/master/graphs/msvs-2019.png

Comparative graph: https://raw.githubusercontent.com/ned14/stl-header-heft/master/graphs/msvs-history.png

Detailed notes: https://github.com/ned14/stl-header-heft/blob/master/Readme.msvs.md

Project github: https://github.com/ned14/stl-header-heft

There is a lot of good news in this benchmark: overall VS2019 has 9% lower token processing times than VS2017 did. That makes VS2019 almost as quick as VS2008 used to be!

The single biggest surprise is surely now <array>, which is by far the biggest impact STL container now that Microsoft have greatly improved <string> and especially <vector> and <forward_list>.

I'll try to produce updated benchmarks for libstdc++ during this coming week.

I'd like to take this opportunity to thank the Visual C++ team for such outstanding work on slimming down their STL implementation, yet simultaneously implementing a large proportion of C++ 20! Great work guys, wish more STLs were like yours!

👍 78
👀 u/14ned
📅 Jan 27 2020
My sprouting shelves. Top: marigold, purple passion, omnitool, parsley/French parsley/chives/lavender/basil. Bottom: sage, peppermint, red maple, pomegranate (fingers crossed on that one!) m.imgur.com/apeFFNo
👍 3
👀 u/Lirkmor
📅 Jan 31 2019
El-Erian: US stock market could end up dropping 20%-30% before bottom is finally reached cnbc.com/2020/03/09/moham…
👍 447
👀 u/readerseven
📅 Mar 09 2020
Can we just appreciate once again how wonderful this game is! From top to the bottom it's beautiful, fun and super detailed, while swinging is second to none. I can travel the city to collect backpacks and fucking up thugs all day long. This game has the true spirit of playstation - presence of soul
👍 2k
👀 u/Ron1ncat
📅 Jan 04 2020
Street art in my town. The artist spent over 200 hours on it and so many people kept drawing on it he got fed up and abandoned it (see the big gear at the bottom).
👍 829
👀 u/wilmon_
📅 Feb 21 2020
[WP] You've never noticed the (+) in the bottom right of your vision before. Today on your 23rd birthday however, you focus on it intently, as a skill tree pops up before your eyes with 23 points to allocate.
👍 8k
👀 u/waldo667
📅 Aug 21 2019
As a bottom, I hate cleaning myself, going through the process, and having my time wasted by these stupid tops. They hit you up, you give the green light, and they leave you hanging. Sorry... I'm just frustrated. Ugh
👍 1k
👀 u/BlackUniverse93
📅 Oct 14 2019
[Myles Turner] From 1-12 top to bottom this team this roster has sacrificed so much for our nation. our summers, our bodies, our mental. We came up short can’t nobodies more upset than us but I refuse to tolerate any Slander for our play you cannot question our heart

Full Statement

From 1-12 top to bottom this team this roster has sacrificed so much for our nation. our summers, our bodies, our mental. We came up short can’t nobodies more upset than us but I refuse to tolerate any Slander for our play you cannot question our heart

Our character, or are spirit we layed it all out on the line each and every game. Don’t disrespect us this coaching staff or USA Basketball as a whole but respect the world basketball is an international game these countries are talented

JB, Khris, JT, Dmitch, Mason, Marcus, Kemba, D White, BrooK, HB, Joe it’s been an honor I’d go to war with any of these fellas any given night. #Godspeed fellas let’s head home with a Dub 🇺🇸

We’re also the ones who stepped up to the plate when others stepped down. We qualified our nation for the Olympics we got some work to do to rebuild a legacy that was left before us were on the wrong side of history indeed. But that’s gunna make the next medal that much sweeter!

https://twitter.com/Original_Turner/status/1172142615514492930?s=19

👍 2k
👀 u/urfaselol
📅 Sep 12 2019
On this day, 1 year ago, we hit the absolute bottom of $3200. Since then, we are up more than 115%. Dominance went from 55% to more than 66%. Who said we are in a new bear market?
👍 648
👀 u/doctor-crypto
📅 Dec 15 2019
'From the bottom up, the game dies from smurfs and boosters being allowed. That's what killed HoN' - Gorgc clips.twitch.tv/SillyCorr…
👍 1k
👀 u/inzru
📅 Sep 14 2019
Just getting started! Nothing is really high end quality but I figured I'd start from the bottom and work my way up!
👍 759
📅 Mar 06 2020
simdjson-go: Parsing gigabytes of JSON per second in Go blog.min.io/simdjson-go-p…
👍 208
👀 u/klauspost
📅 Feb 11 2020
[Question] How to fix? iPad 9.3.5 jailbreak not parsing any repos
👍 10
👀 u/itsyeboicaleb
📅 Apr 07 2020
As transcripts are available for every Bon Appetit video, I couldn't help parsing It's Alive.

A few notes before we get started:

  1. This is 68 episodes of It's Alive. I could not find Going Places on the website.

  2. I may have made errors trying to copy and paste everything into 1 text document.

  3. Transcripts were definitely done by a different person on different episodes. One will use Vinny (said 498 times), and others use Vinnie (128 times).

Let's first go through the Brad Leone Bingo card:
a) Oh no - 39
b) Vincenzo - 19
c) I believe - 25
d) garlic - 193 / allicin - 17
e) Fermentation station - 15
f) Vessel - 24
g) Bad boy - 51
h) Cut that - 10
i) "water" - 256 / wourder - 4...
j) Who's better than - 11
k) Beautiful thing - 15 (Beautiful - 153)
l) It's been - 39
m) Big boy - 45
n) End of the day - 23
o) Upsize down - 4
p) Fact check - 7
q) I wish you guys could smell - 5
r) Good boy - 31

The ones I wanted to check:
Sumac - 10 (lower than I thought)
One shot one Kill - 11
Delaney - 11
Botulism - 10
Goobalini - 5
edit: landing zone - 5

If you have any other suggestions, here is the text document to..... enjoy: https://drive.google.com/open?id=1qRs861ybJM1AOPwgQ5lA8bDbU4NddHwb
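
For anyone who wants to run their own counts on the text document, here is a minimal sketch of case-insensitive phrase counting. It is not the method used for the numbers above (the post does not say how they were produced); `count_phrase` and `count_variants` are hypothetical helpers, and the sample string is made up.

```python
import re


def count_phrase(text, phrase):
    """Count whole-word, case-insensitive occurrences of a phrase."""
    pattern = r"\b" + re.escape(phrase) + r"\b"
    return len(re.findall(pattern, text, flags=re.IGNORECASE))


def count_variants(text, variants):
    """Sum the counts of several spellings, e.g. 'Vinny' and 'Vinnie'."""
    return sum(count_phrase(text, variant) for variant in variants)


if __name__ == "__main__":
    sample = "Oh no, Vinny! Vinnie said oh no again. Big boy garlic, garlic."
    print(count_phrase(sample, "oh no"))
    print(count_variants(sample, ["Vinny", "Vinnie"]))
```

Note that a short phrase is also counted inside longer ones, which matches the post's own observation that "Beautiful" (153) includes every "Beautiful thing" (15).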

👍 57
👀 u/Rossaaa
📅 Apr 24 2020
So a friend of mine drew these during 2 separate times, with each one taking up the entirety of our lunch period, meaning she did each of these in around 50-60 minutes. She even put the episode that Overhaul shows his quirk in at the bottom of his name.
👍 5k
👀 u/EpicgamerTN
📅 Oct 23 2019
A turkey flew into my bedroom, and my roommates set up a GoPro. You can see my smashed window frame in the bottom corner. The turkey also pooped on my bed. v.redd.it/nj8ifizz63p41
👍 294
👀 u/mrjohnmay
📅 Mar 26 2020
1 month old (top). Was told she would amount to NOTHING! That I should cut my losses (thx reddit). Anyways, I didn't, and she ended up being the mother of some fire that smells like peach rings outta the jar. SO GLAD I pushed through with this "hopeless" little plant. She produced absolute fire! (bottom) 😁
👍 118
👀 u/Webb1003
📅 Apr 14 2020
Deployed Christmas Cards. Let's see yours. "Your life has already hit the bottom and there's nowhere else to go but up."
👍 1k
👀 u/joshminer83
📅 Dec 18 2019
PcapPlusPlus v19.12 released!! A multi-platform C++ library for capturing, parsing and crafting of network packets github.com/seladb/PcapPlu…
👍 83
👀 u/seladb
📅 Dec 04 2019
Got a new deodorant stick today. Instructions said "Remove cap and push up bottom"

I can barely walk, but every time I fart the room smells great!

👍 73
👀 u/dandan_56
📅 Mar 03 2018
Investing with Python code and write up (parsing XML and RSS feeds, emailing and texting in Python and price monitoring using requests and Yahoo Finance)

I thought I'd post my write up and code for using Python to drive an investment strategy based on monitoring and parsing an SEC (Securities and Exchange Commission) database in case it helps any beginners/intermediates looking to do a similar project! The code involves parsing both XML and RSS feeds, automatic emailing and texting, and portfolio monitoring.

I would love to hear any thoughts you have on the code itself (it's gotta be terribly inefficient), the strategy, or any tips for improvement! This is my fourth major project, so I'm still quite the beginner.


Link to the write up:

http://hofdata.com/2016/03/18/investing-with-python/

Link to the complete code: https://github.com/ericlighthofmann/EdgarScrape/blob/master/edgar_email.py

👍 34
👀 u/HomerG
📅 Mar 19 2016
