Unicode Normalization Forms: When ΓΆ β‰  ΓΆ blog.opencore.ch/posts/un…
πŸ‘︎ 2
πŸ’¬︎
πŸ‘€︎ u/qznc_bot2
πŸ“…︎ Dec 31 2021
🚨︎ report
Unicode Normalization for NLP in Python youtube.com/watch?v=9Od9-…
πŸ‘︎ 8
πŸ’¬︎
πŸ‘€︎ u/jamescalam
πŸ“…︎ Mar 18 2021
🚨︎ report
Why are there 4 modes of unicode normalization? (β€˜NFC’, β€˜NFKC’, β€˜NFD’, and β€˜NFKD’)

Reading python doc I understand its purpose is to make sure characters which look identical actually are identical. But why are there 4 ways of normalization? When should I use each one?

πŸ‘︎ 3
πŸ’¬︎
πŸ‘€︎ u/netok
πŸ“…︎ May 13 2019
🚨︎ report
utf8rewind 1.2.0 - Adds support for Unicode 7.0 case mapping and normalization bitbucket.org/knight666/u…
πŸ‘︎ 24
πŸ’¬︎
πŸ‘€︎ u/knight666
πŸ“…︎ Mar 23 2015
🚨︎ report
This week in Perl 6: Unicode normalization, many RTs 6guts.wordpress.com/2015/…
πŸ‘︎ 7
πŸ’¬︎
πŸ‘€︎ u/szabgab
πŸ“…︎ Apr 13 2015
🚨︎ report
Unicode normalization

I'm trying to write a "console" in my game, which would accept any key as input and then match the entered string against a list of accepted commands. However, I'm not entirely sure how Gamemaker handle input on keyboards that are not American QWERTY-style layouts. I know that there are many countries which naturally type letters that have accents or umlauts, etc. and usually there exists some library in a programming language that allows you to normalize those into a standard representation, such as a with accent becoming an a. I'm assuming that if any key is allowed, Gamemaker will put that value into keyboard_lastchar instead of an ASCII a? And if it does, I don't see any documentation or functions that would allow me to turn an accented a string into an ASCII a string, so I'm not sure how I would properly match against a theoretical command "a" for all potential keyboard layout.

Can someone give me a basic rundown of how I would go about accepting arbitrary key input from any keyboard layout (e.g.: Swedish or Italian) and then match that input against my own ASCII string commands? I'm using GameMaker: Studio 2.

πŸ‘︎ 3
πŸ’¬︎
πŸ‘€︎ u/freezerburnv
πŸ“…︎ Apr 29 2017
🚨︎ report
6guts: Unicode normalization, many RTs 6guts.wordpress.com/2015/…
πŸ‘︎ 5
πŸ’¬︎
πŸ‘€︎ u/MattEOates
πŸ“…︎ Apr 12 2015
🚨︎ report
Unicode Normalization and Android daniel-codes.blogspot.com…
πŸ‘︎ 9
πŸ’¬︎
πŸ‘€︎ u/rkcr
πŸ“…︎ Apr 10 2012
🚨︎ report
The Unicode Implementor's Guide - Part 2 : Normalization useless-factor.blogspot.c…
πŸ‘︎ 19
πŸ’¬︎
πŸ‘€︎ u/dharmatech
πŸ“…︎ Jul 12 2007
🚨︎ report
String normalization with Unicode::Normalize cattlegrid.info/blog/2009…
πŸ‘︎ 7
πŸ’¬︎
πŸ‘€︎ u/chorny
πŸ“…︎ Oct 15 2009
🚨︎ report
Unicode Normalization Form C diveintomark.org/archives…
πŸ‘︎ 6
πŸ’¬︎
πŸ‘€︎ u/shabda
πŸ“…︎ Mar 17 2007
🚨︎ report
What does this emoji mean and why is it a normal unicode character and not an emoji⚧

It was in my emoji list but like it doesnt show up as a normal emoji for me

Also why does literally everyone have 🏳️⚧ on their bio

πŸ‘︎ 5
πŸ’¬︎
πŸ‘€︎ u/TheOneTEM
πŸ“…︎ Dec 29 2021
🚨︎ report
3dMax|Vray 5 - Error and crash: Invalid geometric normal (0 0 0) for "Non-converted Unicode render instance name"

Hi people!

I am having an error when rendering, Warning: Invalid geometric normal (0 0 0) for "Non-converted Unicode render instance name", after this the program crash.Searching on the internet (Chaos forum) I found that the solution can recreate the proxies, but I don't have any active proxies other than forest pack, and when isolating it renders perfectly

Does anyone have any idea what may be causing this error or where I can start investigating in my model?

A screenshot that i took of the error

Edit: Already solved, was stair's normals

πŸ‘︎ 3
πŸ’¬︎
πŸ‘€︎ u/Dantoj
πŸ“…︎ Dec 30 2021
🚨︎ report
How to replace all other style unicode chars with a normal alphabet in javascript? like replacing this π–‹π–”π–šπ–“π–‰ with found

found

π–‹π–”π–šπ–“π–‰

https://stackoverflow.com/questions/59613915/javascript-function-to-convert-unicode-pseduo-alphabet-to-regular-characters

also, this didn't work

let test='π–‹π–”π–šπ–“π–‰';
alert(test.normalize());
πŸ‘︎ 4
πŸ’¬︎
πŸ‘€︎ u/hwpcspr112
πŸ“…︎ Jun 10 2021
🚨︎ report
Please help me find the name of this Unicode. I used to be able to see it as normal on my phone but after the update, I can only see it as a box. Thank you in advanced.
πŸ‘︎ 17
πŸ’¬︎
πŸ“…︎ Feb 20 2021
🚨︎ report
β…ˆ ITALIC SMALL VOWEL THAT SHOULD BE IN THE STEM OF THE PAST PARTICIPLE OF WHAT IS DIFFERENT ABOUT THIS LETTER TO THE NORMAL ITALIC VERSION BUT IS NOT BECAUSE THE UNICODE CONSORTIUM IS NOT GOOD ENOUGH AT ENGLISH
πŸ‘︎ 5
πŸ’¬︎
πŸ‘€︎ u/b1t6u
πŸ“…︎ May 05 2021
🚨︎ report
When "ZoΓ«" !== "ZoΓ«". Or why you need to normalize Unicode strings withblue.ink/2019/03/11/w…
πŸ‘︎ 362
πŸ’¬︎
πŸ‘€︎ u/ItalyPaleAle
πŸ“…︎ Mar 12 2019
🚨︎ report
Can ya'll help me? My cities district names are replaced with unicode characters, except for park areas, is this normal? Has anybody else witnessed this? I, just, need, help!
πŸ‘︎ 3
πŸ’¬︎
πŸ‘€︎ u/MKLoadedVR
πŸ“…︎ May 22 2020
🚨︎ report
When "ZoΓ«" !== "ZoΓ«". Or why you need to normalize Unicode strings withblue.ink/2019/03/11/w…
πŸ‘︎ 155
πŸ’¬︎
πŸ‘€︎ u/ItalyPaleAle
πŸ“…︎ Mar 12 2019
🚨︎ report
Why you need to normalize Unicode strings withblue.ink/2019/03/11/w…
πŸ‘︎ 36
πŸ’¬︎
πŸ‘€︎ u/cryptoz
πŸ“…︎ Mar 14 2019
🚨︎ report
yt-dlp release 2021.10.09

Release: https://github.com/yt-dlp/yt-dlp/releases/latest

Github: https://github.com/yt-dlp/yt-dlp
Documentation: github, readthedocs
PyPi: https://pypi.org/project/yt-dlp
Discord: https://discord.gg/H5MNcFW63r
Donate: Collaborators.md


Changelog:

  • Improved progress reporting
    • Separate --console-title and --no-progress
    • Add option --progress to show progress-bar even in quiet mode
    • Fix and refactor minicurses and use it for all progress reporting
    • Standardize use of terminal sequences and enable color support for windows 10
    • Add option --progress-template to customize progress-bar and console-title
    • Add postprocessor hooks and progress reporting
  • [postprocessor] Add plugin support with option --use-postprocessor
  • [extractor] Extract storyboards from SMIL manifests by fstirlitz
  • [outtmpl] Alternate form of format type l for \n delimited list
  • [outtmpl] Format type U for unicode normalization
  • [outtmpl] Allow empty output template to skip a type of file
  • Merge webm formats into mkv if thumbnails are to be embedded
  • [adobepass] Add RCN as MSO by jfogelman
  • [ciscowebex] Add extractor by damianoamatruda
  • [Gettr] Add extractor by i6t
  • [GoPro] Add extractor by i6t
  • [N1] Add extractor by u-spec-png
  • [Theta] Add video extractor by alerikaisattera
  • [Veo] Add extractor by i6t
  • [Vupload] Add extractor by u-spec-png
  • [bbc] Extract better quality videos by ajj8
  • [Bilibili] Add subtitle converter by u-spec-png
  • [CBC] Cleanup tests by makeworld-the-better-one
  • [Douyin] Rewrite extractor by MinePlayersPE
  • [Funimation] Fix for /v/ urls by pukkandan, Jules-A
  • [Funimation] Sort formats according to the relevant extractor-args
  • [Hidive] Fix duplicate and incorrect formats
  • [HotStarSeries] Fix cookies by Ashish0804
  • [LinkedI
... keep reading on reddit ➑

πŸ‘︎ 126
πŸ’¬︎
πŸ‘€︎ u/pukkandan
πŸ“…︎ Oct 09 2021
🚨︎ report
Sending Unicode via Tap Dance with Unicode available to normal keys Γ  la UC(c)

Alright guys so here's my situation.

I have an integer array of UCs which works for the keycodes (I guess - have yet to flash it) but does not work for sending keycodes within (tap dance) functions.

***keyboardspace&&userspace/rules.mk

UNICODE_ENABLE = yes

---



***keyboardspace/keymap.c

const uint16_t PROGMEM keymaps[][MATRIX_ROWS][MATRIX_COLS] = {

[_LAYER] = LAYOUT_ortho_3x3_wrapper (
    sthsthsth, sthsthsth, sthsthsth, \
    sthsthsth, TD_TMK   , UC_TMK   , \
    sthsthsth, sthsthsth, sthsthsth  \
)
}

---



***userspace/rumlyne.c

#include "quantum.h"
#include "process_keycode/process_tap_dance.h"
#include "rumlyne.h"

---



***userspace/rumlyne.h

#define TD_TMK TD(TMK)
#define UC_TMK UC(TMK)

enum /*tap dance*/ {
    TMK = 0
};

enum unicode_name {
    UST_TMK = 0 // β„’
};

const uint32_t PROGMEM unicode_map[] = {
    [UST_TMK] = 0x2122 // β„’
};

enum userspace_layers {
    _LAYER = 0
};



// setting UC-Mode to Windows on startup
void matrix_init_user(void) {
    set_unicode_input_mode(UC_WIN);
}

void TD_VOID_TRADEMARK (qk_tap_dance_state_t *state, void *user_data) {
  switch(state->count){
    case 1:
      tap_code(KC_A)
      break;
    case 2:
      send_unicode_hex_string(UST_TMK); // ?
      break;
    case 3:
      SEND_STRING(UST_TMK);
      break;
    case 4:
      send_unicode_hex_string("2122") // works (or at least does not output an error) but I don't want that. 
	  // I want to be able to have my unicodes in another headerfile in the future
      break;
    default:
      break;
  }
  reset_tap_dance (state); // Do I need that? Some have it some don't...
}

qk_tap_dance_action_t tap_dance_actions[] = {
    [TMK] = ACTION_TAP_DANCE_FN(TD_VOID_TRADEMARK)
};

---

Here are the compiler errors:

Making idobo with keymap rumlyne

avr-gcc.exe (AVR_8_bit_GNU_Toolchain_3.6.1_1752) 5.4.0
Copyright (C) 2015 Free Software Foundation, Inc.
This is free software; see the source for copying conditions.  There is NO
warranty; not even for MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE.

Compiling: users/rumlyne/rumlyne.c                                 
... keep reading on reddit ➑

πŸ‘︎ 3
πŸ’¬︎
πŸ‘€︎ u/rumlyne
πŸ“…︎ Jan 25 2019
🚨︎ report
When "ZoΓ«" !== "ZoΓ«". Or why you need to normalize Unicode strings withblue.ink/2019/03/11/w…
πŸ‘︎ 12
πŸ’¬︎
πŸ‘€︎ u/dobkin-1970
πŸ“…︎ Mar 27 2019
🚨︎ report
The most obscure Unicode bug ever: Old Reddit normalizes Ogham spaces (which have a line) into regular spaces when editing comments reddit.com/r/programming/…
πŸ‘︎ 12
πŸ’¬︎
πŸ‘€︎ u/CoastersPaul
πŸ“…︎ Oct 22 2018
🚨︎ report
SERIOUS: This subreddit needs to understand what a "dad joke" really means.

I don't want to step on anybody's toes here, but the amount of non-dad jokes here in this subreddit really annoys me. First of all, dad jokes CAN be NSFW, it clearly says so in the sub rules. Secondly, it doesn't automatically make it a dad joke if it's from a conversation between you and your child. Most importantly, the jokes that your CHILDREN tell YOU are not dad jokes. The point of a dad joke is that it's so cheesy only a dad who's trying to be funny would make such a joke. That's it. They are stupid plays on words, lame puns and so on. There has to be a clever pun or wordplay for it to be considered a dad joke.

Again, to all the fellow dads, I apologise if I'm sounding too harsh. But I just needed to get it off my chest.

πŸ‘︎ 17k
πŸ’¬︎
πŸ‘€︎ u/anywhereiroa
πŸ“…︎ Jan 15 2022
🚨︎ report
why does pkg.go.dev hate me?

Good morning redditing gophers (or evening, or ....). I'm new to Go but I've implemented a Unicode utilities module that appears to outperform packages like "unicode" and x/text "norm", especially my normalization package. And I have ideas that I think will improve its performance further.

The problem is that pkg.go.dev appears to have buried my modules to oblivion and I don't know why. I know they changed the search the other day, but, still, searching "unicode normalization" would bring up my unorm package waaaaaaaaay down near the bottom of the secondary list, but now I cannot find any search terms that bring it up at all. I don't expect it to appear in the first results, but I would hope that it would show up somewhere while scrolling through the secondary results. I cannot find any search terms that find it anywhere in the list.

Is there something I am doing that makes pkg.go.dev hate it so much?

For example, my normalization package is here: https://pkg.go.dev/github.com/orkvozku/go/uni@v0.0.7/unorm

At first I thought that, at least for "unicode", it may be due to putting a circle-R trademark on it. I am following the instructions at unicode.org to property trademark attribute it, even though nobody else seems to do it. I tried putting a zero-width space to help pkg.go.dev find the key word, but not luck. I then changed it to a normal space and still no luck.

p.s. I'm new to Go so if you snoop at my code, please be nice about the various stupid things you will find. Thanks. :)

πŸ‘︎ 3
πŸ’¬︎
πŸ‘€︎ u/endim
πŸ“…︎ Nov 10 2021
🚨︎ report
Normal font for unicode.

It's been 5 years since I play Minecraft, and even after that much time unicode fonts are not put in order. I already made a fix for the Russian language, but every time I update the fix annoying. I can provide a fix with a full description of the changes, I hope my request will be ignored and will finally fix unicode fonts. For those who do not understand: all unicode fonts in Minecraft have a thin base, unlike English, etc. languages.

πŸ‘︎ 18
πŸ’¬︎
πŸ‘€︎ u/YouTy6bl4
πŸ“…︎ Apr 20 2017
🚨︎ report
icu vs rust_icu

Both of these appear to be in active development, although rust_icu says that it means largely to be a stopgap until there's full native support for ICU which icu appears to be (it's developed under the aegis of unicode.org). Has anybody dug deep into this and determined what's missing from icu? Right now, my primary needs are for normalization and Unicode character categories, although I will eventually need case folding and collation.

πŸ‘︎ 14
πŸ’¬︎
πŸ‘€︎ u/dahosek
πŸ“…︎ Oct 10 2021
🚨︎ report
Need some help with Encoding!

I am trying to use PowerShell to normalize a CSV file in UTF8 format. The CSV has some miscoded info that I would like to validate and fix as part of the normalization.

Example of CSV data

------------------------------------------------

date,name

10/2/2021, ζ ͺεΌδΌšη€Ύγ‚Ήγ‚Ώγ‚Έγ‚ͺγ‚Έγƒ–γƒͺ

11/4/2012,John Smith

3/14/1995, æ ΒͺΓ₯ΒΌΒΓ€ΒΌΕ‘Γ§Β€ΒΎΓ£β€šΒΉΓ£β€šΒΏΓ£β€šΒΈΓ£β€šΒͺΓ£β€šΒΈΓ£Ζ’β€“Γ£Ζ’Βͺ

------------------------------------------------------

In the above example, I know that the name in the last row has the wrong encoding. The correct encoding would be the same as the first row, i.e. ζ ͺεΌδΌšη€Ύγ‚Ήγ‚Ώγ‚Έγ‚ͺγ‚Έγƒ–γƒͺ

Does anyone have a clue how to accomplish this in PowerShell? or how to use PowerShell to run an external Unicode 'scrubber?

Thanks!

πŸ‘︎ 2
πŸ’¬︎
πŸ‘€︎ u/ubercodex
πŸ“…︎ Nov 08 2021
🚨︎ report
Blind Girl Here. Give Me Your Best Blind Jokes!

Do your worst!

πŸ‘︎ 5k
πŸ’¬︎
πŸ‘€︎ u/Leckzsluthor
πŸ“…︎ Jan 02 2022
🚨︎ report
A visual studio code (vscode) extension for convert normal characters to unicode suberscript/subscript characters, if possible. github.com/kindlychung/su…
πŸ‘︎ 7
πŸ’¬︎
πŸ‘€︎ u/Kindlychung
πŸ“…︎ Feb 15 2018
🚨︎ report
This subreddit is 10 years old now.

I'm surprised it hasn't decade.

πŸ‘︎ 14k
πŸ’¬︎
πŸ‘€︎ u/frexyincdude
πŸ“…︎ Jan 14 2022
🚨︎ report
Dropped my best ever dad joke & no one was around to hear it

For context I'm a Refuse Driver (Garbage man) & today I was on food waste. After I'd tipped I was checking the wagon for any defects when I spotted a lone pea balanced on the lifts.

I said "hey look, an escaPEA"

No one near me but it didn't half make me laugh for a good hour or so!

Edit: I can't believe how much this has blown up. Thank you everyone I've had a blast reading through the replies πŸ˜‚

πŸ‘︎ 20k
πŸ’¬︎
πŸ‘€︎ u/Vegetable-Acadia
πŸ“…︎ Jan 11 2022
🚨︎ report
What starts with a W and ends with a T

It really does, I swear!

πŸ‘︎ 6k
πŸ’¬︎
πŸ‘€︎ u/PsychedeIic_Sheep
πŸ“…︎ Jan 13 2022
🚨︎ report
Why did Karen press Ctrl+Shift+Delete?

Because she wanted to see the task manager.

πŸ‘︎ 11k
πŸ’¬︎
πŸ‘€︎ u/Eoussama
πŸ“…︎ Jan 17 2022
🚨︎ report
pdfTeX, XeTeX, ConTeXt, LuaTeX, etc…: Which LaTeX distribution to use for 2022?

Background

I was a decently heavy amateur user of XeLaTeX between 2009–2014 (more or less), first typesetting assignments for my math and physics classes and later typesetting fiction with a length between novelette and short story. XeLaTeX was my tool of choice at the time because it allowed me to use MacOS system fonts in my documents and directly input Unicode without \inputspec[utf-8]{fontenc} (typed that from memory, so it may be subtly wrong).

However, I have not been that active in the TeX world in the following seven years or so.

I may have the itch (if not the time) to get back into typesetting fiction again. However, I now know about LuaTeX and ConTeXt. More accurately, LuaTeX hit 1.0 during my absence and ConTeXt LMTX became the new feature development branch of ConTeXt (and is mostly stable as of 2021).

General Questions as we enter 2022

  1. How do pdfTeX, XeTeX, and LuaTeX compare as engines? From my understanding, LuaTeX is the clear winner unless you have specific Unicode issues that require XeTeX. EDIT: LuaTeX being the obvious winner is the consensus answer
  2. How do ConTeXt MKIV, OpTex, LaTeX, and ConTeXt LMTX compare as formats? It seems that ConTeXt v LaTeX is mostly dependent on personal preference and occasionally by the needs of the specific project for projects with unique requirements.
  3. (I assume the answer is "yes" but I felt it needed to be asked anyway) Do the engines all support both ARM and Intel under both macOS and Linux? EDIT: Yes.
  4. Which ones have sensible ways to include LilyPond input? Answer: Both ConTeXt and normal LaTeX, through lilpond-book.

Specific Questions for my use cases

  1. How would ConTeXt compare with LaTeX for typesetting medium-length fiction (the aforementioned short stories & novellas)?
  2. If I were to make one of those stories look like a tribute to Chapters IX or XX of House of Leaves, how do the two formats compare? Namely, how does the minipage environment or its ConTeXt equivalent hold up to flipping, rotation, and other copious abuse?
  3. (For the stories that have straightforward formatting) Would there be any real difference between the two in terms of outputting BBCode for secondary publication? pandoc -t HTML input.tex | 2bbcode_hubzilla HTML > output.bb is my curre
... keep reading on reddit ➑

πŸ‘︎ 28
πŸ’¬︎
πŸ“…︎ Dec 30 2021
🚨︎ report

Please note that this site uses cookies to personalise content and adverts, to provide social media features, and to analyse web traffic. Click here for more information.