Skip to main content
Stack Overflow
  1. About
  2. For Teams

You are not logged in. Your edit will be placed in a queue until it is peer reviewed.

We welcome edits that make the post easier to understand and more valuable for readers. Because community members review edits, please try to make the post substantially better than how you found it, for example, by fixing grammar or adding additional resources and hyperlinks.

Required fields*

Draft saved
Draft discarded
Cancel
7
  • 1
    How certain are you that the file is properly encoded in UTF-8? This really seems like an improper encoding. Can you look at the bytes in a hex editor to see what's actually there? What version of R are you using? It would be helpful to have a reproducible example that we can test with and see for ourselves exactly what is in the file. Commented Jun 3, 2025 at 14:36
  • 1
    Check using hexdump 1, I think it's broken: 00000220 4e c3 83 c2 bc 72 6e 62 65 72 67 3b 32 30 31 39 |N....rnberg;2019| Commented Jun 3, 2025 at 14:50
  • 1
    @SaïdMaanan yeah, try hexdump -C APExpose_DE__2003-2022__nogeo.csv | less. Commented Jun 3, 2025 at 14:52
  • 1
    Yeah, the source csv seems to be corrupted. Maybe you can try sth dirty like df <- read.csv2("APExpose_DE__2003-2022__nogeo.csv");df$kreis <- stringr::str_replace_all(df$kreis, c("ü" = "ü", "ö" = "ö", "ä" = "ä", "ß" = "ß")) - maybe this needs a str_to_title(str) if there are mutated vowels at the beginning of a kreis. Commented Jun 3, 2025 at 15:02
  • 2
    @SaïdMaanan Zenodo isn't anonymous, data is fresh, why not pinging the authors for an update? Commented Jun 3, 2025 at 15:12

lang-r

AltStyle によって変換されたページ (->オリジナル) /