• Simon 𐕣he 🪨 Johnson@lemmy.ml
    link
    fedilink
    arrow-up
    3
    arrow-down
    1
    ·
    edit-2
    1 day ago

    There are plenty of sha1 implementations that are more readable and sensible and less readable and sensible. This portion is simply an manually unrolled loop (lmao these gcc nerds haven’t even heard of Gentoo) of the hash chunk computation rounds. Hash functions aren’t “impenetrable” they’re just math. You can write math programmatically in a way that explains the math.

    The point of this post is actually things like x[(I-3)&0x0f]. It’s entirely the same concept as coercion to manipulate index values this way. What’s funny is that void pointer math, function pointer math, void pointers and function pointers in general are typically seen as “beyond the pale” for whatever reason.

    Beyond that if you know C you know why this is written this way with the parens. It’s because C has fucked up order of operations. For example a + b == 7 is literally “does adding a + b equal 7”, but if you write a & b == 7 you would think it means “does a AND b equal 7”, but you’d be wrong. It actually means does b equal 7 AND a.

    Furthermore a & (b ==7) makes no sense because b == 7 is a boolean value. Bitwise ANDing a boolean value should not work because the width of the boolean is 1 bit and the width of the int is 8 bits. ANDing should fail because there’s 7 void bits between the two types. However the standard coerces booleans in these cases to fit the full width, coercing the void bits to 0’s to make bitwise ANDing make sense.

    Beyond that asking what the memory size of a variable in C is a fools errand because the real answer is “it depends” and “it also depends if someone decided to ignore what it typically depends on (compiler and platform) with some preprocessor fun”. Remember how I said “void pointers” are beyond the pale? Yeah the typical “why” of that is because they don’t have a known size, but remember the size of something for C is “it depends”. 🤷

    Almost every language has idiosyncratic stuff like this, but some let you make up your own shit on top of that. These kinda low hanging fruit jokes are just people virtue signaling their nerddom (JS bad am rite guis, use a real language like C), when in reality this stuff is everywhere in imperative languages and typically doesn’t matter too much in practice. This isn’t even getting into idiosyncracies based on how computers understand numbers which is what subtracting from 0x5F3759DF (fast inverse square root) references.

    • CanadaPlus@lemmy.sdf.org
      link
      fedilink
      arrow-up
      1
      ·
      edit-2
      23 hours ago

      Hash functions aren’t “impenetrable” they’re just math.

      I mean, they’re both. At a high level it’s math, but the individual operations are carefully designed to resist any symbolic manipulation (aka thought) - because that’s one step away from an attack.

      The point of this post is actually things like x[(I-3)&0x0f]. It’s entirely the same concept as coercion to manipulate index values this way. What’s funny is that void pointer math, function pointer math, void pointers and function pointers in general are typically seen as “beyond the pale” for whatever reason.

      If it compiles pretty directly to register pointer math that way, I think it could be justified here. I can’t say if it does, or if an alternate approach would too, though.

      This portion is simply an manually unrolled loop

      Lol, I didn’t notice it’s a perfect shift. Yeah, that could theoretically be done better. Presumably the justification is because it’s a leaf function, and it’s hard to guarantee every compilation will unroll it properly.

      (lmao these gcc nerds haven’t even heard of Gentoo)

      The flag -O3 exists. Or just -funroll-loops. You shouldn’t even need -funroll-all-loops in this case, since hashes have a fixed size.

      I sound way more competent with the flags than I am here, haha. Does Gentoo use an alternate compiler by default?

      Beyond that asking what the memory size of a variable in C is a fools errand because the real answer is “it depends” and “it also depends if someone decided to ignore what it typically depends on (compiler and platform) with some preprocessor fun”.

      As I understand it, that’s pretty unavoidable if you want C to both compile onto multiple processors and work at a high level the same way on all of them. JavaScript catches shit for doing funny things purely because it was hastily built.

      Ditto for fast inverse square root. It’s absolutely cursed, but when you’re at a certain low level you can’t afford pretty anymore. You’re feeling the constraint of limited die space and manufacturing steps not too far down the layers of abstraction. Browser scripting, on the other hand, is not low-level.

      • Simon 𐕣he 🪨 Johnson@lemmy.ml
        link
        fedilink
        arrow-up
        1
        ·
        edit-2
        18 hours ago

        The flag -O3 exists. Or just -funroll-loops. You shouldn’t even need -funroll-all-loops in this case, since hashes have a fixed size.

        I sound way more competent with the flags than I am here, haha. Does Gentoo use an alternate compiler by default?

        This is in reference to an ancient linux meme cw: slur

    • propter_hog [any, any]@hexbear.net
      link
      fedilink
      English
      arrow-up
      1
      ·
      edit-2
      1 day ago

      Precisely. This exact situation from the comic would happen in Java, too, except it complains when you subtract an int from a string. JavaScript was merely designed to minimize errors (since a web browser takes the place of the compiler, and random strangers visiting your site shouldn’t get interpretation errors) so instead of throwing up it just does its best at interpreting what you meant.