Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Prism vs Highlight.js: Why choose one over the other? #3625

Open
joshgoebel opened this issue Sep 16, 2022 · 20 comments
Open

Prism vs Highlight.js: Why choose one over the other? #3625

joshgoebel opened this issue Sep 16, 2022 · 20 comments
Labels
bug help welcome Could use help from community parser

Comments

@joshgoebel
Copy link
Member

joshgoebel commented Sep 16, 2022

Related: #3619 #3621 #3623

In trying to figure out the direction of the project moving forward, where we go, how we should or shouldn't integrate with other parsers (at the 1st party level), it'd be helpful to know WHY people prefer Highlight.js to Prism - or how they think about the differences. No detail is too large or too small. Or if you evaluated other highlighting solutions (than just these two), those thoughts would be welcome here also - provided you have a few points of comparison.

The parsing engines are of course very different. But I've never really thought about which is "better" in an absolute sense... I've just always assumed that a mono-culture is bad - and that it's good we have two great choices for highlighting on the client-side - and that people obviously have reasons for preferring one over the other.

Suddenly I'm much more curious about the WHY, and what those reasons are. Anyone care to share?

Lets keep this civil and respectful... it's ok to share perceived negatives, buts lets about things like "Highlight.js sucks because" or vise versa, etc...

@joshgoebel joshgoebel added bug help welcome Could use help from community parser labels Sep 16, 2022
@joshgoebel
Copy link
Member Author

I'll start:

  • Highlight.js often has more readable grammar definitions (for non regex wizards) - this is one of the reasons that drew me to it.
  • Prism seems to have more community interest/contributors.
  • Highlight.js is currently largely closed to 1st party grammars (too few maintainers).
  • Prism seems more open, though I'm not sure how often new grammars are added.

@joshgoebel
Copy link
Member Author

@RunDevelopment How would you feel about some sort of cross-posting of this?

@joshgoebel joshgoebel pinned this issue Sep 16, 2022
@idleberg
Copy link
Contributor

idleberg commented Sep 16, 2022

Prism seems to have more community interest/contributors.

This is pure speculation, but I have the feeling that Prism's popularity is tied closely to that of its author.

Highlight.js often has more readable grammar definitions (for non regex wizards)

As a contributor to both projects, I wanted to share my opinion on this. For obvious reasons, I prefer reading (and writing!) grammar definitions as used by Highlight.js. I would love to see some numbers, on how much optimized RegEx patterns contribute to the performance of Prism. If there's a significant gain, it makes sense, but developer experience is an important factor as well. Personally, I've been using retrie to optimize my contributions to Prism. In my opinion, that's a clear task for a computer and it kind of surprises me that contributions are higher Prism, because optimized Regex patterns are requested by the maintainers and not part of an automatic process. I've noticed for myself, that I'm slower at updating my Prism grammar rules because of this extra step.

@RunDevelopment
Copy link

How would you feel about some sort of cross-posting of this?

I just did :)

It might be good to more clearly signpost this as a survey where we are collection feedback from the community.

@RunDevelopment
Copy link

because optimized Regex patterns are requested by the maintainers and not part of an automatic process

Sorry, that's my fault.

I did request manually optimized keyword lists in the past, and that was honestly stupid. They were not only hard to read, write, and maintain, they were also slower than many "unoptimized" regexes. Turns out, browsers optimize character runs, so breaking up words too much can make regexes slower. Optimizing regexes based on tries sounds good in theory, but unfortunately rarely helps in practice. The old saying "premature optimization is the root of all evil" rang true once again. So I've changed course for a few years now and focus on readable regexes instead. Most of my usual nits and critiques when reviewing regexes were even automated by making a linter.

I know this isn't a Q&A, but I still wanted to respond to it, because it was directly my fault.

@JoshyPHP
Copy link

For me, the number one reason to use Highlight.js over anything else is the automatic language detection. I maintain a PHP library that's mostly used in forum software such as phpBB and Flarum. In my experience, most users don't fill in name of the programming language when posting code blocks. Without auto-detection or some kind of default, language-agnostic rules, there wouldn't be any highlighting.

@MBoegers
Copy link
Contributor

I came to hightlight.js because I use Aciidoc-revealjs to create my slide decks and the has the best support for hightlight.js. I never worked with prism and only contributed some new keywords to hightlight, but I am think I am better with the "simple" approach taken by hightlight ;-)

@SalvatoreNoschese

This comment was marked as off-topic.

@isaacwein
Copy link

dose Prism have all these themes?

@matheusmoreira
Copy link

matheusmoreira commented Sep 3, 2023

I think a major weakness of both syntax highlighters is they're based on regular expressions. Most if not all of the languages they highlight are not regular languages. They run into issues with things such as nesting.

A syntax highlighter having proper context-free parsers would be a major reason to prefer one over all others. An earley parser with grammar data for all supported languages, for example.

@kyle-belle
Copy link

kyle-belle commented Nov 7, 2023

I am testing out react-syntax-highlighter which offers both highlight.js and prism and for my use case highlight.js is much faster. I am basically trying to have editable code with live syntax highlighting like how its done on discord.

Prisma has better jsx/tsx support but its not worth it for the difference in speed.

highlight.js does seem to support jsx/tsx a fair amount but a small thing that messes it up is something like this

import { Comp_ } from "@component";
import { cn } from "../../Utils/cn";
import { useDarkMode } from "usehooks-ts";

const Comp = <T,/*<- this trailing comma as well as this comment*/>({className, ...props}: Comp_.Props<T>) => {
    return (
        <Comp_ {...props} classname={cn("w-full", className)} />
    );
}
 
export {DatePicker};

@cyfung1031

This comment was marked as outdated.

@joshgoebel
Copy link
Member Author

joshgoebel commented Dec 15, 2023

I find this last comment a bit confusing/misleading on several points... so I'm hiding it. Neither library uses a simple lexer/tokenizer (that walks the code one character at a time).

@cyfung1031
Copy link
Contributor

cyfung1031 commented Dec 15, 2023

I find this last comment a bit confusing/misleading on several points... so I'm hiding it. Neither library uses a simple lexer/tokenizer (that walks the code one character at a time).

Thanks for the information.

From the https://highlightjs.org/, I cannot find any information mentioning the plugin part.
No demo with line numbers.

The discussion you mentioned seems to be lengthy. Any easy way to make highlights to the operators and punctuations etc like PrismJS?

@marcellino-ornelas
Copy link

marcellino-ornelas commented Feb 15, 2024

I chose to use highlight.js in my react application for the following reasons:

I liked highlightJS but unfortunately didn't end up fitting our needs for the various reasons:

  • Hard to find plugins and none listed on https://highlightjs.org/
  • Plugin system was hard to use, lacked functionality, and I couldn't find any good examples.

@badrelmers
Copy link

Prisme does not support language autodetection PrismJS/prism#1313
so highlightjs is the winner, i could not find anything that does autodetection except this closed source thing
https://torchlight.dev/

so the choice is easy for me
image

thank you highlightjs for your great work.

@joshgoebel
Copy link
Member Author

Plugin system was hard to use, lacked functionality,

I think we have a few good hooks, but I'd be open to adding more if they were well thought out. Open to suggestions.

@wiredprairie
Copy link

wiredprairie commented Sep 5, 2024

FYI: It looks like Prism isn't being maintained anymore unfortunately. The last activity was > 2 years ago (Aug 2022). So, for anyone who's starting fresh and choosing a (new) highlighter, that may be an important factor worth considering.

Update: There was progress in a v2 branch until 7 months ago (Feb 2024).

@marcellino-ornelas
Copy link

@joshgoebel sorry for the late reply. It been a while now, totally forgot what troubles i was running into with plugins. If I have spare moment in the next couple of weeks, ill try it out again and write something up for you 🙌🏽

@Chasmical
Copy link

I considered using Prism and Highlight.js for my blog website, but encountered issues with TypeScript grammars in both. In the end, I chose Shiki, it's a heavier and slower highlighter, that's more suited for server-side rendering, rather than client-side.

TypeScript issues in Prism (repro):

  1. Interfaces in TypeScript can extend multiple other types, but Prism only highlights the first base type. (see issue, 2023)
  2. The nested types aren't highlighted correctly, — only the parent/namespace part is highlighted.
  3. The type parameters of generic types and functions would often get highlighted as "constants", since they often have short uppercase-only names.
  4. Prism doesn't differentiate null/undefined values from null/undefined types.
  5. Prism doesn't highlight typed variables and parameters. Prism's TypeScript support is surprisingly lacking. (see issue, 2018)

TypeScript issues in Highlight.js (repro):

  1. Generic functions' names aren't highlighted correctly, when their type parameters are specified explicitly right after the name.
  2. If a generic function's name starts with an uppercase letter, it's highlighted as a type.
  3. Highlight.js doesn't differentiate between HTML tags and React Components in JSX.
  4. When reading JSX, Highlight.js uses XML's grammar, and JSX attributes aren't interpreted correctly. (although, to be fair, the grammar is called "typescript", not "tsx")
  5. Punctuation and operator tokens aren't highlighted (I've seen them on highlight.js' docs website though, so maybe I'm missing something in my setup). And most identifiers (parameters, untyped variables) don't have any classes.

I've compared Prism, highlight.js and Shiki in more detail in my blog post, with highlighting samples, size and speed comparison.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
bug help welcome Could use help from community parser
Projects
None yet
Development

No branches or pull requests