HN comments for: I recreated Shazam’s algorithm with Go

halfmatthalfcat

42 replies

1d5h

2024-08-01 13:18:27 UTC

FYI - If this is a true reproduction of Shazam, it’s under patent by Apple through at least March 2025[1].

[1] https://patents.google.com/patent/US7627477

Someone

8 replies

1d3h

2024-08-01 15:12:52 UTC

“An Industrial-Strength Audio Search Algorithm”, the paper where Shazam describes their algorithm (https://www.ee.columbia.edu/~dpwe/papers/Wang03-shazam.pdf) doesn’t have a clear publication date, but https://www.researchgate.net/publication/220723446_An_Indust... indicates it is from 2003.

That patent was filed in the US on 2004-10-21.

IANAL, but to me, that’s a point against that patent in the USA.

kajecounterhack

4 replies

2024-08-01 17:33:19 UTC

How is it a point against the patent if the paper was written by someone under the employment of Shazam? Isn't the point of the patent to award the innovator with the right to profit from the innovation?

mgillett54

1 replies

2024-08-01 18:24:55 UTC

You only get a one year grace period after first public discloser to file for a patent in the US. So if the dates in this scenario are: - paper in 2003 - patent 2004-10-21

Only if the paper was released between 2003-10-22 and 2003-12-31 would it meet the one year grace period requirement.

Looks like it’s not relevant in this case since they got a provisional patent in 2002, but that’s likely what the above was referring to as “a point against that patent”.

kajecounterhack

0 replies

31m

2024-08-02 17:59:25 UTC

Thank you! That's elucidating

mananaysiempre

0 replies

1h53m

2024-08-02 16:37:09 UTC

[T]he point of the patent [is to] award the innovator with the right to profit from the innovation

... in return for disclosing said innovation for use by others (in the patent) instead of keeping it secret. If the disclosure has already happened without the prospect of royalties (such as in a journal article), then the deal is off. But indeed various places have grace periods for that.

Someone

0 replies

2024-08-01 18:25:59 UTC

That it was written by someone under the employment of Shazam makes it likely that it describes their algorithm, but for patent protection, what matters is that you can’t apply for a patent for an invention that has been published.

https://www.science.org/content/article/patent-first-publish...: “According to U.S. law, a patent cannot be obtained if an invention was previously known or used by other people in the U.S., or was already patented or published anywhere in the world. Furthermore, publicly using or selling an invention more than 1 year prior to filing a patent application completely bars you from ever winning a patent on that invention.

[…]

In Europe, for instance, there is no 1-year grace period--the chances of winning patent protection is lost the instant an invention becomes public”

tsimionescu

0 replies

4h30m

2024-08-02 14:00:22 UTC

You should note that the USA even today still has a one year grace period between when you first published an idea and when you can valodly file for a patent for that idea. So if they published it in June 2003 but applied for the patent in April 2004, then the patent is well within the grace period and the paper doesn't constitute prior art.

tkuraku

0 replies

20h58m

2024-08-01 21:32:53 UTC

As long as you have submitted the patent before publication you are good to go.

refibrillator

0 replies

2024-08-01 17:37:21 UTC

Actually the provisional patent # 60/376,055 was filed on April 25 2002.

So if the paper was indeed published in Oct 2003 then all is well.

shoggouth

6 replies

1d3h

2024-08-01 14:33:07 UTC

I was under the impression that patents aren’t enforceable on open source projects. I assume I am wrong?

colejohnson66

4 replies

1d3h

2024-08-01 14:52:55 UTC

Yes. Just because something is open source doesn't mean it's unbound by IP laws.

ndriscoll

3 replies

1d1h

2024-08-01 17:14:32 UTC

It's not whether "IP" laws apply; it's whether source code itself is in scope for patents. Source code is a description of an algorithm, which in principle is what the software patent is supposed to be providing anyway. Patents shouldn't be relevant here for the same reason they aren't relevant to what you write in e.g. a textbook on signal processing where you might find an exact description of how Shazam works.

Compiling and running that source code on your computer/as part of a wider system may violate a patent, but my impression was that patents are not relevant to the actual code. Are there test cases in the US around pure source distribution of a patented algorithm? Particularly post-Alice?

concerndc1tizen

2 replies

23h55m

2024-08-01 18:35:37 UTC

I wonder if the massive amount of open source software can now be used as elaborate 'prior art' in a way that basically invalidates any software patent that is awarded after the source had been made available?

I.e. if any algorithm was already implemented, in some variation, then the patent is not valid?

For example, for the infamous Amazon 'one click purchase', if a similar pattern was used, maybe a 'one click start vacuum cleaner robot', would it that patent then be invalid?

rvnx

1 replies

23h25m

2024-08-01 19:05:59 UTC

Depends how much money you have.

For example, Tesla just patented the Robotaxi one year ago, despite having open-source solutions like Apollo Self-Driving platform.

Patents are really an obsolete system that favors the super-rich and the lawyers.

HPsquared

0 replies

9h51m

2024-08-02 08:39:21 UTC

I'd be surprised if they patented the general concept of a robotaxi. Almost certainly, it'd be one specific type they they could say they invented.

lights0123

0 replies

1d3h

2024-08-01 15:24:51 UTC

Patents are enforceable in the US for even personal use without distribution.

VierScar

5 replies

1d4h

2024-08-01 13:50:56 UTC

I dunno how software patents work but I was under the impression that unless you basically copy paste their code, the courts wouldn't consider it patent infringement as you can't patent the function, but rather the specific thing itself which for software is the exact code itself. But if I'm not understanding something please correct me.

mcfedr

1 replies

2024-08-01 18:07:19 UTC

Software patents are magic, you just start your process with 'on a computer do X' and because computers are a piece of hardware you can patent anything you like

subpub47

0 replies

17h2m

2024-08-02 01:28:26 UTC

I'm still upset that my "you can't patent software because of the Curry-Howard isomorphism" legal argument never took off. (Basically, software is equivalent to math, and you can't patent math. Therefore, you can't patent software. QED.)

wlesieutre

0 replies

1d4h

2024-08-01 14:01:33 UTC

You’re thinking of copyright, which covers a specific creative expression. Patents are more general on how something is done and would cover different code that works the same way.

dawnerd

0 replies

1d2h

2024-08-01 15:50:49 UTC

Overly simplistic: patents is the process not strictly design.

Edit: just to be clear there are design patents too, but I don’t think they’d be granted for software.

amelius

0 replies

1d4h

2024-08-01 13:55:39 UTC

This cannot be true because if you copied a physical design and made a few irrelevant modifications then it's still infringement.

amelius

4 replies

1d4h

2024-08-01 13:45:41 UTC

Figure 1 looks interesting since it has both a time and frequency axis, when usually signals have either a time __or__ frequency axis.

Now I'm curious how the Fourier (?) transform of a signal at a __single__ given timepoint is even defined ...

earthnail

2 replies

1d2h

2024-08-01 16:21:38 UTC

Jamessb already linked to the right terms. One thing to add is that there is always a tradeoff between time and frequency resolution on short time Fourier transforms. You just can’t have both. It’s always a somewhat unsatisfying tradeoff that still works well in practice.

01HNNWZ0MV43FF

1 replies

2024-08-01 17:37:30 UTC

I found this out when I was trying to turn an ordinary 5 dollar thrifted musical keyboard into a midi controller by plugging it into my PC, putting it on "sine wave" and using a Goertzel detector

The latency for detecting audio-frequency waves is quite bad

This also stymied my desire to put digital audio onto a vinyl record :( literally not enough bandwidth

wbl

0 replies

15h47m

2024-08-02 02:43:51 UTC

By definition there is vinyl quality worth of bandwidth on the record.

jamessb

0 replies

1d4h

2024-08-01 14:19:33 UTC

The concepts you are looking for are the short-time Fourier transform and spectrogram:

https://en.wikipedia.org/wiki/Short-time_Fourier_transform

https://en.wikipedia.org/wiki/Spectrogram

Thaxll

4 replies

1d3h

2024-08-01 15:03:25 UTC

I remember a popular HN post from 10 years ago, that was pulled or the source was pulled because Shazam legally threatened the disclosure of the algorithm. I think it's actually the Google drive file pdf capture from OP's article.

refibrillator

3 replies

1d3h

2024-08-01 15:28:55 UTC

Here is a link to that HN thread from 2013:

https://news.ycombinator.com/item?id=5723863

ivolimmen

0 replies

11h56m

2024-08-02 06:34:12 UTC

It was a Dutch guy. I have seen his talk on this something with fast Fourier transform and getting sued. Was fun

dang

0 replies

23h21m

2024-08-01 19:09:04 UTC

Thanks! Macroexpanded:

Creating Shazam in Java (2010) - https://news.ycombinator.com/item?id=32530056 - Aug 2022 (36 comments)

Patent infringement claim re: “Creating Shazam in Java” blogpost (2010) - https://news.ycombinator.com/item?id=9594480 - May 2015 (18 comments)

Creating Shazam in Java - https://news.ycombinator.com/item?id=5723863 - May 2013 (43 comments)

Implementing Shazam with Java in a weekend - https://news.ycombinator.com/item?id=1702975 - Sept 2010 (23 comments)

Longer list of relateds here: https://news.ycombinator.com/item?id=41132462.

dang

0 replies

23h22m

2024-08-01 19:08:20 UTC

Related. Others?

How Shazam Works (2003) [pdf] - https://news.ycombinator.com/item?id=40029036 - April 2024 (29 comments)

How does Shazam work? (2022) - https://news.ycombinator.com/item?id=38531428 - Dec 2023 (154 comments)

An Industrial-Strength Audio Search Algorithm (2003) [pdf] - https://news.ycombinator.com/item?id=33299853 - Oct 2022 (1 comment)

Creating Shazam in Java (2010) - https://news.ycombinator.com/item?id=32530056 - Aug 2022 (36 comments)

Shazam turns 20 - https://news.ycombinator.com/item?id=32520593 - Aug 2022 (227 comments)

How Shazam Works (2015) - https://news.ycombinator.com/item?id=23806142 - July 2020 (7 comments)

Designing an audio adblocker - https://news.ycombinator.com/item?id=18855029 - Jan 2019 (186 comments)

Show HN: A radio/podcast adblocker featuring ML and Shazam-like fingerprinting - https://news.ycombinator.com/item?id=18459058 - Nov 2018 (2 comments)

Apple has completed its acquisition of Shazam - https://news.ycombinator.com/item?id=18066724 - Sept 2018 (316 comments)

Apple Buys Shazam to Boost Apple Music - https://news.ycombinator.com/item?id=15899065 - Dec 2017 (156 comments)

Apple is close to acquiring Shazam, sources say - https://news.ycombinator.com/item?id=15881896 - Dec 2017 (292 comments)

Show HN: Shazam-like acoustic fingerprinting of continuous audio streams - https://news.ycombinator.com/item?id=15809291 - Nov 2017 (76 comments)

How Shazam Works (2015) - https://news.ycombinator.com/item?id=15350729 - Sept 2017 (13 comments)

Tell HN: Shazam picks up song from my kitchen light - https://news.ycombinator.com/item?id=11593305 - April 2016 (2 comments)

How Shazam works - https://news.ycombinator.com/item?id=9870408 - July 2015 (48 comments)

Patent infringement claim re: “Creating Shazam in Java” blogpost (2010) - https://news.ycombinator.com/item?id=9594480 - May 2015 (18 comments)

The Shazam Effect (2014) - https://news.ycombinator.com/item?id=9593429 - May 2015 (37 comments)

The Shazam Effect - https://news.ycombinator.com/item?id=8634357 - Nov 2014 (34 comments)

Ask HN: Is there an audio search technology that finds exact and similar audio? - https://news.ycombinator.com/item?id=8420141 - Oct 2014 (3 comments)

Source code example of the Shazam algorithm - https://news.ycombinator.com/item?id=5724442 - May 2013 (16 comments)

Creating Shazam in Java - https://news.ycombinator.com/item?id=5723863 - May 2013 (43 comments)

An Industrial-Strength Audio Search Algorithm (Shazam) - https://news.ycombinator.com/item?id=2621103 - June 2011 (4 comments)

Shazam's Search for Songs Creates New Music Jobs - https://news.ycombinator.com/item?id=2215295 - Feb 2011 (1 comment)

How does the music-identifying app Shazam work its magic? - https://news.ycombinator.com/item?id=2214992 - Feb 2011 (2 comments)

How Shazam Works To Identify (Nearly) Every Song You Throw At It - https://news.ycombinator.com/item?id=1727891 - Sept 2010 (1 comment)

Implementing Shazam with Java in a weekend - https://news.ycombinator.com/item?id=1702975 - Sept 2010 (23 comments)

Shazam: not magic after all - https://news.ycombinator.com/item?id=909263 - Oct 2009 (28 comments)

How does the music-identifying app Shazam work its magic? - https://news.ycombinator.com/item?id=893353 - Oct 2009 (16 comments)

Shazam Has 50 Million Users and Secures Investment From KPCB - https://news.ycombinator.com/item?id=882537 - Oct 2009 (13 comments)

maeln

3 replies

1d4h

2024-08-01 13:57:49 UTC

So not enforceable anywhere else than in the U.S.

concerndc1tizen

1 replies

23h59m

2024-08-01 18:31:25 UTC

Any seller of software would be liable for selling software to US customers without a patent license.

I'm curious about the legal consequences of freely distributed software (e.g. open source). I wonder if the author/provider could be held liable if they: - knowingly (passively) or actively market to US customers (e.g. provide support) - are aware that US users are using it, and take no actions to prevent its distribution etc.

Can someone share their knowledge on this?

If European software incidentally infringes a US patent, and it is distributed freely, is the provider then liable? E.g. is Github basically liable for restricting US users from access to (distributing) patent infringing software?

kube-system

0 replies

23h5m

2024-08-01 19:25:06 UTC

As I understand, you can violate a patent just by importing something infringing. Selling it isn't necessarily a requirement for infringement.

https://blog.shp.law/index.php/2021/03/28/open-source-softwa...

I'm sure there's tons of patent violations in FOSS, but patents are usually just used to go after companies with big pockets.

colejohnson66

0 replies

1d3h

2024-08-01 14:52:22 UTC

There are other patents for their implementation, and all are filed in multiple countries. Look at the blue sidebar under "Worldwide applications".

The linked one is just for the US version of a singular patent. It had applications in 14 other countries and WIPO, 6 of which are still active (plus US).

csmpltn

3 replies

2024-08-01 18:23:21 UTC

It's:

(1) deriving a simple fingerprint from the FFT of the audio signal

(2) simple indexing

(3) simple similarity search

You need the signatures of all music on earth for this to work though ;)

selcuka

1 replies

11h0m

2024-08-02 07:30:11 UTC

You need the signatures of all music on earth for this to work though ;)

Not that difficult with a Spotify subscription. It will take some time, though.

j33zusjuice

0 replies

2h19m

2024-08-02 16:11:08 UTC

Would-be Neil Young fans will be left wanting if Spotify is the only reference.

adelpozo

0 replies

14h35m

2024-08-02 03:55:26 UTC

all the music on earth… I would presume that adding a searchable fingerprint to every torrent link would do it

throwaway2562

0 replies

1d4h

2024-08-01 14:29:11 UTC

Soon!

acedTrex

0 replies

1d3h

2024-08-01 15:24:49 UTC

So you are saying to clone the repo now

vegabook

8 replies

1d5h

2024-08-01 13:06:32 UTC

Recently found Shazam is less accurate - somehow soundhound is giving me better results. On Shazam I'm getting a lot of results from Asian musical traditions which is great, if it wasn't the wrong song. Maybe they need to improve the algo if they've increased the range of music they will select from? Seems now there's a lot more hash table collision[1].

  [1] https://github.com/cgzirim/not-shazam?tab=readme-ov-file#resources--card_file_box

cglan

6 replies

1d5h

2024-08-01 13:09:22 UTC

Soundhound has always been better than Shazam. It can even pick up people singing and extremely quiet songs

jedberg

4 replies

1d2h

2024-08-01 15:38:22 UTC

In my sample of one song, I have to disagree. I played Watermelon by Mezerg, which is admittedly not very popular, and Soundhound couldn't get it with two tries, but Shazam picked it up in less than two seconds.

robbomacrae

3 replies

2024-08-01 17:41:49 UTC

I work at SoundHound. If it didn't get it in two tries it's likely we didn't have that song in the database. Both Shazam and SH have knowledge gaps.

IndySun

2 replies

20h14m

2024-08-01 22:16:27 UTC

I've championed Soundhound but it has literally stopped working (finding any tune) on my iphone. I've reinstalled, still nothing. It does not appear to 'hear' anything.

RockRobotRock

1 replies

16h59m

2024-08-02 01:31:38 UTC

Is it possible the microphone permission was turned off?

IndySun

0 replies

7h44m

2024-08-02 10:47:00 UTC

It's a fair question. Out of an abundance of caution I just toggled it. Definitely on. In fact, in doing so I have today noticed some correlation between swapping between Shazam and Soundhound and mic behaviour. That is to say, Shazam appears to hog the mic and not relinquish it to Soundhound upon request. If all apps are quit, then I launch Soundhound, I can get a response. But if I happen to then use Shazam, which always works, Soundhound fails until I do the double quit.

bsder

0 replies

20h45m

2024-08-01 21:45:24 UTC

Shazam seems to have a way bigger database than Soundhound.

For a while, it seems like Soundhound was about to shut down. It wouldn't match anything released in the last 12-18 months, but that seems to be fixed.

lucgommans

0 replies

19h59m

2024-08-01 22:31:17 UTC

I compared Shazam's, SoundHound's, and BeatFind's recognition library in August 2021. (And tried MusixMatch but it crashed on startup apparently.) Don't think I published it anywhere, these are my raw notes I found among saved chat messages. The format is starting to make more sense now that I'm putting it into a wider window than a chat screen, so I can recommend using a wide browser (94 characters per line should do it). Eyeing the song choices, it looks like I tried to find different genres and artist types (ccMixer/youtube celebrities, to indie, to established) but a larger sample size would obviously have been even better. Still, I hope it's one step up from adding another random opinion!

The conclusion appears to be that BeatFind and Shazam know the most songs, but are also somewhat complementary and all of the services had at least one song they uniquely recognised.

---

    Fun facts:
     * Night Driver (W) said "1 Shazams". I think I was the first person to ever Shazam that. Some of the most obscure things had hundreds, often thousands of shazams!
     * You know where they are taking the hobbits but none of the services do!

    ========

    - ABC = found the song
    - # = number of attempts
    - f = exceptionally fast matching (when it did match, might not be first attempt)
    - ~ = knew one of the songs

    BeatFind:     2B  C     1E    2G 2H  1I  2Jf 2Lf 4M 2Nf 2Of 1Pf 1Rf ~S 2T  1Uf 1Vf 1W 1Xf Y Z
    SoundHound: A 1B        1E 2F 1G 2H  2If     1L         2O  3P  2R     1Tf 2U
    Shazam:       1Bf    1D 1E 1F 1G 1Hf 1I  2J  1L  1M 4N  1O  1Pf 1Rf    1Tf 1Uf 1V  2W 1X  Y Z
    MusixMatch: crashes on startup, presumably it realizes it won't be able to show me ads

    missingno
    Shazam:     A C K Q S
    SoundHound: C D F J K M/N Q S V W X Y Z
    BeatFind:   A D F K Q

    non-universal finds (repeated letter = unique = counts double; slash means same artist so should be counted as one)
    Shazam:     DD F J M/N V X Y Z
    SoundHound: AA F
    BeatFind:   CC J M/N S V X Y Z

    A: Levan Polkka Epic Orchestral Cover version
    B: Pokémon red/blue soundtrack
    C: Mayhem (various songs, it seems either they have all or they have none)
    D: Art Now ft. Snowflake
    E: Hero's Choice
    F: Three Days Grace - Scared
    G: Syrian - Supernova
    H: The Explosion - Here I Am
    I: The Von Bondies - C'mon C'mon
    J: Frank Klepacki - Scouting (C&C TibSun)
    K: Conspiracy - Chaos Theory (demoscene)
    L: Cheshyre - Madness6 (remix) (Newgrounds ID 77998)
    M: Dimrain47 - Twilight Techno
    N: Dimrain47 - Cloud Control
    O: DragonForce - My Spirit Will Go On
    P: Yuki Kajiura - The First Town (SAO)
    Q: THEY'RE TAKING THE HOBBITS TO ISENGARD! THE HOBBITS- THE HOBBITS- TO ISENGARD! TO ISENGARD!
    R: Faithless - Insomnia
    S: Age of Empires 1 soundtrack
    T: Moulin Rouge - El Tango De Roxanne
    U: Van Canto - Master of Puppets
    V: Slack Bird - Jouni
    W: Floppytronic - Night Driver
    X: EgoSalad / Kitboga - Breathe in
    Y: Floppy Drive music: top 4 hits on yt: sweet dreams, imperial march, ghostbusters, beat it. Only ghostbusters was known to any
    Z: Obsidian Shell - Orphanage

---

Note that what I did not test/review introducing noise (like people talking through it) or filtering (like when you hear the music through a wall)

jokoon

5 replies

2024-08-01 18:09:38 UTC

This is useless unless you have all the songs on earth

Algorithm don't matter, only data matters

lucb1e

1 replies

19h38m

2024-08-01 22:52:25 UTC

That's like saying the Hutter prize is useless for anyone who doesn't want highly compressed versions of Wikipedia. The underlying code or algorithm is still interesting to study, use, and remix.

zerr

0 replies

6h19m

2024-08-02 12:11:49 UTC

AFAIK the underlying algorithm had been implemented multiple times with accompanying explaining blog posts and articles. So this is yet another iteration, done for fun. Nothing wrong with it, just nothing really useful/new I believe.

0cf8612b2e1e

1 replies

23h32m

2024-08-01 18:58:27 UTC

Although, would be curious how good you could get to isolating to a single artist. If you had say one exemplar fingerprint per artist, could an out of dataset fingerprint from their discography cluster to that artist? Obviously not for artists who transitioned musical styles.

Or is the algorithm more feature hash than a clusterable feature vector?

ccgzirim

0 replies

15h21m

2024-08-02 03:09:15 UTC

Isolating a single artist based on a fingerprint sounds challenging but interesting.

Using exemplar fingerprints, a representative sample of an artist's music, is a good approach, but success would require detailed fingerprints, a varied dataset, and a well-chosen algorithm.

For artists who change styles, time-series analysis can capture their evolving sound.

The solution will likely need machine learning.

The current solution doesn't use feature hashing or clusterable feature vectors. Instead, it relies on audio fingerprinting, which breaks down short audio samples into unique patterns or "fingerprints" for quick comparison with a large database of known songs.

nwsm

0 replies

23h33m

2024-08-01 18:57:16 UTC

Here we have an open-source algorithm that is useful to anyone with data. It doesn't have to be music

KomoD

5 replies

1d5h

2024-08-01 12:47:06 UTC

If you insert Spotify songs, wouldn't it make more sense to output Spotify songs too?

ccgzirim

4 replies

1d5h

2024-08-01 13:04:05 UTC

It would actually. But Spotify doesn't allow direct downloads so I had to find the songs on YouTube and download them from there.

crtasm

1 replies

22h23m

2024-08-01 20:07:52 UTC

Note that YouTube's ToS doesn't allow this either, be aware what you're potentially getting into by releasing a tool that rips music from there.

lucb1e

0 replies

19h32m

2024-08-01 22:58:18 UTC

Uploaders can choose a video license. Since there is no download button for those videos, I've always wondered if that is a creative commons' violation and therefore Youtube should run into the same legal issue that StackOverflow currently is (ctrl+f "violat" https://meta.stackexchange.com/questions/401324/announcing-a...), namely that this terminated google's (but nobody else's) license to use the video under these creative commons terms

written-beyond

0 replies

1d3h

2024-08-01 14:55:22 UTC

You're a G

subpub47

0 replies

17h0m

2024-08-02 01:30:29 UTC

All Spotify sees is bytes going to your computer. What happens to those bytes afterward is your own business.

euroderf

3 replies

2024-08-01 18:05:07 UTC

Run it as a daemon that displays every song in a UI notification ?

theabhinavdas

2 replies

23h53m

2024-08-01 18:37:19 UTC

You deserve reddit gold for this idea

nwsm

1 replies

23h32m

2024-08-01 18:58:25 UTC

I love spyware!

euroderf

0 replies

23h26m

2024-08-01 19:04:55 UTC

B-b-but... source code included !

ascorbic

3 replies

11h50m

2024-08-02 06:40:58 UTC

This is cool, but you urgently need to change the name.

montag

0 replies

11h22m

2024-08-02 07:08:51 UTC

Why does OP urgently need to change the name?

dmichulke

0 replies

8h45m

2024-08-02 09:45:33 UTC

Notch of Sam?

Not jus' AM?

ccgzirim

0 replies

10h42m

2024-08-02 07:48:19 UTC

A. SeekSound vs B. SoundSeek vs C. SoundScout

Which would be a better alternative?

DandyDev

3 replies

1d3h

2024-08-01 14:34:18 UTC

Isn’t the whole point of Shazam that you don’t know the song and want to find it? If you don’t know the song, hoeven you provide a Spotify link?

zild3d

1 replies

1d3h

2024-08-01 14:38:55 UTC

this is a demo of the algorithm, not a full app / hosted service using it with a pre-populated database. The spotify link would be to fingerprint the song and add it to the database

ccgzirim

0 replies

1d3h

2024-08-01 14:47:16 UTC

You're right. The Spotify link is used solely to get details about the song. These details are then used to search for and download the song from YouTube. Afterward, the fingerprint for the song is created and added to the database.

paxys

0 replies

1d2h

2024-08-01 15:52:07 UTC

The idea is that you add every Spotify song in the database, and then run your match against them.

Cieric

3 replies

1d2h

2024-08-01 15:58:20 UTC

While the project does look nice to use and modify. I'm not sure I personally would have posted it yet.

- The instructions seem not to be the best to get it up and running (e.g. "cd not-shazam" and just a few lines later "cd not-shazam/client")

- MongoDB is needed but information on how to hook it up/use it are absent (I would make the DB swapable and provide something less intrusive like sqlite)

- If replacing MongoDB is not possible, I would provide a dockerfile and a docker compose to allow easy startup and testing.

- The client npm install has 8 critical vulnerabilities, these might not actually matter but it makes me hesitant to continue testing

- You might not care about the patent or the copyright, but I would still change the name at the very least. Github itself is located in the US and will remove the project if they receives a DMCA.

- Last, this might not be as important, I would add a way to add songs from wav files. Not everything I'd want to test this with is on spotify or youtube.

I'm not saying this to discourage you or anything, I just think the project needs that little extra bit of polish. Minor things will cause people to discredit or ignore a project. If I get around to it I might make a PR for the project. I want to experiment with audio matching outside of the music space, and your project seems like it'll be the easiest to modify.

Edit: Formatting

ccgzirim

1 replies

22h47m

2024-08-01 19:43:29 UTC

Thank you for the time you took to provide such detailed feedback. I really appreciate your honest input. You've raised some valid points that I hadn't really considered.

I agree that the project could definitely use some polishing. I'll prioritize improving the setup instructions and look into adding a file-based DB for flexibility, as well as resolving the npm vulnerabilities. Adding support for directly fingerprinting wav files is a great idea and something I'll prioritize, too.

Regarding the project name, I understand the potential legal implications and will definitely change it. I'd appreciate any suggestions you might have.

I'm excited about the possibility of your contributions. Please, feel free to open a PR whenever you're ready.

Thanks again for your feedback!

bravura

0 replies

4h32m

2024-08-02 13:58:27 UTC

Why not use sqlite instead of mongodb?

riiii

0 replies

19h8m

2024-08-01 23:22:53 UTC

You post it on HN to get invaluable comments like yours. Good writeup!

wmichelin

2 replies

2024-08-01 17:59:10 UTC

Hardcoded sleeps for some reason, nice /s

https://github.com/cgzirim/not-shazam/blob/888070f3434acbc0a...

rvnx

0 replies

23h50m

2024-08-01 18:40:13 UTC

It's to go around the ban of the IP / account by Spotify and to be softer with them, you have to wait between two requests to download songs.

lucb1e

0 replies

19h26m

2024-08-01 23:04:52 UTC

I also use sleep a lot in my code when interfacing with third-party services (multiprocess usually so it's not blocking things, though I'd also totally see myself using a callback pattern or so if the caller can handle those). When it's more than an ad-hoc piece of code, it generally measures how long ago the previous request finished to determine how long to sleep if the next call is made within the cool-off period. If you're not doing that... please don't interface with my server

Philip-J-Fry

2 replies

1d2h

2024-08-01 16:28:07 UTC

I think you've leaked your developer key here... https://github.com/cgzirim/not-shazam/blob/main/spotify/yout...

zadokshi

0 replies

1d1h

2024-08-01 17:00:49 UTC

Does this mean he could accidentally get a $1 million credit card bill from google from someone using his key without his permission? (I don’t know how it works with google.)

ccgzirim

0 replies

1d1h

2024-08-01 17:20:01 UTC

Oops... Thank you. I've disabled it.

yazmeya

1 replies

1d3h

2024-08-01 14:39:59 UTC

I enjoyed this talk at the DAFx17 conference by Avery Wang, co-founder of Shazam. It goes a little into the theory behind the algorithm, and looks at some of the more practical issues (background noise, etc.): https://www.youtube.com/watch?v=YVTnj3OIhwI

DevX101

0 replies

1d1h

2024-08-01 17:09:14 UTC

Adding this to the watch list. Reading this paper was one of the first times I got a 'wow' moment around computing algorthms.

strongly-typed

1 replies

1d1h

2024-08-01 17:13:46 UTC

This is really cool. I’ve been itching to try building this exact kind of thing as part of my bucket list.

ccgzirim

0 replies

1d1h

2024-08-01 17:29:14 UTC

Thanks. I'm glad it inspires you! It'd be awesome to see you take it on. You can clone it and develop it further.

renierbotha

1 replies

6h12m

2024-08-02 12:18:19 UTC

Haven't crawled through the repo (yet) but quick question - where does the data that is being searched over come from? Are you loading a library or searching some large library acquired from somewhere else?

ccgzirim

0 replies

2024-08-02 18:23:29 UTC

The data comes from a database of fingerprints connected to the server. These fingerprints are created whenever songs are added.

pjs_

1 replies

16h37m

2024-08-02 01:53:46 UTC

Shazam's technology came in part out of CCRMA, which is a very cool and special place on Stanford Campus, with deep connections to early computer history.

I think it is very interesting that so many of the early applications of computer technology have to do with audio. John Bardeen's music box, the first commercial application of the transistor in hearing aids, the HP garage in Palo Alto was originally building audio oscillators, the iPhone evolved from the iPod, the internet was built on copper made to carry analog telephone calls, Bell Labs (ping!), the list goes on.

A friend of mine has the hypothesis that maybe human beings end up figuring out how to do kHz stuff before they go on to do MHz/GHz stuff. Not a perfect explanation but kind of attractive...

crngefest

0 replies

12h11m

2024-08-02 06:19:55 UTC

IMHO it’s because audio is „easy“ to manipulate electronically.

You can transform every audio signal into an electronic signal relatively easy - for graphics there is so much more complexity involved just in making them visible.

A speaker that translates electronic signal into sound waves is a super simple contraption at its core.

Edit:/ and audio is striking - it has a profound effect on every human (except deaf of course). If I wanted to demonstrate the power of electronics/computers I would choose audio as well.

johnneville

1 replies

36m

2024-08-02 17:54:26 UTC

I'd love a way to use local files instead of spotify/youtube to create the set of fingerprints that is searched.

ccgzirim

0 replies

13m

2024-08-02 18:17:23 UTC

I've added that to my to-do list and plan to implement it this weekend.

hactually

1 replies

1d5h

2024-08-01 13:12:36 UTC

really decent and nicely done Golang! I'll pull and play with it tomorrow!

ccgzirim

0 replies

1d3h

2024-08-01 15:03:19 UTC

Thanks! I appreciate the compliment on my Golang; This is actually my first full-fledged project with the language, haha. Feel free to reach out if you have any issues running it.

bravura

1 replies

4h31m

2024-08-02 13:59:41 UTC

It would be quite nice if there were a community-based way of sharing fingerprints.

johnneville

0 replies

39m

2024-08-02 17:51:30 UTC

I think musicbrainz supports this https://musicbrainz.org/doc/AcoustID

blackeyeblitzar

1 replies

23h37m

2024-08-01 18:53:43 UTC

I’ve heard that the Google phones have a built in music recognition feature that is the best implementation of this stuff. Anyone know what their approach was? Apart from that I always have felt Soundhound was better than Shazam

lucb1e

0 replies

19h27m

2024-08-01 23:03:14 UTC

Iirc there was some small algorithm, or perhaps even a piece of hardware, that should trigger when music is playing so that the phone isn't active all the time. From there, I guess they could use any old detection algorithm; for me, the magic was in this super-energy-efficient bit of the chain, though I never read up on the details (if they ever provided any)

anticristi

1 replies

1d2h

2024-08-01 15:54:05 UTC

I wonder how long until someone will simply smoosh a billion songs into a "large song model" and make all signal processing knowledge irrelevant.

immibis

0 replies

1d2h

2024-08-01 16:11:35 UTC

You mean Suno?

scoot

0 replies

7h54m

2024-08-02 10:36:27 UTC

Shazam is historically interesting, but Google's "hum to search" algorithm is far superior, and even that is nearly four years old (since production).

msie

0 replies

2024-08-01 18:18:14 UTC

I enjoyed reading the Go source. As opposed to the time I had to read some Ruby code.