I work on a new algorithm for Artificial Intelligence, AMA | BestofAMA

triangle up vote

277

triangle down vote

[removed]

Comments: 164 • Responses: 51 • Date: 2021-02-01 07:22:14 UTCsource

lowkey-goddess102 karma2021-02-01 08:15:16 UTC

Former ML engineer here. How does this differentiate from traditional ML approaches like decision trees? If it differentiates a lot, then what approach does it most resemble?

Second, can it handle large, multidimensional datasets with complex relationships, similar to deep learning approaches? Does it have forward/back propagation or some training mechanism?

Lastly, and more of a comment, I feel like AI is a misnomer for what's actually happening in the field. It automates a particular decision in a limited domain, and leverages statistical techniques and usually concepts in linear algebra and calculus to garner sometimes useful results, especially when the computing is distributed.

God, I must sound pretentious in my last paragraph. I'm just a tad tired of hearing the term AI thrown around and it misleading folks. I like people who build stuff, and this seems interesting.

View History Share Link

Over_Intention334222 karma2021-02-01 08:38:13 UTC

I will start from the end: Yes, the term AI is slapped on everything a bit too easily. But I think about it in the same way think about electricity. Years ago it was only about lighting a bulb, now it's semiconductors. So, despite both being 'electricity' one is more advanced than the other. It's similar to AI now, we are just at lighting a bulb stage. A tree in Primeclue can be thought of as a complicated equation, so it's hard for me to compare it to existing ML approaches. It can handle large datasets, it does not have gradient descend but it "learns" from existing "trees" and uses their parts to find a better solution.

View History Share Link

lowkey-goddess26 karma2021-02-01 08:57:57 UTC

I upvoted the downvote to keep this thread alive. I hate to be the grumpy skeptic, but may you point to some approach or a paper that inspired this? Anything in the world that you can concretely analogize related to computer science/machine learning/applied mathematics? I want to understand, but I'm not getting much.

Some of the equations and architectures in deep learning can get pretty complicated for some people in the field, and it sometimes takes me a few reads to fully digest a paper, no doubt. But, calling it complex for the sake of it simply being complex doesn't help us understand what this is. We are using the same mathematical principles to build these architectures. What might they be, specifically?

View History Share Link

Over_Intention33420 karma2021-02-01 09:01:46 UTC

Sorry, I came up with this on my own, although I'm sure someone has tried something similar before (monkeys on typewriters). I can't really point you to any paper describing this.

View History Share Link

lowkey-goddess18 karma2021-02-01 09:22:19 UTC

I'm not doubting your originality, I believe you. I'm asking about the principles you used to build your algo. Would it help if I went through your source code and asked questions about a particular function/class and what it accomplishes in your program?

View History Share Link

Over_Intention33429 karma2021-02-01 09:31:38 UTC

Yes, that would be much more productive.

View History Share Link

bye-lingual1 karma2021-02-01 10:15:49 UTC

(monkeys on typewriters)

Is this a reference to the philosophy of monkeys writing down the whole script of Shakespeare? If so I think I like you and you're incredible for starting (edit: meant stating not starting) that it's not your idea but rather anyone could come up with it, let alone with a typewriter, eventually (:

View History Share Link

Over_Intention33424 karma2021-02-01 10:24:52 UTC

That's exactly what I was getting at. Checking all research to see if someone tried it before would have taken longer than actually doing this "again".

View History Share Link

dangerous_9999 karma2021-02-01 09:30:03 UTC

Isn't a neural network just an equation too?

View History Share Link

Over_Intention33422 karma2021-02-01 09:34:39 UTC

I suppose so.

View History Share Link

quohr45 karma2021-02-01 08:18:53 UTC

... from your GitHub:

“- Split data randomly into training and testing sets.

Train classifiers on training data for 1 minute.
Take the best classifier and note its result on test data.
Repeat above steps 20 times.
Record median result on test data.”

I certainly hope you understand that you CANNOT use the test data in the process of determining which model to use. You go even further and “repeat 20 times”, then take the MEDIAN??

Why choose 20 times and not 10 or 30?
How can you claim your method avoids overfitting?
Have you tried using a validation AND test set instead?

EDIT: OP’s approach is an ensemble of learners and doesn’t touch the test set during training. Thanks for clarifying OP

View History Share Link

Over_Intention33425 karma2021-02-01 08:39:31 UTC

Primeclue does not look at test data during training. Test data is used for evaluating performance of what it learnt. I never claimed it avoids overfitting.

View History Share Link

AnArtistsRendition5 karma2021-02-01 08:54:47 UTC

You need a separate validation dataset for that. The test set should only be used for final evaluation.

Edit: Thanks op for the pointer! Seems good to me anyway

View History Share Link

Over_Intention33428 karma2021-02-01 09:04:07 UTC

The test set is only used for final evaluation. To get better grasp at what is happening see the code at https://github.com/lukaszwojtow/primeclue/blob/dev/backend/primeclue-api/examples/test_training.rs and check where test_data is used.

View History Share Link

Pumpernickelthethird1 karma2021-02-01 09:26:51 UTC

It seems like you use the test data to evaluate the accuracy of one "generation" of trees as you call it. If I understand your approach correctly, then you produce lots of random trees, prune them and pick the ones with the highest score, similar to a random forest method, right?

But since you measure the accuracy of one tree against the test score before picking the best one you're using knowledge about the test set in your final predictive model, effectively producing a highly overfit tree that cannot gerneralize whatsoever.

View History Share Link

Over_Intention33422 karma2021-02-01 09:32:41 UTC

No, I train on training data for one minute, many generations. At the end of training I build final classifier (one tree for each class) and test it on test_data.

View History Share Link

Pumpernickelthethird3 karma2021-02-01 10:13:08 UTC

Alright, I'm not very proficient in Rust so I may have misinterpreted your code. Still, I see a lot of problems with this approach, the use of primitive math functions as tree nodes which seems kind of a random and computationally inefficient thing to use, the lack of detail on how data is prepared, the striking similarity to decision trees and random forests, the general simplicity of the process and your explanations, etc.

I don't intend to be a naysayer without delivering any solid proof explicitly pointing out faults in your code, but I don't have the time to review your project thoroughly enough. I'd advice you to post your project to more specialized communities like /r/machinelearning in order to get some input by people proficient in the field instead of posting to /r/IAma where you won't find much technical knowledge.

Anyway, I like your dedication and creativity and hope you'll keep at it and create more interesting and non-traditional stuff in the future.

View History Share Link

Over_Intention33422 karma2021-02-01 10:16:18 UTC

Yes, r/machinelearning was my initial choice, but they said IAma is better for self promotion. If someone else reposts it there, I will gladly answer all questions.

View History Share Link

quohr3 karma2021-02-01 08:51:51 UTC

Okay I see, it’s an ensemble approach - my mistake.

Why do you choose 20 rerun cycles and not, for example, 50? Have you tested accuracy vs. total number of repetitions?
Why train for “one minute” each time? This would lead to different periods of training depending on the system that the end user is on (e.g, 2000 MacBook Pro vs. a supercomputer)

View History Share Link

Over_Intention33422 karma2021-02-01 08:55:12 UTC

Re: Why 20? Median seems quite stable when I do 20 runs. Re: Why 1 minute? This is to have some reliability as to when the training ends. Usually people don't know how long 'one epoch' will take, but they know they need answer within certain 'human' time.

View History Share Link

quohr11 karma2021-02-01 09:05:58 UTC

If you plan on publishing this, I recommend doing a formal test of repetition versus accuracy. I’d imagine it would plateau after some amount depending on whatever factors are involved (particular application, training set size, etc.)

I get what you mean, but computers don’t operate on equivalent timescales. Imagine training your method for a minute using AiMOS versus on a 1990s Macintosh haha.

Plus, the less subjective the better :)

View History Share Link

Over_Intention33421 karma2021-02-01 09:08:46 UTC

Thanks

View History Share Link

t0b4cc021 karma2021-02-01 10:47:34 UTC

Usually people don't know how long 'one epoch' will take, but they know they need answer within certain 'human' time.

haha love you dude. you are one of the few practical people who code. i often got annoyed that epochs is the default run and you have to get out of your way in most ml systems to make it time or something else

View History Share Link

Over_Intention33421 karma2021-02-01 10:55:29 UTC

Thanks

View History Share Link

CeladonBadger21 karma2021-02-01 08:08:05 UTC

It doesn’t really sound that different from traditional nn. Is it capable of categorisation without basically creating a new model for each class? Does it always have to combine 2 inputs in each node? Is it capable of processing different input size data? It definitely sounds like an interesting project but also like a bit of novelty. No offence, I might be missing something crucial there and I’d love to know more.

View History Share Link

Over_Intention33421 karma2021-02-01 08:14:10 UTC

Is it capable of categorisation without basically creating a new model for each class? Well, each class has its own tree that answers either 'yes' or 'no' Does it always have to combine 2 inputs in each node? Some nodes take one argument and apply one argument function (i.e. sqrt) I’d love to know more Hence provided source code.

View History Share Link

noelexecom16 karma2021-02-01 08:39:29 UTC

How is this different enough from traditional methods to warrant an AMA? So far what you describe sounds like 50 year old stuff.

View History Share Link

Over_Intention3342-7 karma2021-02-01 08:39:56 UTC

Sorry to be of a disappointment.

View History Share Link

Whatever4M12 karma2021-02-01 08:21:43 UTC

Honestly this sounds a lot like a normal neural network. To make it a question: What would you say is the fundemental difference between your work and an average neural network ?

View History Share Link

Over_Intention334212 karma2021-02-01 08:29:58 UTC

It doesn't use activation functions nor gradient descent.

View History Share Link

serifmasterrace4 karma2021-02-01 10:39:35 UTC

If all the nodes are linear operations, the function that the tree is modeling can be collapsed into the form wX+b.

Then we’d just be solving least squares with extra steps right? There’s already a fast analytical solution. Or is there something else I’m missing something here?

View History Share Link

Over_Intention3342-2 karma2021-02-01 10:44:09 UTC

I don't think it can be collapsed to wX+b.

View History Share Link

serifmasterrace3 karma2021-02-01 11:03:49 UTC

Any combination of linear operators can be collapsed into the form wX+b.

For example, if you have a tree representing (2X[1]+ 3X[2]) * 4 + 5, it's no different from wX+b where X = matrix([X[1], X[2]]), w = [8,12], b = 5.

max(a,b) is just a constrained linear program.

e^x and x^i are nonlinear, which are operations represented by activations in neural nets.

Your tree is creating some extra linear operations that could be simplified down to greatly improve runtime. Maybe try that, but the solution space being learned won't be different from that of a neural net

View History Share Link

Over_Intention33421 karma2021-02-01 11:07:32 UTC

Redit removed my post so I probably won't continue this thread here. However I'd like to continue conversation with you. If you feel like it, please contact me via email. Thanks

View History Share Link

Rubscrub11 karma2021-02-01 09:04:28 UTC

Hi, so I read your github. But isnt this just vefy similar to a ann but with random functions and trees instead of activation functions and backwards propegation?

I would think that by creating random trees and hoping one performs well you're very unlikely to reach a global optimum or even a local optimum. So how does the performance and training time compare to traditional methods?

View History Share Link

Over_Intention33422 karma2021-02-01 09:08:11 UTC

Perhaps it's similar to other approaches with a lot of "buts". Performance is better at some problems (stocks, sports betting) and worse at others (mnist fashion).

View History Share Link

mandown230810 karma2021-02-01 07:32:24 UTC

Are you doing it alone? Why your project differs from DL?

View History Share Link

Over_Intention33421 karma2021-02-01 07:34:23 UTC

Yes. It's completely different algorithm. I don't use any ML/AI libraries as shortcuts.

View History Share Link

quohr27 karma2021-02-01 08:06:34 UTC

Not using other libraries doesn’t have anything to do with whether what you’ve developed is or isn’t DL though.

View History Share Link

Over_Intention33421 karma2021-02-01 08:25:54 UTC

Correct. What I meant is that approach is a bit different.

View History Share Link

wiwerse8 karma2021-02-01 07:44:36 UTC

What lead you down this path?

How did you get started?

How long do you think it is until it's launch ready?

How long have you been working on it?

View History Share Link

Over_Intention33426 karma2021-02-01 07:51:43 UTC

I had enough with Java

Simple idea for processing data, then I looked for the right programming language

There won't be an official "launch". It works for me just fine.

Over a year, mainly evenings and some weekends.

View History Share Link

Miseryy8 karma2021-02-01 08:06:40 UTC

Your description of your algorithm seems to suggest it can make decisions in logarithmic time and space, for all inputs, since you describe the input as originating from leaves that merge paths. It's essentially the reverse of an exponential tree.

How would you expect your algorithm to perform on problems that cannot be compressed to a logarithmic number of conjunctive statements/functions? I.e. np hard problems

View History Share Link

Over_Intention33425 karma2021-02-01 08:25:06 UTC

Can you give an example of such problem with example data? I will take a look

View History Share Link

GuyARoss4 karma2021-02-01 08:47:21 UTC

subset sum could be one- so given a set {1,23,4,51,21} find n numbers that could produce the sum of a given value OR as close as possible; so this algorithm needs to take into account a precision value as well.

ive tried solving this optimization with a supervised approach before with pretty poor results, so im also curious what your algorithm would yield.

View History Share Link

Over_Intention33420 karma2021-02-01 08:59:47 UTC

Primeclue can do label classification. I'm not sure what label should be in your example. Can you elaborate?

View History Share Link

Excel075 karma2021-02-01 07:56:00 UTC

In this field, what kind of Mathematics is a must-know?

View History Share Link

Over_Intention33427 karma2021-02-01 08:00:45 UTC

It depends, for example decision trees do not require calculus or any such.

View History Share Link

aetr3yu3 karma2021-02-01 07:36:23 UTC

Would you recommend Python over everything else?

View History Share Link

Over_Intention33423 karma2021-02-01 07:39:35 UTC

Depends on what you're trying, but at the beginning Pythons seems like a save choice.

View History Share Link

koalefant3 karma2021-02-01 08:00:06 UTC

What does primeclue think about GameStop? Should I buy more or just hold on to the ones I have 🙌

View History Share Link

Over_Intention33426 karma2021-02-01 08:04:02 UTC

I've never run GameStop values through this software so I don't know.

View History Share Link

quohr3 karma2021-02-01 08:07:13 UTC

[deleted]

View History Share Link

Over_Intention33421 karma2021-02-01 08:24:00 UTC

How does primeclue determine how to approximate the solution space? What do you mean? How do you ensure that your approach does not overfit? That's unsolved, itsn't it? Primeclue splits training data into two parts and only one part is used for actual training (something like n-fold validation) Under what circumstances do you believe primeclue would offer an advantage .. ? There is an example called 'test_training'. For some reason TensorFlow fails it miserably but Primeclu gives like 60+ % correctness. Also, it seems like predicting stock market runs works better with Primeclue.

View History Share Link

Excel073 karma2021-02-01 07:25:46 UTC

What is the best programming language for machine learning and why?

View History Share Link

Over_Intention33428 karma2021-02-01 07:35:10 UTC

If you have to ask I would say python. Simple, has TensorFlow and others.

View History Share Link

bsnshdbsb2 karma2021-02-01 08:48:30 UTC

Complete noob here. How do I even start working on this field? What should be my path or approach? Should I learn every bit of ML or master a specific. Appreciate any feedback.

View History Share Link

Over_Intention33421 karma2021-02-01 08:57:33 UTC

Learn a bit of Python and then do some courses, like TensorFlow on Coursera.

View History Share Link

diamondketo2 karma2021-02-01 08:57:18 UTC

How are the function nodes in the trees built? Does the user specifies them based on their model or does your algorithm learn to choose the best functions?

Are the non-data leaf nodes free parameters?

If yes, how does your algorithm optimize and estimate the best parameter? I understand your algorithm prunes the tree; how does free parameter and pruning come together; do you optimize the free parameter first then prune?
If no, how does your algorithm choose the best parameter (e.g., why e, why pi?)

View History Share Link

Over_Intention33421 karma2021-02-01 09:05:03 UTC

User does not need to specify anything. It all starts randomly.

View History Share Link

melancholic_inertia1 karma2021-02-01 10:21:06 UTC

[deleted]

View History Share Link

Over_Intention33421 karma2021-02-01 10:25:10 UTC

No.

View History Share Link

retrorectum1 karma2021-02-01 07:47:59 UTC

Thanks for doing this and it sounds interesting. I have couple of questions: 1. What's the false positive rate you are going for to be able to use it? 2. How much data will be used 3. What are some cases you are personally worried about to be able to have a good success rate on it?

View History Share Link

Over_Intention33420 karma2021-02-01 07:54:37 UTC

Depends on the problem.
It can process ten of thousands of rows and still have reasonable results reasonably quickly.
Hmmm... It's not so good on MNIST fashion data set (around 85% accuracy). Probably because there is so many points to look at.

View History Share Link

jpropaganda1 karma2021-02-01 08:06:00 UTC

Have you heard of conducto? Do you think that would increase your pipeline processing? www.conducto.com

View History Share Link

Over_Intention33421 karma2021-02-01 08:26:10 UTC

I've never heard of it, thanks for the link.

View History Share Link

umop_apisdn1 karma2021-02-01 08:12:07 UTC

Do you have to define the complete architecture - ie size of the binary tree and the function at each node - beforehand, or are these learnt as well?

View History Share Link

Over_Intention33421 karma2021-02-01 10:38:36 UTC

User does not need to define architecture, although there is some control over things like how often a tree creates a branch, how deep initial trees are and so on.

View History Share Link

Kyloman1 karma2021-02-01 08:40:15 UTC

What advice to you have for people wanting to learn how to create their own Artifical Intelligence projects?
I am decently adept at coding, but it's such a huge and complicated topic I have no idea where to start.

View History Share Link

Over_Intention33421 karma2021-02-01 08:40:51 UTC

Read a lot to get some creative juices flowing.

View History Share Link

diamondketo1 karma2021-02-01 08:51:44 UTC

Why don't you write a technical paper in a statistics journal? Get peer-reviewed by a career statistician.

View History Share Link

Over_Intention33423 karma2021-02-01 08:55:33 UTC

I'm not interested in scientific career, I'm mainly a programmer.

View History Share Link

diamondketo2 karma2021-02-01 09:00:39 UTC

Are you not interested in validating whether your algorithm is (1) new and (2) works better than neural network and classification decision trees?

View History Share Link

Over_Intention33424 karma2021-02-01 09:06:28 UTC

Neither. I'm only interested to make work better and be useful to me and others.

View History Share Link

mvsopen1 karma2021-02-01 07:42:20 UTC

Where is AI and ML heading? And when will we be forced to adopt a code of ethics for future AI development?

View History Share Link

Over_Intention33425 karma2021-02-01 07:57:05 UTC

Code of ethics for AI my seem like artificial brakes on what it's capable of, so I hope ethics must be on human side during application of AI results.

Edit: What I meant is: I hope we won't have to hard code ethics into AI, we as humans must be more careful how we apply AI.

View History Share Link

thekillerdonut2 karma2021-02-01 08:41:21 UTC

It has already been demonstrated that people will not responsibly apply machine learning AI (some examples listed here), either intentionally or because they aren't aware that it isn't a perfectly accurate system. In light of this, as the person creating this type of technology, do you feel an ethical responsibility to apply ethics while you still have control over it?

I realize this is a fairly pointed question. I ask because I was very interested in going into AI research while I was in college, but the more I learned what people did with this type of technology, the more contributing to it deeply violated my own code of ethics.

View History Share Link

Over_Intention33421 karma2021-02-01 08:58:41 UTC

Absolutely yes!

View History Share Link

LaChicaGo1 karma2021-02-01 08:18:27 UTC

What is your favourite programming language? How do you feel about DataRobot and other "black box" programs?

View History Share Link

Over_Intention33422 karma2021-02-01 08:31:30 UTC

Definitely Rust. I've never heard of DataRobot

View History Share Link

eyegazer4441 karma2021-02-01 08:20:03 UTC

Have you heard of Replika? How is your algorithm better or different to that?

View History Share Link

Over_Intention33421 karma2021-02-01 08:28:29 UTC

Never heard of it.

View History Share Link

ex_D0T41 karma2021-02-01 09:59:19 UTC

Hello, I'm not very familiar with things like this. Is there anything you can suggest to get started with coding? I see a lot of courses but I would like to hear from someone who codes. I've been interested but I can't find something to start with without feeling like I'm doing something wrong.

View History Share Link

Over_Intention33422 karma2021-02-01 10:03:12 UTC

Nothing better than actually getting started. There are plenty of manuals for Python for example.

View History Share Link

ex_D0T41 karma2021-02-01 10:10:37 UTC

Is there one you'd say is the best? Or are they all virtually the same?

View History Share Link

Over_Intention33422 karma2021-02-01 10:14:09 UTC

Sorry, I've learnt Python as my fifth or sixth language so I didn't use a manual. But it seems like this:

https://www.learnpython.org/

is ok. Good luck!

View History Share Link

thenielser1 karma2021-02-01 10:07:16 UTC

Are you planning on writing an actual research paper instead of a github page?

Would be nice to see a clear and concise paper explaining the differences and theoretical background.

View History Share Link

Over_Intention33421 karma2021-02-01 10:08:04 UTC

No such plans. Mainly because code changes often, any paper would be outdated before it's finished.

View History Share Link

bunch_of_particles0 karma2021-02-01 07:49:41 UTC

What is your thought on the importance of ethics in AI?

View History Share Link

Over_Intention33425 karma2021-02-01 08:02:02 UTC

Because computers are soulless, humans must do more to make up for it.

View History Share Link

dingoateyobaby0 karma2021-02-01 09:28:44 UTC

People misuse the words AI on things that are not truely AI. I believe if AI doesn't gather and manipulate the data by itself than it's not an AI. Is your project a true AI or simply a "program"?

View History Share Link

Over_Intention33421 karma2021-02-01 09:33:29 UTC

It won't gather data for you, you need to feed it with data first. So by your definition, it's just a program.

View History Share Link

PolarisLodestar-1 karma2021-02-01 07:49:22 UTC

In the midst of crypto/blockchain mania, what do you think of GNY? It’s the first decentralized machine learning system. They’ve been in development for over 2 years and are launching the Mainnet this week!

View History Share Link

Over_Intention33422 karma2021-02-01 07:55:11 UTC

I'v never heard of it, thanks for the link.

View History Share Link

thesearcherofgold-1 karma2021-02-01 08:00:49 UTC

What kind applications do you aim this AI to fit in? Is a human-like virtual girlfriend within the realm of possibilities?

View History Share Link

Over_Intention33423 karma2021-02-01 08:03:01 UTC

This is more about getting answers from data.

View History Share Link

Smile_in_the_mirror-1 karma2021-02-01 07:48:11 UTC

What are the chances of an AI becoming self aware?

View History Share Link

Over_Intention33421 karma2021-02-01 07:58:05 UTC

Quite high, but we aren't there yet.

View History Share Link

kieronhix-2 karma2021-02-01 08:21:28 UTC

Do you think we should be cautious of a potential AI uprising like Elon Musk claims he’s afraid of?

View History Share Link

Over_Intention33422 karma2021-02-01 08:28:16 UTC

No.

View History Share Link

Internal-Lifeguard51-6 karma2021-02-01 07:30:17 UTC

Do you think your soul has entered your program? I know RNA and other genetic material transfers “memories” through generations. Do you think your AI could unknowingly be channeling your own being?

View History Share Link

Over_Intention33422 karma2021-02-01 07:35:41 UTC

Never thought about it. Intriguing.

View History Share Link