<?xml version="1.0" encoding="utf-8"?><feed xmlns="http://www.w3.org/2005/Atom" ><generator uri="https://jekyllrb.com/" version="3.10.0">Jekyll</generator><link href="/feed.xml" rel="self" type="application/atom+xml" /><link href="/" rel="alternate" type="text/html" /><updated>2025-10-23T05:47:00+00:00</updated><id>/feed.xml</id><title type="html">Selvan Sunitha Ravi</title><subtitle>Personal Website</subtitle><entry><title type="html">ML Interview Question Bank</title><link href="/jekyll/update/2023/04/29/ml-interview.html" rel="alternate" type="text/html" title="ML Interview Question Bank" /><published>2023-04-29T02:14:52+00:00</published><updated>2023-04-29T02:14:52+00:00</updated><id>/jekyll/update/2023/04/29/ml-interview</id><content type="html" xml:base="/jekyll/update/2023/04/29/ml-interview.html"><![CDATA[<p>Having recently spent a good portion of my time interviewing with companies for ML roles, I realised that I couldn’t find a comprehensive list of topics or questions to go through before any ML interview. <a href="https://rohankumar.github.io/">Rohan</a> and I compiled this list of questions and references as we interviewed with numerous companies for the second half of last year.</p>

<h3 id="questions">Questions</h3>

<h4 id="general">General</h4>

<ol>
  <li>What is bias-variance tradeoff? Which models exbhibit low variance?</li>
  <li>How can we avoid underfitting and overfitting?</li>
  <li>How do you deal with imbalanced data?</li>
  <li>What is the difference between L1 and L2 regularisation? Why does L1 drive weights to zero?</li>
  <li>Why do we use batchnorm?</li>
  <li>What is the difference between SGD and GD?</li>
  <li>Compare Momentum, RMSProp, Adagrad and Adam.</li>
  <li>How do you decide between standardization and normalization?</li>
  <li>What is the reparameterization trick?</li>
  <li>What is Xavier init and why do we use it?</li>
  <li>When no. of features is greater than number of  examples, why do we not have a unique solution to minimize the residual sum of squares?</li>
  <li>What is inductive bias?</li>
  <li>How can we reduce the dimensionality of a large dataset? (PCA)</li>
  <li>Express the bias variance decomposition as MSE loss.</li>
  <li>What is the effect of batch size on training?</li>
</ol>

<h4 id="models">Models</h4>
<ol>
  <li>What are the drawbacks of K-means algorithm?</li>
  <li>How can you choose the optimal k for K-means?</li>
  <li>In KNN, if k=1, does it have high variance or bias?</li>
  <li>Compare bagging and boosting methods.</li>
  <li>What is weakly supervised learning?</li>
  <li>What are the basic assumptions to be made for linear regression?</li>
  <li>Why would we not pass ordinal values to a model for categories?</li>
  <li>What is the Kernel trick for SVMs?</li>
  <li>How do you do dropout in RNNs? How does it work during test time?</li>
  <li>What are the assumptions we make for Naive Bayes algorithm?</li>
  <li>What is the difference between Naive Bayes and Logistic Regression?</li>
  <li>In linear regression, why do we use sum of squares?</li>
  <li>What is the gradient boosting algorithm? Difference between Gradient boosted Decision Trees and Random Forests?</li>
  <li>What is the difference between SVMs and Logistic Regression? Write down the loss functions.</li>
  <li>What is the difference between Collaborative and content based filtering?</li>
</ol>

<h4 id="evaluation">Evaluation</h4>
<ol>
  <li>What is Type I and Type II error?</li>
  <li>What is precision, recall, F1 and ROC curve?</li>
  <li>How do you do k-fold validation for time series data?</li>
  <li>How do we evaluate multi-label classification problems?</li>
</ol>

<h3 id="useful-references">Useful References:</h3>
<ol>
<li> Chip Huyen's <a href="https://huyenchip.com/ml-interviews-book/">book</a> on ML interviews gives a comprehensive overview of the interview process. </li>

<li> Patrick Halina has a really good <a href="http://patrickhalina.com/posts/ml-systems-design-interview-guide/#ml-systems-questions">Systems Design guide</a>. I have used this as a reference for every System Design interview I have ever done. </li>

<li> CS 229 <a href="https://stanford.edu/~shervine/teaching/cs-229/"> Cheatsheets </a> are useful for a quick revision of concepts. </li>

<li> If you are interviewing for NLP roles, I would recommend reading the <a href="https://arxiv.org/abs/1810.04805">BERT paper </a> and <a href="https://arxiv.org/abs/1706.03762"> Transformer paper </a>before the interview. </li>






<!-- Jekyll also offers powerful support for code snippets:


<figure class="highlight"><pre><code class="language-ruby" data-lang="ruby"><span class="k">def</span> <span class="nf">print_hi</span><span class="p">(</span><span class="nb">name</span><span class="p">)</span>
  <span class="nb">puts</span> <span class="s2">"Hi, </span><span class="si">#{</span><span class="nb">name</span><span class="si">}</span><span class="s2">"</span>
<span class="k">end</span>
<span class="n">print_hi</span><span class="p">(</span><span class="s1">'Tom'</span><span class="p">)</span>
<span class="c1">#=&gt; prints 'Hi, Tom' to STDOUT.</span></code></pre></figure>


Check out the [Jekyll docs][jekyll-docs] for more info on how to get the most out of Jekyll. File all bugs/feature requests at [Jekyll’s GitHub repo][jekyll-gh]. If you have questions, you can ask them on [Jekyll Talk][jekyll-talk].

[jekyll-docs]: https://jekyllrb.com/docs/home
[jekyll-gh]:   https://github.com/jekyll/jekyll
[jekyll-talk]: https://talk.jekyllrb.com/ -->
</ol>]]></content><author><name></name></author><category term="jekyll" /><category term="update" /><summary type="html"><![CDATA[Having recently spent a good portion of my time interviewing with companies for ML roles, I realised that I couldn’t find a comprehensive list of topics or questions to go through before any ML interview. Rohan and I compiled this list of questions and references as we interviewed with numerous companies for the second half of last year.]]></summary></entry></feed>